Browsing by Subject "Optical Character Recognition"
Now showing items 1-2 of 2
-
Chaudhuri, Anirban Ray; Singh, Debnath; Nasipuri, Mita; Basu, Dipak Kumar (INFLIBNET Centre, February 2, 2005)[more][less]
Abstract: The transformation of a scanned paper document into an editable form suitable for further processing such as desktop publishing or archiving in a digital library is a complex process. It requires solutions to several problems – document analysis by acquiring knowledge of document layout by a Page Layout Analyzer (PLA), followed by document recognition, which mainly comprises text recognition by Optical Character Recognition (OCR). Besides these two, another important problem is document reconstruction by transforming content into an electronically editable format by keeping the original layout intact. Core OCR modules exist on different Indian scripts, but no such document reconstruction system is available for Indian scripts. The document reconstruction system reported in this paper is the first of its kind on Indian scripts and it addresses document reconstruction for Bengali document images. The system makes use of the knowledge of both document layout extracted by a PLA in a graphical user interface (GUI) and the results of text recognition steps performed by OCR for transformation of paper documents into Rich Text Format. URI: http://hdl.handle.net/1944/495 Files in this item: 1
05cali_4.pdf (528.0Kb) -
Durrani, Omar Khan; Shet, K C (INFLIBNET Centre, February 2, 2005)[more][less]
Abstract: In order to bridge the digital divide between the sightless and sighted and to encourage literacy in them, we have designed an architecture for transcribing Braille from optically recognized Indian language. The system will help to convert masses of information in different Indian languages into a tactile reading form. The system mainly consists of OCR modules designed in an efficient manner to promote portability and scalability. In first section, we have introduced the importance and necessity of the work with successive sections clarifying briefly the properties of Braille and Indian scripts. We have also described the OCR work done with respect to Indian languages and the related work to our system. Finally, the System architecture is explained clearly followed by some conclusion and future work. The paper also identifies the needs to be fulfilled to percolate the benefit of the technology developed to the masses. URI: http://hdl.handle.net/1944/494 Files in this item: 1
05cali_3.pdf (373.8Kb)
Now showing items 1-2 of 2