Pati, Peeta Basa and Raju, Sabari S and Pati, Nishikanta and Ramakrishnan, AG (2004) Gabor filters for Document analysis in Indian Bilingual Documents. In: International Conference on Intelligent Sensing and Information Processing., 4-7 January 2004, Chennai, India, pp. 123-126.
Reasonable success has been achieved at developing monolingual OCR systems in Indian scripts. Scientists, optimistically, have started t o look beyond. Development of bilingual OCR systems and OCR systems with capability t o identify the text areas are some of the pointers to future activities in Indian scenario. The separation of text and non-text regions before considering the document image for OCR is an important task. In this paper, we present a biologically inspired, multi-channel filtering scheme for page layout analysis. The same scheme has been used for script recognition as well. Parameter tuning is mostly done heuristically. It has also been seen t o be computationally viable for commercial OCR system development.
|Item Type:||Conference Paper|
|Additional Information:||©2004 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.|
|Department/Centre:||Division of Electrical Sciences > Electrical Engineering|
|Date Deposited:||25 Aug 2008|
|Last Modified:||19 Sep 2010 04:12|
Actions (login required)