Abstract
We present a robust and computationally efficient algorithm for the classification of blocks of bilevel machine-printed documents into text and halftone categories. It uses a simple mask that makes use of the different correlation properties between the text and halftone regions, and has comparable or better performance than more sophisticated and computationally intensive spectral analysis techniques. The proposed algorithm is a key component of a document recognition system that segments a document into regions, classifies them into text, halftone, line-art, etc., and then analyzes the regions to obtain a document interpretation. The input data are unusually challenging: multilingual, unoriented (e.g., upside down), and range from ideal (machine-generated) images to very low quality (e.g., copied and FAX-ed) images. We test the proposed algorithm on the University of Washington database and demonstrate its performance on a variety of images from different databases, as well as synthetic images.
Original language | English (US) |
---|---|
Pages | 1122-1125 |
Number of pages | 4 |
State | Published - Jan 1 2001 |
Event | IEEE International Conference on Image Processing (ICIP) 2001 - Thessaloniki, Greece Duration: Oct 7 2001 → Oct 10 2001 |
Other
Other | IEEE International Conference on Image Processing (ICIP) 2001 |
---|---|
Country/Territory | Greece |
City | Thessaloniki |
Period | 10/7/01 → 10/10/01 |
ASJC Scopus subject areas
- Hardware and Architecture
- Computer Vision and Pattern Recognition
- Electrical and Electronic Engineering