[1]CHEN Jie,SUN Zhong-gui,ZHOU Shu-feng.Applying image classification using wavelets to digitization of document information[J].CAAI Transactions on Intelligent Systems,2010,5(2):185-188.
CAAI Transactions on Intelligent Systems [ISSN 1673-4785/CN 23-1538/TP] Volume:
5
Issue:
2010, No. 2
Page number:
185-188
Column:
Academic Papers: Natural Language Processing and Understanding
Publication date:
2010-04-25
- Title:
-
Applying image classification using wavelets to digitization of document information
- Author(s):
-
CHEN Jie1; SUN Zhong-gui2; ZHOU Shu-feng2
-
1. Library of Liaocheng University, Liaocheng 252059, China;
2. College of Mathematics Science, Liaocheng University, Liaocheng 252059, China
-
- Keywords:
-
document digitization; OCR; wavelet; text image
- CLC:
-
TP18; TN911.72
- DOI:
-
-
- Abstract:
-
The accuracy of optical character recognition (OCR) technology in distinguishing between text areas and image areas remains relatively low, which reduces the efficiency of OCR in digitizing document information. After analyzing the main steps of OCR as applied in a digital library, the authors developed an image classification algorithm based on wavelets. The algorithm first decomposes the scanned area with a wavelet transform. The energy value of the area is then derived from the wavelet coefficients, and text is distinguished from images by analyzing these energy values. The algorithm is fast and automatic, which increases the efficiency of digitizing document information. Simulation verified the feasibility of the new algorithm.
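The three steps named in the abstract (wavelet decomposition, energy from the coefficients, classification by energy) can be illustrated with a minimal sketch. The abstract does not state which wavelet, which energy definition, or which decision rule the authors used, so everything below is an assumption for illustration only: a one-level Haar transform, energy taken as the mean squared detail coefficient, and a hypothetical threshold above which a block is labeled text (on the assumption that sharp character strokes produce stronger high-frequency content than smooth picture regions). The function names and the threshold value are hypothetical.

```python
def haar2d(block):
    """One-level 2D Haar transform of a grayscale block.

    `block` is a list of rows with even height and width.
    Returns (approx, horiz, vert, diag), each a list of rows.
    """
    approx, horiz, vert, diag = [], [], [], []
    for r in range(0, len(block), 2):
        top, bottom = block[r], block[r + 1]
        row_a, row_h, row_v, row_d = [], [], [], []
        for c in range(0, len(top), 2):
            a, b = top[c], top[c + 1]
            d, e = bottom[c], bottom[c + 1]
            row_a.append((a + b + d + e) / 2.0)  # local average (low-pass)
            row_h.append((a - b + d - e) / 2.0)  # left column minus right column
            row_v.append((a + b - d - e) / 2.0)  # top row minus bottom row
            row_d.append((a - b - d + e) / 2.0)  # diagonal difference
        approx.append(row_a)
        horiz.append(row_h)
        vert.append(row_v)
        diag.append(row_d)
    return approx, horiz, vert, diag


def detail_energy(block):
    """Mean squared value of the three detail subbands (assumed energy measure)."""
    _, horiz, vert, diag = haar2d(block)
    coeffs = [x for band in (horiz, vert, diag) for row in band for x in row]
    return sum(x * x for x in coeffs) / len(coeffs)


# Hypothetical threshold for 0-255 grayscale input; a real system would
# have to tune this on labeled scans.
TEXT_ENERGY_THRESHOLD = 1000.0


def classify(block, threshold=TEXT_ENERGY_THRESHOLD):
    """Label a block 'text' if its detail energy exceeds the threshold."""
    return "text" if detail_energy(block) > threshold else "image"


# Usage: a block of sharp black/white stripes versus a flat gray block.
stripes = [[0, 255, 0, 255] for _ in range(4)]
flat = [[128, 128, 128, 128] for _ in range(4)]
print(classify(stripes), classify(flat))
```

A single fixed threshold is the simplest possible decision rule. Photographs with fine texture can also carry high detail energy, so a practical classifier would likely need per-subband energies or a learned boundary rather than this one-number test.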