Compound Binarization for Degraded Document Image And Feature Point Extraction For Handwritten Arabic Optical Character Recognition
Optical character recognition (OCR) is a system aims to improve human machine interaction and widely used in many areas. Recognition of Arabic characters is difficult due to the cursive nature of Arabic scripts. The Arabic OCR system consists of five components: image acquisition, pre-processing,...
Saved in:
Main Author: | |
---|---|
Format: | Thesis |
Language: | en_US |
Subjects: | |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Optical character recognition (OCR) is a system aims to improve human machine
interaction and widely used in many areas. Recognition of Arabic characters is difficult
due to the cursive nature of Arabic scripts. The Arabic OCR system consists of five
components: image acquisition, pre-processing, segmentation, feature extraction and
classification. Binarization is the main pre-processing process that consists of the
existing local and global thresholding methods. However, those methods are not
applicable in many binarization problems especially for degraded document images.
Baseline estimation is another pre-processing method aims to extract the virtual
horizontal line where all characters lay and join in a specific part of each character. This
existing method is inaccurate due to irregularity in sub-words alignment and a wide
variety of free writing styles. The third component in OCR system is segmentation of
text into characters. Nevertheless, the cursive, ligatures and overlapping characters
differentiate the Arabic script from other languages. Therefore, Arabic OCR system
requires a highly sophisticated segmentation method. This work proposes three methods
and framework for OCR. First, the proposed compound binarization method that
combines th e advantages of local and global thresholding method, tested on DIBCO
2009, 20 1 1 and 20 13 benchmark. Based on experimental results, the F-measure of
proposed binarization method for printed document image is 88% and for handwritten
is 78%, while the PSNR measurement for printed document image is 15.99 and for
handwritten is 16.34. Secondly the proposed baseline estimation method for binary
image based on feature points detection which tested on IFNIENIT dataset, when the
estimated pixel error is less than 15 pixels the accuracy of the proposed baseline
estimation method is 87.3%. And finally the proposed segmentation method based on
baseline estimation and structural rules which tested using IFNIENIT dataset, the
accuracy of the proposed method is 87.09%. The developed methods gained better
accuracy rate when compared with the state of the art methods using quantities
measurements.I t is able to recover document image of Arabic texts. |
---|