Research Article Open Access

Optical Character Recognition System for Arabic Text Using Cursive Multi-Directional Approach

Mansoor Al-A'ali and Jamil Ahmad

Abstract

This paper presents a novel technique for the recognition of Arabic text based on feature extraction and dynamic cursor sizing. The most challenging area in Arabic OCR (AOCR) research is the segmentation of words into their sub-words and individual characters. The basic concept followed here is a logical, dynamically sized cursor that "travels" through a text image one word at a time while extracting features of strokes. Several rules are defined that govern the size and movement of the cursor through each segment. The features obtained from each segment are termed strokes; each segment is defined by a number of strokes, and each stroke is defined mainly in terms of a sequence of directions. The strokes obtained are then "pieced" back together and classified into character classes using a knowledge base, from which the characters are eventually recognized. The results demonstrate that the technique is successful.
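To make the idea of a stroke "defined mainly in terms of a sequence of directions" concrete, the following is a minimal sketch and not the authors' implementation. It assumes the cursor has already produced an ordered path of 8-connected pixel coordinates, and it uses a Freeman-style 8-direction coding as a stand-in for whatever direction set the paper actually uses. The names `DIRECTIONS` and `stroke_directions` are hypothetical, and the dynamic cursor sizing rules are not modelled.

```python
from itertools import groupby

# Hypothetical direction names keyed by (d_row, d_col); rows grow downward.
DIRECTIONS = {
    (0, 1): "E", (-1, 1): "NE", (-1, 0): "N", (-1, -1): "NW",
    (0, -1): "W", (1, -1): "SW", (1, 0): "S", (1, 1): "SE",
}


def stroke_directions(path):
    """Map an ordered list of (row, col) pixels to a run-compressed direction sequence."""
    steps = []
    for (r0, c0), (r1, c1) in zip(path, path[1:]):
        delta = (r1 - r0, c1 - c0)
        if delta not in DIRECTIONS:
            raise ValueError(f"pixels {(r0, c0)} and {(r1, c1)} are not 8-neighbours")
        steps.append(DIRECTIONS[delta])
    # Collapse runs so the stroke is described by its changes of direction.
    return [name for name, _ in groupby(steps)]


# Assumed example path: two steps to the right, then two steps up and to the right.
example_path = [(2, 0), (2, 1), (2, 2), (1, 3), (0, 4)]
print(stroke_directions(example_path))  # ['E', 'NE']
```

Collapsing repeated steps keeps the description independent of stroke length, which is one plausible way a knowledge base could match strokes of different sizes; the paper may encode strokes differently.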

Journal of Computer Science
Volume 3 No. 7, 2007, 549-555

DOI: https://doi.org/10.3844/jcssp.2007.549.555

Submitted On: 18 February 2007 Published On: 31 July 2007

How to Cite: Al-A'ali, M. & Ahmad, J. (2007). Optical Character Recognition System for Arabic Text Using Cursive Multi-Directional Approach. Journal of Computer Science, 3(7), 549-555. https://doi.org/10.3844/jcssp.2007.549.555

Keywords

  • Arabic
  • OCR
  • features
  • strokes
  • segments