Optical Character Recognition (OCR) is a key technology enabling access to digital text data. This technique is especially valuable for Arabic scripts, for which there has been very little digital access.
Arabic script is widely used today. It is estimated that approximately 200 million people use Arabic as a first language, and the Arabic script is shared by an additional 13 languages, making it the second most widespread script in the world. However, Arabic scripts pose unique challenges for OCR systems that cannot be simply adapted from existing Latin character-based processing...
Optical Character Recognition (OCR) is a key technology enabling access to digital text data. This technique is especially valuable for Arabic scri...
This Guide to OCR for Arabic Scripts is the first book of its kind, specifically devoted to this emerging field. Topics and features: contains contributions from the leading researchers in the field; with a Foreword by Professor Bente Maegaard of the University of Copenhagen; presents a detailed overview of Arabic character recognition technology, covering a range of different aspects of pre-processing and feature extraction; reviews a broad selection of varying approaches, including HMM-based methods and a recognition system based on multidimensional recurrent neural networks;...
This Guide to OCR for Arabic Scripts is the first book of its kind, specifically devoted to this emerging field. Topics and features: contains ...