Recognition of Handwritten Arabic Characters using Histograms of Oriented Gradient (HOG)


Citar

Texto integral

Acesso aberto Acesso aberto
Acesso é fechado Acesso está concedido
Acesso é fechado Somente assinantes

Resumo

Optical Character Recognition (OCR) is the process of recognizing printed or handwritten text on paper documents. This paper proposes an OCR system for Arabic characters. In addition to the preprocessing phase, the proposed recognition system consists mainly of three phases. In the first phase, we employ word segmentation to extract characters. In the second phase, Histograms of Oriented Gradient (HOG) are used for feature extraction. The final phase employs Support Vector Machine (SVM) for classifying characters. We have applied the proposed method for the recognition of Jordanian city, town, and village names as a case study, in addition to many other words that offers the characters shapes that are not covered with Jordan cites. The set has carefully been selected to include every Arabic character in its all four forms. To this end, we have built our own dataset consisting of more than 43.000 handwritten Arabic words (30000 used in the training stage and 13000 used in the testing stage). Experimental results showed a great success of our recognition method compared to the state of the art techniques, where we could achieve very high recognition rates exceeding 99%.

Sobre autores

Noor Jebril

Computer Sciences Department

Autor responsável pela correspondência
Email: njebril@kfu.edu.sa
Arábia Saudita, Hasa, 31982

Hussein Al-Zoubi

Computer Engineering Department

Email: njebril@kfu.edu.sa
Jordânia, Irbid, 21163

Qasem Abu Al-Haija

Electrical Engineering Department.

Email: njebril@kfu.edu.sa
Arábia Saudita, Hasa, 31982

Arquivos suplementares

Arquivos suplementares
Ação
1. JATS XML

Declaração de direitos autorais © Pleiades Publishing, Ltd., 2018