Algorithms for correcting recognition results using N-grams
- Authors: Manzhikov T.V., Slavin O.A., Faradjev I.A., Janiszewski I.M.
- Affiliations: Institute of System Analysis, Informatics and Management Federal Research Center
- Issue: Vol 27, No 4 (2017)
- Pages: 832-837
- Section: Applied Problems
- URL: https://bakhtiniada.ru/1054-6618/article/view/195268
- DOI: https://doi.org/10.1134/S1054661817040125
- ID: 195268
Abstract
This paper studies the use of N-grams for correcting word recognition results in documents, taking the recognition of fields in the passport of a citizen of the Russian Federation as an example. Three trigram-based algorithms for correcting recognition results are given. One of them combines trigram probabilities with recognition estimates. The other two are based on the definition of marginal distributions and on computations by means of graphical probabilistic models. Experimental results for the algorithms and a comparison of their characteristics are presented.
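To make the first idea in the abstract concrete, the sketch below combines per-character recognition scores with trigram probabilities and picks the highest-scoring word by dynamic programming. It is an illustrative assumption, not the algorithm from the paper: the function name `correct_word`, the `floor` value used for unseen trigrams, and the toy scores are all hypothetical.

```python
import math
from itertools import product


def correct_word(candidates, trigram_prob, floor=1e-6):
    """Pick the most plausible word from per-position recognition candidates.

    candidates   -- list with one dict per character position,
                    mapping candidate character -> recognition score in (0, 1]
    trigram_prob -- dict mapping 3-character strings to probabilities
    floor        -- assumed small probability for unseen trigrams or zero scores
    """
    n = len(candidates)
    if n == 0:
        return ""
    if n < 3:
        # Too short for a trigram: fall back to the top recognition score.
        return "".join(max(c, key=c.get) for c in candidates)

    def log_score(value):
        return math.log(max(value, floor))

    # State = last two characters; value = (log score, best string so far).
    best = {}
    for a, b in product(candidates[0], candidates[1]):
        score = log_score(candidates[0][a]) + log_score(candidates[1][b])
        best[(a, b)] = (score, a + b)

    for i in range(2, n):
        nxt = {}
        for (a, b), (score, text) in best.items():
            for c, rec in candidates[i].items():
                s = score + log_score(rec) + log_score(trigram_prob.get(a + b + c, floor))
                if (b, c) not in nxt or s > nxt[(b, c)][0]:
                    nxt[(b, c)] = (s, text + c)
        best = nxt

    return max(best.values(), key=lambda item: item[0])[1]


# Toy example: the recognizer slightly prefers "O" in the middle position,
# but the trigram statistics favor "CAT" over "COT".
toy_candidates = [
    {"C": 0.9, "G": 0.1},
    {"O": 0.55, "A": 0.45},
    {"T": 0.8, "I": 0.2},
]
toy_trigrams = {"CAT": 0.05, "COT": 0.001}
print(correct_word(toy_candidates, toy_trigrams))  # → CAT
```

Taking the top-scoring character at each position independently would give "COT" here; the trigram term is what changes the result to "CAT".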
About the authors
T. V. Manzhikov
Institute of System Analysis, Informatics and Management Federal Research Center
Author for correspondence.
Email: tmanzhikov@gmail.com
Russian Federation, Moscow, 117312
O. A. Slavin
Institute of System Analysis, Informatics and Management Federal Research Center
Email: tmanzhikov@gmail.com
Russian Federation, Moscow, 117312
I. A. Faradjev
Institute of System Analysis, Informatics and Management Federal Research Center
Email: tmanzhikov@gmail.com
Russian Federation, Moscow, 117312
I. M. Janiszewski
Institute of System Analysis, Informatics and Management Federal Research Center
Email: tmanzhikov@gmail.com
Russian Federation, Moscow, 117312
