Algorithms for correcting recognition results using N-grams



Abstract

This paper studies the use of N-grams to correct word recognition results in documents, taking the recognition of fields in the passport of a citizen of the Russian Federation as an example. Three algorithms for correcting recognition results are given for trigrams. One of them combines trigram probabilities with the recognition confidence estimates. The other two are based on defining marginal distributions and computing them by means of probabilistic graphical models. Experimental results for the algorithms are presented, together with a comparison of their characteristics.
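The abstract does not spell out the algorithms, so the following is only a minimal sketch of the general idea behind the first one: combining character-level recognition estimates with trigram probabilities to choose the most plausible word. Everything here is an assumption made for illustration and not the authors' method: the function names, the `^`/`$` padding symbols, the add-alpha smoothing, the `weight` parameter, and the exhaustive enumeration of candidates.

```python
from itertools import product
from math import log


def trigrams(word):
    # Pad so that the first and last characters also take part in trigrams.
    padded = "^^" + word + "$"
    return [padded[i:i + 3] for i in range(len(padded) - 2)]


def train_trigram_counts(words):
    # Count trigram occurrences over a reference word list (hypothetical lexicon).
    counts = {}
    for w in words:
        for t in trigrams(w):
            counts[t] = counts.get(t, 0) + 1
    return counts


def trigram_log_prob(word, counts, total, alpha=1.0, vocab=1000):
    # Sum of log relative trigram frequencies with add-alpha smoothing.
    # alpha and vocab are illustrative constants, not values from the paper.
    return sum(
        log((counts.get(t, 0) + alpha) / (total + alpha * vocab))
        for t in trigrams(word)
    )


def correct(alternatives, counts, weight=1.0):
    # alternatives: one list per character position, each holding
    # (character, recognition estimate) pairs with estimates > 0.
    total = sum(counts.values())
    best_word, best_score = None, float("-inf")
    for combo in product(*alternatives):
        word = "".join(ch for ch, _ in combo)
        recognition = sum(log(p) for _, p in combo)
        score = recognition + weight * trigram_log_prob(word, counts, total)
        if score > best_score:
            best_word, best_score = word, score
    return best_word


counts = train_trigram_counts(["IVANOV", "IVANOVA", "PETROV"])
alternatives = [
    [("I", 0.9)], [("V", 0.9)], [("A", 0.9)], [("N", 0.9)],
    [("0", 0.55), ("O", 0.45)],  # the recognizer slightly prefers the digit
    [("V", 0.9)],
]
print(correct(alternatives, counts))
```

In this toy example the recognizer ranks the digit `0` above the letter `O`, but the trigram statistics from the small lexicon outweigh that preference. Exhaustive enumeration is only practical for short fields with few alternatives per position; a dynamic-programming search would be the usual replacement for longer inputs.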

About the authors

T. Manzhikov

Institute of System Analysis, Informatics and Management Federal Research Center

Corresponding author
Email: tmanzhikov@gmail.com
Russia, Moscow, 117312

O. Slavin

Institute of System Analysis, Informatics and Management Federal Research Center

Email: tmanzhikov@gmail.com
Russia, Moscow, 117312

I. Faradjev

Institute of System Analysis, Informatics and Management Federal Research Center

Email: tmanzhikov@gmail.com
Russia, Moscow, 117312

I. Janiszewski

Institute of System Analysis, Informatics and Management Federal Research Center

Email: tmanzhikov@gmail.com
Russia, Moscow, 117312


Copyright © Pleiades Publishing, Ltd., 2017