Vocal Source Contribution to Speaker Recognition

V. N. Sorokin

doi:10.1134/S1054661818030197

Vocal Source Contribution to Speaker Recognition

作者: Sorokin V.N.¹
隶属关系:
1. Institute for Information Transmission Problems
期: 卷 28, 编号 3 (2018)
页面: 546-556
栏目: Applied Problems
URL: https://bakhtiniada.ru/1054-6618/article/view/195440
DOI: https://doi.org/10.1134/S1054661818030197
ID: 195440

如何引用文章

全文:

开放存取

##reader.subscriptionAccessGranted##
受限制的访问

订阅存取

详细
作者简介
参考
补充文件
统计

详细

The vocal source and the pulse shape of the glottal flow are determined through the regularized ratio of the speech signal spectra at the intervals of the open and closed vocal slit within each period of the fundamental tone. Three databases were used: Russian numerals for 216 men and 177 women, the base obtained by converting the Russian database by the codec on 9.2 kbps, and the TIMIT database. The pitch period and 7 coefficients for the principal components of the glottal flow provide an average error of recognizing males below 8% for a sequence of 6 vowels. The minimum average recognition error for the initial base of Russian numerals for females makes about 15%, for males in the codec database makes about 15%, and for males in the TIMIT makes about 44%. The minimum average error of males’ recognition in the space of 7 coefficients for the principal components in the Russian database makes about 26%, but about 27% of the speakers have an average error of less than 10%.

关键词

vocal source estimation, glottal flow analysis, spectral inversion, speaker verification

作者简介

V. Sorokin

Institute for Information Transmission Problems

编辑信件的主要联系方式.
Email: vns@iitp.ru
俄罗斯联邦, Moscow, 127051

补充文件

附件文件

动作

1. JATS XML

下载

用户名
密码
记住我

忘记您的密码?	注册

用户名
密码
记住我

忘记您的密码?	注册