Full Text

Turn on search term navigation

Copyright © 2016 Valentin Smirnov et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

The paper describes the key concepts of a word spotting system for Russian based on large vocabulary continuous speech recognition. Key algorithms and system settings are described, including the pronunciation variation algorithm, and the experimental results on the real-life telecom data are provided. The description of system architecture and the user interface is provided. The system is based on CMU Sphinx open-source speech recognition platform and on the linguistic models and algorithms developed by Speech Drive LLC. The effective combination of baseline statistic methods, real-world training data, and the intensive use of linguistic knowledge led to a quality result applicable to industrial use.

Details

Title
A Russian Keyword Spotting System Based on Large Vocabulary Continuous Speech Recognition and Linguistic Knowledge
Author
Smirnov, Valentin; Ignatov, Dmitry; Gusev, Michael; Mais Farkhadov; Rumyantseva, Natalia; Farkhadova, Mukhabbat
Publication year
2016
Publication date
2016
Publisher
John Wiley & Sons, Inc.
ISSN
20900147
e-ISSN
20900155
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
1860184545
Copyright
Copyright © 2016 Valentin Smirnov et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.