Connected Digits Recognition Task: ISTC–CNR Comparison of Open Source Tools

EVALITA is a recent initiative devoted to the evaluation of Natural Language and Speech Processing tools for Italian. In this work, the results of three open source ASR toolkits will be described. CSLU Speech Tools, CSLR SONIC, CMU SPHINX are applied on the EVALITA clean and noisy digits recognition task and this report will describe the complete evaluation
methodology. CSLR SONIC has resulted to have the best performances in all the tasks and even with high specialized trainings. We think that it is mostly because of the PMVDR features used in this system. CMU SPHINX has been the easiest system to train and test and its general performances are only slightly lower than SONIC. CSLU Speech Tools is the most specialized recognition system on digit and its score stands in the middle of the others. Overall, the three systems have Word Accuracy score over 90%.

Publication type: 
Contributo in atti di convegno
Author or Creator: 
Piero Cosi
Mauro Nicolao
Publisher: 
Associazione Italiana per l'Intelligenza Artificiale, Cesena, ITA
Source: 
EVALITA Workshop 2009, Workshop of the XI Conference of the Italian Association for Artificial Intelligence, Reggio Emilia, Italy, December 9-12, 2009
Date: 
2009
Resource Identifier: 
http://www.cnr.it/prodotto/i/140171
https://mailserver.di.unipi.it/ricerca/proceedings/AIIA09workshops/EVALITA/reports/Connected%20Digits%20Recognition/DIGITS_ISTC-SPFD_CNR.pdf
Language: 
Eng
ISTC Author: 
Piero Cosi's picture
Real name: