Phonetically-Based Multi-Layered Neural Networks for Vowel Classification | Istituto di Scienze e Tecnologie della Cognizione

The vowel sub-component of a speaker-independent phoneme classification system will be described. The architecture of the vowel classifier is based on an ear model followed by a set of Multi-Layered Neural Networks (MLNN). MLNNs are trained to learn how to recognize articulatory features like the place of articulation and the manner of articulation related to tongue position.
Experiments are performed on 10 English vowels showing a recognition rate higher than 95% on new speakers. When features are used for recognition, comparable results are obtained for vowels and diphthongs not used for training and pronounced by new speakers. This suggests that MLNNs suitably fed by the data computed by an ear model have good generalization capabilities over new speakers and new sounds.

Publication type:

Articolo

Author or Creator:

Cosi P.

Bengio Y.

De Mori R.

Publisher:

North-Holland, Amsterdam , Paesi Bassi

Source:

Speech communication (Print) 9 (1990): 15–29. doi:10.1016/0167-6393(90)90041-7

info:cnr-pdr/source/autori:Cosi P., Bengio Y., De Mori R./titolo:Phonetically-Based Multi-Layered Neural Networks for Vowel Classification/doi:10.1016/0167-6393(90)90041-7/rivista:Speech communication (Print)/anno:1990/pagina_da:15/pagina_a:29/intervallo_

Date:

1990

Resource Identifier:

http://www.cnr.it/prodotto/i/209915

https://dx.doi.org/10.1016/0167-6393(90)90041-7

info:doi:10.1016/0167-6393(90)90041-7

http://www.sciencedirect.com/science/article/pii/0167639390900417

Language:

Eng

ISTC Author:

Real name:

Piero Cosi