Use of Multi-Layered Networks for Coding Speech with Phonetic Features

Preliminary results on speaker-independant speech recognition are reported. A method that combines expertise on neural networks with expertise on speech recognition is used to build the recognition systems. For transient sounds, event-driven property extractors with variable resolution in the time and frequency domains are used. For sonorant speech, a model of the human auditory system is preferred to FFT as a front-end module.

Publication type: 
Contributo in volume
Author or Creator: 
Bengio Y.
Cardin R.
Cosi P.
De Mori R.
Publisher: 
MORGAN KAUFMANN, Palo Alto, USA
Source: 
Advances in Neural Information Processing Systems, edited by D.S. Touretzky, pp. 224–231. Palo Alto: MORGAN KAUFMANN, 1989
Date: 
1989
Resource Identifier: 
http://www.cnr.it/prodotto/i/238307
http://books.nips.cc/papers/files/nips01/0224.pdf
urn:isbn:1-558-60015-9
Language: 
Eng
ISTC Author: 
Piero Cosi's picture
Real name: