TY - JOUR
T1 - New neural network for articulatory speech recognition and its application to vowel identification
AU - Zacks, Jeff
AU - Thomas, Timothy R.
PY - 1994/7
Y1 - 1994/7
N2 - A system for automatic speech recognition (ASR) based on a new neural network design and a theory of articulatory phonology is presented. This system operates in two stages. In the first, speech acoustics are mapped by a neural network onto the movements of the tongue and lips that produced those acoustics (the neural networks are trained on X-ray microbeam recordings of actual articulatory movements); in the second stage, gestures are recovered from those movements. The neural network is built around a new objective function, Correlational + Scaling Error (COSE). When compared to a traditional neural network system, the COSE system trains faster, produces output which better represents the shape of the articulatory movements, and yields higher recognition rates for vowel gestures. After training on two speakers, recognition rates up to 96% for tokens from the training set and 87% for tokens spoken by a novel speaker were achieved.
AB - A system for automatic speech recognition (ASR) based on a new neural network design and a theory of articulatory phonology is presented. This system operates in two stages. In the first, speech acoustics are mapped by a neural network onto the movements of the tongue and lips that produced those acoustics (the neural networks are trained on X-ray microbeam recordings of actual articulatory movements); in the second stage, gestures are recovered from those movements. The neural network is built around a new objective function, Correlational + Scaling Error (COSE). When compared to a traditional neural network system, the COSE system trains faster, produces output which better represents the shape of the articulatory movements, and yields higher recognition rates for vowel gestures. After training on two speakers, recognition rates up to 96% for tokens from the training set and 87% for tokens spoken by a novel speaker were achieved.
UR - https://www.scopus.com/pages/publications/0028464701
U2 - 10.1006/csla.1994.1009
DO - 10.1006/csla.1994.1009
M3 - Article
AN - SCOPUS:0028464701
SN - 0885-2308
VL - 8
SP - 189
EP - 209
JO - Computer Speech and Language
JF - Computer Speech and Language
IS - 3
ER -