TY - GEN
T1 - Support vector machines for segmental minimum Bayes risk decoding of continuous speech
AU - Venkataramani, Veera
AU - Chakrabartty, Shantanu
AU - Byrne, William
N1 - Publisher Copyright:
© 2003 IEEE.
PY - 2003
Y1 - 2003
N2 - Segmental Minimum Bayes Risk (SMBR) Decoding involves the refinement of the search space into sequences of small sets of confusable words. We describe the application of Support Vector Machines (SVMs) as discriminative models for the refined search spaces. We show that SVMs, which in their basic formulation are binary classifiers of fixed dimensional observations, can be used for continuous speech recognition. We also study the use of GiniSVMs, which is a variant of the basic SVM. On a small vocabulary task, we show this two pass scheme outperforms MMI trained HMMs. Using system combination we also obtain further improvements over discriminatively trained HMMs.
AB - Segmental Minimum Bayes Risk (SMBR) Decoding involves the refinement of the search space into sequences of small sets of confusable words. We describe the application of Support Vector Machines (SVMs) as discriminative models for the refined search spaces. We show that SVMs, which in their basic formulation are binary classifiers of fixed dimensional observations, can be used for continuous speech recognition. We also study the use of GiniSVMs, which is a variant of the basic SVM. On a small vocabulary task, we show this two pass scheme outperforms MMI trained HMMs. Using system combination we also obtain further improvements over discriminatively trained HMMs.
UR - http://www.scopus.com/inward/record.url?scp=33645775754&partnerID=8YFLogxK
U2 - 10.1109/ASRU.2003.1318396
DO - 10.1109/ASRU.2003.1318396
M3 - Conference contribution
AN - SCOPUS:33645775754
T3 - 2003 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2003
SP - 13
EP - 18
BT - 2003 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2003
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2003
Y2 - 30 November 2003 through 4 December 2003
ER -