Sparse kernel cepstral coefficients (SKCC): Inner-product based features for noise-robust speech recognition

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this paper we present a novel speech feature extraction algorithm based on sparse auditory coding and regression techniques in a reproducing kernel Hilbert space (RKHS). The features known as sparse kernel cepstral coefficients (SKCC) are extracted under the hypothesis that the noise-robust information in speech signal is embedded in a subspace spanned by overcomplete, regularized and normalized gamma-tone basis functions. After identifying the information bearing subspace, noise-robustness is achieved by sparsifying the SKCC features using simple thresholding. We show that computing the SKCC features involves correlating the speech signal with a pre-computed matrix, thus making the algorithm amenable to DSP based implementation. Speech recognition experiments using AURORA 2 dataset demonstrate that the SKCC features delivers consistent improvements in recognition performance over the state-of-the-art features under different noisy recording conditions.

Original languageEnglish
Title of host publication2011 IEEE International Symposium of Circuits and Systems, ISCAS 2011
Pages2401-2404
Number of pages4
DOIs
StatePublished - 2011
Event2011 IEEE International Symposium of Circuits and Systems, ISCAS 2011 - Rio de Janeiro, Brazil
Duration: May 15 2011May 18 2011

Publication series

NameProceedings - IEEE International Symposium on Circuits and Systems
ISSN (Print)0271-4310

Conference

Conference2011 IEEE International Symposium of Circuits and Systems, ISCAS 2011
Country/TerritoryBrazil
CityRio de Janeiro
Period05/15/1105/18/11

Fingerprint

Dive into the research topics of 'Sparse kernel cepstral coefficients (SKCC): Inner-product based features for noise-robust speech recognition'. Together they form a unique fingerprint.

Cite this