TY - GEN
T1 - Sparse kernel cepstral coefficients (SKCC)
T2 - 2011 IEEE International Symposium of Circuits and Systems, ISCAS 2011
AU - Fazel, Amin
AU - Chakrabartty, Shantanu
PY - 2011
Y1 - 2011
N2 - In this paper we present a novel speech feature extraction algorithm based on sparse auditory coding and regression techniques in a reproducing kernel Hilbert space (RKHS). The features known as sparse kernel cepstral coefficients (SKCC) are extracted under the hypothesis that the noise-robust information in speech signal is embedded in a subspace spanned by overcomplete, regularized and normalized gamma-tone basis functions. After identifying the information bearing subspace, noise-robustness is achieved by sparsifying the SKCC features using simple thresholding. We show that computing the SKCC features involves correlating the speech signal with a pre-computed matrix, thus making the algorithm amenable to DSP based implementation. Speech recognition experiments using AURORA 2 dataset demonstrate that the SKCC features delivers consistent improvements in recognition performance over the state-of-the-art features under different noisy recording conditions.
AB - In this paper we present a novel speech feature extraction algorithm based on sparse auditory coding and regression techniques in a reproducing kernel Hilbert space (RKHS). The features known as sparse kernel cepstral coefficients (SKCC) are extracted under the hypothesis that the noise-robust information in speech signal is embedded in a subspace spanned by overcomplete, regularized and normalized gamma-tone basis functions. After identifying the information bearing subspace, noise-robustness is achieved by sparsifying the SKCC features using simple thresholding. We show that computing the SKCC features involves correlating the speech signal with a pre-computed matrix, thus making the algorithm amenable to DSP based implementation. Speech recognition experiments using AURORA 2 dataset demonstrate that the SKCC features delivers consistent improvements in recognition performance over the state-of-the-art features under different noisy recording conditions.
UR - http://www.scopus.com/inward/record.url?scp=79960855406&partnerID=8YFLogxK
U2 - 10.1109/ISCAS.2011.5938087
DO - 10.1109/ISCAS.2011.5938087
M3 - Conference contribution
AN - SCOPUS:79960855406
SN - 9781424494736
T3 - Proceedings - IEEE International Symposium on Circuits and Systems
SP - 2401
EP - 2404
BT - 2011 IEEE International Symposium of Circuits and Systems, ISCAS 2011
Y2 - 15 May 2011 through 18 May 2011
ER -