[1]ZHANG Yi,XIE Yanyi,LUO Yuan,et al.Postprocessing method of MFCC in speech feature extraction[J].CAAI Transactions on Intelligent Systems,2016,11(2):208-215.[doi:10.11992/tis.201511008]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
11
Number of periods:
2016 2
Page number:
208-215
Column:
学术论文—机器感知与模式识别
Public date:
2016-04-25
- Title:
-
Postprocessing method of MFCC in speech feature extraction
- Author(s):
-
ZHANG Yi1; XIE Yanyi2; LUO Yuan3; XI Bing3
-
1. Institute of Advanced Manufacturing Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China;
2. College of Automation, Chongqing University of Posts and Telecommunications, Chongqing 400065, China;
3. College of Opto Electronic Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
-
- Keywords:
-
postprocessing; phonetic feature; speech recognition; noise; robustness
- CLC:
-
TP391.4
- DOI:
-
10.11992/tis.201511008
- Abstract:
-
To improve the robustness of automatic speech recognition systems, a new speech feature postprocessing method based on the Mel-frequency Cepstral Coefficient (MFCC) is proposed, which is named the MVDA postprocessing method. The postprocessed feature parameters are named MVDAs. This technique combines mean subtraction, variance normalization, time sequence fltering, and autoregressive moving average flters. Experiments were conducted to compare MVDA and MFCC. Changes in the Euclidean distance of the speech with noise and the speech signal were analyzed, proving that every step of MVDA postprocessing could effectively reduce the noise interference. Thus, all MVDAs in different noise environments were superior. This simple feature does not only achieve the effect of many complex speech feature processing methods but also effectively reduces the computational complexity of automatic speech recognition systems.