[1]王振力,裴凌波,于元斌.一种基于噪声对消与倒谱均值相减的鲁棒语音识别方法[J].智能系统学报,2008,3(6):552-556.
WANG Zhen-li,PEI Ling-bo,YU Yuan-bin.A robust speech recognition method by combining noise cancelling and cepstral mean subtraction[J].CAAI Transactions on Intelligent Systems,2008,3(6):552-556.
点击复制
《智能系统学报》[ISSN 1673-4785/CN 23-1538/TP] 卷:
3
期数:
2008年第6期
页码:
552-556
栏目:
学术论文—机器感知与模式识别
出版日期:
2008-12-25
- Title:
-
A robust speech recognition method by combining noise cancelling and cepstral mean subtraction
- 文章编号:
-
1673-4785(2008)06-0552-05
- 作者:
-
王振力1, 裴凌波2, 于元斌1
-
1.南京国际关系学院博士后流动站,江苏南京210039; 2.工程兵指挥学院训练部,江苏徐州221004
- Author(s):
-
WANG Zhen-li1, PEI Ling-bo2, YU Yuan-bin1
-
1.Postdoctoral Station, Nanjing University of International Relations, Nanjing 210039, China;2.Training Department, The Command Academy of Engineer Corps, Xuzhou 221004, China
-
- 关键词:
-
自适应噪声对消; 语音增强; 谱减法; 噪声鲁棒语音识别; 倒谱均值相减法
- Keywords:
-
adaptive noise cancelling; speech enhancement; spectral subtraction; noise robust speech recognition; cepstral mean subtraction
- 分类号:
-
TN912.34
- 文献标志码:
-
A
- 摘要:
-
提出一种基于语音增强算法的噪声鲁棒语音识别方法.在语音识别预处理阶段,通过噪声对消语音增强法来抑制噪声提高信噪比.然后对增强语音提取Mel频段倒谱特征参数,并在倒谱域应用倒谱均值相减处理来补偿增强语音中的失真成分和剩余噪声.实验结果表明,在低信噪比(-12~0dB)条件下,该方法对于数字语音识别具有较好的识别率,其性能明显优于基本的Mel频段倒谱参数识别器、传统的谱减法和噪声对消语音增强法.
- Abstract:
-
A noise resistant speech recognition method based on a speech enhancement algorithm was implemented. First,it obtains the denoised speech, with significant SNR (signaltonoise ratio) improvement, by applying adaptive noise cancelling (ANC) to the pretreatment stage of speech recognition. Then Melfrequency cepstral coefficients(MFCC)are computed from the enhanced speech. Then cepstral mean subtraction (CMS) is used to compensate for components of distortion and the residual noise of the enhanced speech in the cepstral domain. When speech samples have a low SNR, ranging from 0 to 12dB, experimental results indicate that the proposed method performs better than a standard MFCC recognizer, conventional spectral subtraction (SS) and the ANC speech enhancement for digital speech recognition.
备注/Memo
收稿日期:2008- 03- 06.
基金项目:江苏省博士后科研基金资助项目(0701008C);中国博士后科学基金资助项目(20070420561).
作者简介:
王振力,男,1977年生,工程师,博士后,主要研究方向为人工智能、多媒体信息处理等.发表学术论文20余篇,被SCI、EI、ISTP收录10余篇.
裴凌波,男,1972年生,讲师,主要研究方向为网络测量、网络性能建模和智能化信息检索等.发表学术论文20余篇.
于元斌,男,1973年生,讲师,博士后.主要研究方向为作战指挥.发表学术论文10余篇,出版专著3部.
更新日期/Last Update:
2009-04-06