[1]LUO Yuan, TONG Kaiguo, ZHANG Yi, et al. Sound source separation of a multi voice environment based on human ear listening properties[J]. CAAI Transactions on Intelligent Systems, 2012, 7(2): 121-128.

Sound source separation of a multi voice environment based on human ear listening properties

References:
[1]OZEROV A, VINCENT E, BIMBOT F. A general modular framework for audio source separation[C]//9th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA’10). Saint-Malo, France, 2010: 33-40.
[2]VINCENT E, BERTIN N, BADEAU R. Harmonic and inharmonic nonnegative matrix factorization for polyphonic pitch transcription[C]//Proc of IEEE International Conference on Acoustics, Speech, and Signal Processing. Rennes Cedex, France, 2008: 109-112.
[3]FITZGERALD D, GAINZA M. Single channel vocal separation using median filtering and factorization techniques[J]. ISAST Transactions on Electronic and Signal Processing, 2010, 4(1): 62-73.
[4]ZHAO Heming, GE Liang, CHEN Xueqin, et al. Research on speech separation based on sound localization and the auditory masking effect[J]. Journal of Semiconductors, 2005, 33(1): 158-160.
[5]LIU Jindong, ERWIN H, WERMTER S. Mobile robot broadband sound localisation using a biologically inspired spiking neural network[C]//Proceedings of IEEE/RSJ Int Conf on Intelligent Robots and Systems in Nice. [S.l.], 2008: 2191-2196.
[6]DURRIEU J L, RICHARD G, DAVID B. An iterative approach to monaural musical mixture de-soloing[C]//Proc of IEEE International Conference on Acoustics, Speech, and Signal Processing. Paris, France, 2009: 105-108.
[7]KONIARIS C, CHATTERJEE S, KLEIJN W B. Towards effective singing voice extraction from stereophonic recordings[C]//2010 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP). Hatfield, UK, 2010: 233-236.
[8]BROWN G J, FERRY R T, MEDDIS R. A computer model of auditory efferent suppression: implications for the recognition of speech in noise[J]. Journal of the Acoustical Society of America, 2010, 127(2): 943-954.
[9]DUONG N, VINCENT E, GRIBONVAL R. Spatial covariance models for underdetermined reverberant audio source separation[C]//Applications of Signal Processing to Audio and Acoustics 2009 (WASPAA’09). Rennes, France, 2009: 129-132.
[10]DONG Yi, MIHALAS S, NIEBUR E. Improved integral equation solution for the first passage time of leaky integrate-and-fire neurons[J]. Neural Computation, 2011, 23(2): 421-434.
[11]VOUTSAS K, ADAMY J. A biologically inspired spiking neural network for sound source lateralization[J]. IEEE Trans Neural Networks, 2007, 18(6): 1785-1799.

Memo

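Received: 2011-09-28.
Funding: International Cooperation Program of the Ministry of Science and Technology (2010DF12160); Chongqing Key Science and Technology Research Program (CSTC: 2010AA2055).
Corresponding author: TONG Kaiguo. E-mail: 359018647@qq.com.
About the authors: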
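LUO Yuan, female, born in 1972, professor, Ph.D. In recent years she has participated in and led a number of national and provincial or ministerial projects, including international cooperation projects of the Ministry of Science and Technology, Ministry of Education projects for returned overseas scholars, and Chongqing research projects. Her main research interests are machine vision, human-computer interaction, and testing based on image and video processing. She has published more than 60 academic papers in recent years, over 20 of which are indexed by SCI or EI, and holds 3 national invention patents.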
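TONG Kaiguo, male, born in 1985, master's student. His main research interests are speech recognition and intelligent robots. He has published 4 academic papers.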
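ZHANG Yi, male, born in 1966, professor, doctoral supervisor, with postdoctoral research experience. In recent years he has undertaken an international cooperation project of the Ministry of Science and Technology, a key merit-based funded project under the Ministry of Personnel's science and technology activities program for returned overseas scholars, and the Chongqing science and technology key project topic "Research and development of a wheelchair robot navigation and control system". He is an editorial board member for the special issues on intelligent systems and robotics of the international journals International Journal of Modelling, Identification and Control, International Journal of Automation and Computing, and International Journal of Advanced Mechatronic Systems.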

Last Update: 2012-07-12