[1]张少白,刘欣.基于DIVA模型的语音-映射单元自动获取[J].智能系统学报,2013,8(4):305-311.[doi:10.3969/j.issn.1673-4785.201304049]
ZHANG Shaobai,LIU Xin.Automatic acquisition of speech sound-target cells based on DIVA model[J].CAAI Transactions on Intelligent Systems,2013,8(4):305-311.[doi:10.3969/j.issn.1673-4785.201304049]
点击复制
《智能系统学报》[ISSN 1673-4785/CN 23-1538/TP] 卷:
8
期数:
2013年第4期
页码:
305-311
栏目:
学术论文—机器感知与模式识别
出版日期:
2013-08-25
- Title:
-
Automatic acquisition of speech sound-target cells based on DIVA model
- 文章编号:
-
1673-4785(2013)04-0305-07
- 作者:
-
张少白,刘欣
-
南京邮电大学 计算机学院,江苏 南京 210046
- Author(s):
-
ZHANG Shaobai, LIU Xin
-
College of Computer, Nanjing University of Posts and Telecommunications, Nanjing 210046, China
-
- 关键词:
-
DIVA模型; 音素; 语音-映射单元; 语音生成与获取
- Keywords:
-
DIVA model; phoneme; speech sound-target cells; speech acquisition and production
- 分类号:
-
TP31
- DOI:
-
10.3969/j.issn.1673-4785.201304049
- 文献标志码:
-
A
- 摘要:
-
针对DIVA模型中存在的“感知能力与语音生成技巧发育不平衡”问题,提出了一种自动获取语音-映射单元的方法.该方法将人耳模拟为一个具有不同带宽的并联带通滤波器组,分别与模型中21维度的听觉存储空间相关联,对不同听觉的不同反应,分别考虑其频带的屏蔽效应、听觉响度与频率的关系.在读取语音输入信号的过程中,模型能较好地获得初始听觉表示,其方式与婴儿咿呀学语的过程基本一致.仿真实验表明,通过边界定义、相似性比较以及搜索更新等步骤,此方法能很好地进行初始输入模式的自组织匹配,并最终使DIVA模型更具语音获取的自然特性.
- Abstract:
-
Contraposing the shortage of Directions Into Velocities of Articulators (DIVA) model about “infants perceptual abilities do develop faster at first than their speech production skills”, the paper presents an automatic acquisition method of speech sound-target cells. The method simulates the human ear as a parallel band-pass filter group with different bandwidth and associates respectively; the filter with the 21-dimensional storage space of auditory sense in DIVA model. This method was done in order for different auditory reactions, the shielding effect of frequency band, sound loudness, and frequency relation could be considered respectively for this study. In the process of reading the input signal of speech, the model can acquire good initial hearing and the process is consistent with baby’s babble. The simulation results show that through boundary definition, similarity comparison, searching and updates and so on, the method has nicer self-organized pattern matching effect for initial input, which makes the DIVA model a more natural characteristic regarding speech acquisition.
更新日期/Last Update:
2013-09-25