[1]李 蕾,周延泉,钟义信.基于语用的自然语言处理研究与应用初探[J].智能系统学报,2006,1(2):1-6.
LI Lei,ZHOU Yan-quan,ZHONG Yi-xin.Pragmatic Information Based NLP Research and Application[J].CAAI Transactions on Intelligent Systems,2006,1(2):1-6.
点击复制
《智能系统学报》[ISSN 1673-4785/CN 23-1538/TP] 卷:
1
期数:
2006年第2期
页码:
1-6
栏目:
综述
出版日期:
2006-10-25
- Title:
-
Pragmatic Information Based NLP Research and Application
- 文章编号:
-
1673-4785(2006)02-0001-06
- 作者:
-
李 蕾,周延泉,钟义信
-
北京邮电大学智能科学技术研究中心,北京100876
- Author(s):
-
LI Lei,?? ZHOU Yan-quan,??? ZHONG Yi-xin
-
Center for Intelligence Science and Technology Research, Beijing Uni versity of Posts & Telecommunications, Beijing 100876, China
-
- 关键词:
-
自然语言处理; 语用信息; 语音识别检错纠错
- Keywords:
-
language processing (NLP); pragmatic information; error detect ion and correction for SR
- 分类号:
-
TP391
- 文献标志码:
-
A
- 摘要:
-
首先分析了语用信息的必要性和重要性,认为只有融入语用研究的自然语言处理技术才能显示“以人为本"和智能化的特色,只有语用、语义和语法信息的研究都成熟了,才能使计算机真正获得自然语言所表达的信息,达到与人类交流对话的水平.接着介绍了语用学的产生、发展和运用状况,剖析了存在的主要问题,提出了基于语用的自然语言处理.然后结合典型应用背景——奥运多语言信息服务示范终端 “CityGuide"语音识别后文本的检错纠错需求,探索并尝试了一种基于语用信息的自然语言处理检错纠错方法,并通过真实语料的测试来检验效果.结果表明,当前算法可以使中文语音识别正确率提高29%.
- Abstract:
-
Pragmatic information is looked on as the next focus for natural langu age processing (NLP) research. The necessity and importance of pragmatic informa tion are analyzed firstly. It is pointed out that NLP could be charaterized as h umanity and intelligence only after pragmatic information are integrated into it . And only when syntactic, semantic and pragmatic information are all fully stud ied could computers understand the information expressed in human natural language. Thus computers could really communicate with human. Then details of pr agmatics research are introduced, including its origin, growing history and appl ications. Problems are also analyzed for its current status. As a result, pragma tic information based NLP is put forward. Then a grope research of this, i.e. th e sentence error detection and correction in the application domain of “CityGui de” Speech Recognition (SR) interface is reported. The “CityGuide” is a demo terminal for the National 863 project of “Olympics Oriented Multilingual Inform ation Service”. A method containing pragmatic information analysis is studied a nd tested using realistic corpus. Results show that the precision of Chinese SR can be improved by 29%.
备注/Memo
收稿日期:2006-05-16.
基金项目:国家自然科学基金资助项目(60575034);国家“863 ”资助项目(2004AA117010,2005AA117010)
作者简介:李 蕾,女,1974年生,讲师,毕业于北京邮电大学.主要研究方向为自然语言处理及信息抽取.发表学术论文多篇.
E-mail:lilei@nlu.caai.cn.
周延泉,女,1970年,副教授,毕业于西北工业大学.研究方向为智能信息处理移动信息服务,发表学术论文多篇.
钟义信,男,1940年生,教授,博士生导师,中国人工智能学会理事长.主要研究方向为信息科学、人工智能和神经网络,已在国内外发表多篇著作和论文.?
更新日期/Last Update:
2009-05-04