<-上一篇/Previous Article 下一篇/Next Article->

[1]穆妮热·穆合塔尔,李晓,杨雅婷,等.基于词缀的维吾尔谚语识别关键技术研究[J].智能系统学报,2018,13(3):452-457.[doi:10.11992/tis.201706092]
　Munire·Muhetaer,LI Xiao,YANG Yating,et al.Affix-based key technology for Uyghur proverb recognition[J].CAAI Transactions on Intelligent Systems,2018,13(3):452-457.[doi:10.11992/tis.201706092]

点击复制

基于词缀的维吾尔谚语识别关键技术研究

PDF下载 HTML

《智能系统学报》[ISSN 1673-4785/CN 23-1538/TP] 卷: 13 期数: 2018年第3期页码: 452-457 栏目: 学术论文—自然语言处理与理解出版日期: 2018-05-05

Title:: Affix-based key technology for Uyghur proverb recognition

作者:: 穆妮热·穆合塔尔^1,2,3, 李晓^1,2, 杨雅婷^1,2, 艾孜尔古丽⁴, 周喜^1,2; 1. 中国科学院新疆理化技术研究所, 新疆乌鲁木齐 830011;
2. 新疆民族语音语言信息处理实验室, 新疆乌鲁木齐 830011;
3. 中国科学院大学, 北京 100049;
4. 新疆师范大学计算机科学技术学院, 新疆乌鲁木齐 830054

Author(s):: Munire·Muhetaer^1,2,3, LI Xiao^1,2, YANG Yating^1,2, AZRAGUL⁴, ZHOU Xi^1,2; 1. Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Science, Urumqi 830011, China;
2. Xinjiang Key Laboratory of Minority Speech and Language Information Processing, Urumqi 830011, China;
3. University of Chinese Academy of Science, Beijing 100049, China;
4. School of Computer Science and Technology, Xinjiang Normal University, Urumqi 830054, China

关键词:: 维吾尔谚语; 谚语词缀; 谚语规则; 词缀覆盖率; 谚语规则库; 谚语语料库; 识别系统

Keywords:: Uyghur proverbs; proverbs affix; proverb rules; coverage rate of affix; proverb rule bases; proverb corpus; recognition system

分类号:: TP391.1

DOI:: 10.11992/tis.201706092

摘要:: 在自然语言理解、机器翻译、舆情分析等自然语言处理领域中，维吾尔谚语识别是整个文本实体识别的重要组成部分。为满足维吾尔谚语信息化的需求，本文构建了比较完善的维吾尔谚语语料库。同时，从传统语言学角度对维吾尔谚语的语法、语义结构进行分析，构建了一个由维吾尔谚语功能语类（词缀）组成的、专属维吾尔谚语规则的知识库，并将此知识库与自然语言处理技术相结合，实现一个既能够从文本中识别出维吾尔谚语，又能提供维汉互译等功能的信息软件系统。该系统也为开展计算机理解与处理维吾尔文字奠定了一个崭新的基础。

Abstract:: In fields of natural language processing such as natural language understanding, machine translation, and public opinion analysis, Uyghur proverb recognition is an important part of the whole text entity recognition. To meet the need of Uyghur proverb informationization, this paper establishes a relatively complete corpus of Uyghur proverbs. The grammar and semantic structure of Uygur proverbs were analyzed from the perspective of traditional linguistics, and a knowledge base that comprises functional genres (affixes) of Uyghur proverbs and obeys Uyghur proverb rules was constructed. In addition, the knowledge base was combined with natural language processing technologies to realize an information software system that can recognize Uyghur proverbs from text and mutually translate between Chinese and Uyghur language. The system also laid a new foundation for understanding and processing Uyghur language and characters by computer.

参考文献/References:: [1] 杨艳, 何天宇. 基于短语的关系数据库关键词查询方法[J]. 计算机研究与发展, 2012, 49(S1):276-282. YANG Yan, HE Tianyu. Phrase-based keyword search over relational databases[J]. Computer research and development, 2012, 49(S1):276-282.
[2] 张红雨. 汉维谚语对比[D]. 重庆:西南大学, 2010:15-20. ZHANG Hongyu. The development of chemical curriculum comparision of Chinese Uighur proverbs[D]. Chongqing:Southwest University, 2010:15-20.
[3] 杨德明. 再论维吾尔谚语的翻译[J]. 民族翻译, 2014(4):81-86. YANG Deming. On the translation of Uyghur proverbs[J]. Minority translators journal, 2014(4):81-86.
[4] 马俊民. 论维语谚语的汉译[J]. 语言与翻译, 1985(3):47-52. MA Junmin. On the translation of Uyghur language into Chinese[J]. Language and translation, 1985(3):47-52.
[5] 芦风娟. 基于语料库的英语谚语及其变体使用研究[D]. 临汾:山西师范大学, 2013:12-15. LU Fengjuan. A corpus-based study on the Usage of English proverbs and their variations[D]. Linfen:Shanxi Normal University, 2013:12-15.
[6] 马俊民, 廖泽余. 维汉对照维吾尔谚语[M]. 乌鲁木齐:新疆人民出版社, 2007:1-300.
[7] 陈雪慧. 汉维谚语中比喻修辞对比研究[D]. 乌鲁木齐:新疆师范大学, 2013:8-12. CHEN Xuehui. Chinese proverbs and Uygur proverbs comparative study of figures of speech[D]. Urumchi:Xinjiang Normal University, 2013:8-12.
[8] 力提甫·托乎提. 从短语结构到最简方案:阿尔泰语言的句法结构[M]. 北京:中央民族大学出版社, 2004:80-140.
[9] 刘正江. 维吾尔语成语研究概述[J]. 喀什师范学院学报, 2004, 25(2):59-61. LIU Zhengjiang. The general research situation about idiom of Uygur language[J]. Journal of Kashgar teachers college, 2004, 25(2):59-61.
[10] 力提甫·托乎提. 维吾尔语及其他阿尔泰语言的生成句法研究[M]. 北京:民族出版社, 2001:60-120.
[11] 张勇. 维吾尔谚语研究[D]. 乌鲁木齐:新疆大学, 2005:10-97. ZHANG Yong. Study on Uygur proverbs[D]. Urumchi:Xinjiang University, 2005:10-97.
[12] 张勇. 维吾尔谚语研究视角与理论方法琐谈[J]. 新疆大学学报:社会科学版, 2004, 32(1):130-134. ZHANG Yong. On the perspective and theoretical methods of Uyghur proverb research[J]. Journal of Xinjiang university:social science edition, 2004, 32(1):130-134.
[13] 阿比达·吾买尔, 吐尔根·依布拉音. 维吾尔语句子边界识别算法的设计与实现[J]. 新疆大学学报:自然科学版, 2008(3):360-363. UMAR A, IBRAYIM T. Research and implementation of Uyghur sentence boundary detection[J]. Journal of Xinjiang university:natural science edition, 2008(3):360-363.
[14] 艾斯卡尔·亚克甫. 维吾尔谚语检索系统的研究与实现[D]. 青岛:中国海洋大学, 2012:34-70. YAKUP A. Research and realization of Uyghur ideom retriaval system[D]. Qingdao:Ocean University of China, 2012:34-70.
[15] 刘正江. 谈维吾尔语成语汉译的可译性限度[J]. 西北民族大学学报:哲学社会科学版, 2005(3):47-50. LIU Zhengjiang. On the translatability limits of Uyghur idioms in Chinese translation[J]. Journal of northwest university for nationalities:philosophy and social science, 2005(3):47-50.
[16] 艾孜尔古丽·玉素甫. 现代维吾尔语常用词计量研究[D]. 乌鲁木齐:新疆师范大学, 2013:40-48. AZRAGUL. The research on modern Uygur language common words[D]. Urumchi:Xinjiang Normal University, 2013:40-48.
[17] 艾孜尔古丽, 努尔艾合买提, 玉素甫·艾白都拉. 现代维吾尔语常用词统计关键技术研究[J]. 中文信息学报, 2014, 28(5):192-197. AZRAGUL, NURAHMAT, ABAYDULA Y. Research on key technology of modern Uyghur language common[J]. Journal of Chinese information processing, 2014, 28(5):192-197.
[18] 玉素甫·艾白都拉, 艾孜尔古丽, 祖丽皮亚. 基于网站用词调查的现代维吾尔语词长研究[J]. 计算机应用与软件, 2012, 29(5):32-34. ABAYDULA Y, AZRAGUL, ZULPIYA. Research on word length of modern Uyghur based on surveying website wording[J]. Computer applications and software, 2012, 29(5):32-34.
[19] 李燕萍. 从维吾尔语谚语中的复句看现代维吾尔语复句的演变[J]. 喀什师范学院学报, 2006, 27(5):57-60. LI Yanping. Observe and study the development orbit of the sentence of two clauses in the modern Uighur language from those are retained in the Uighur proverbs[J]. Journal of Kashgar teachers college, 2006, 27(5):57-60.
[20] 付东明, 陈得军. 维吾尔谚语研究趋势与反思[J]. 语言与翻译, 2014(1):29-33. FU Dongming, CHEN Dejun. Uygur proverb research trends and reflection[J]. Language and translation, 2014(1):29-33.

备注/Memo

收稿日期:2017-06-30。
基金项目:新疆维吾尔自治区重点实验室开放课题（2015KL031）；新疆维吾尔自治区重大科技专项课题（2016A03007-3）；新疆维吾尔自治区自然科学基金项目（2015211B034）；中科院战略性先导科技专项项目（XDA06030400）；新疆维吾尔自治区社会科学基金项目（2016CYY067）.
作者简介:穆妮热·穆合塔尔,女,1989年生,博士,主要研究方向为多种语言信息处理、机器翻译、自然语言处理;李晓,男,1957年生,研究员,主要研究方向为多语种信息处理、人工智能。主持或承担过多项国家863、中科院战略先导项目。发表学术论文60余篇;杨雅婷,女,1985年生,副研究员,博士,主要研究方向为机器翻译和自然语言处理。承担过多项国家863、中科院战略先导项目。发表学术论文30余篇。
通讯作者:李晓.E-mail:xiaoli@ms.xjb.ac.cn.

更新日期/Last Update: 2018-06-25

基于词缀的维吾尔谚语识别关键技术研究 PDF下载HTML

备注/Memo

基于词缀的维吾尔谚语识别关键技术研究

PDF下载 HTML