[1] Munire·Muhetaer, LI Xiao, YANG Yating, et al. Affix-based key technology for Uyghur proverb recognition[J]. CAAI Transactions on Intelligent Systems, 2018, 13(3): 452-457. [doi:10.11992/tis.201706092]
CAAI Transactions on Intelligent Systems [ISSN 1673-4785/CN 23-1538/TP]
Volume: 13
Issue: 3 (2018)
Pages: 452-457
Column: Academic Papers (Natural Language Processing and Understanding)
Publication date: 2018-05-05
- Title:
Affix-based key technology for Uyghur proverb recognition
- Author(s):
Munire·Muhetaer (1, 2, 3); LI Xiao (1, 2); YANG Yating (1, 2); AZRAGUL (4); ZHOU Xi (1, 2)
1. Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi 830011, China;
2. Xinjiang Key Laboratory of Minority Speech and Language Information Processing, Urumqi 830011, China;
3. University of Chinese Academy of Sciences, Beijing 100049, China;
4. School of Computer Science and Technology, Xinjiang Normal University, Urumqi 830054, China
- Keywords:
Uyghur proverbs; proverbs affix; proverb rules; coverage rate of affix; proverb rule bases; proverb corpus; recognition system
- CLC:
TP391.1
- DOI:
10.11992/tis.201706092
- Abstract:
In natural language processing fields such as natural language understanding, machine translation, and public opinion analysis, Uyghur proverb recognition is an important part of text entity recognition. To meet the need for digitizing Uyghur proverbs, this paper establishes a relatively complete corpus of Uyghur proverbs. The grammatical and semantic structure of Uyghur proverbs was analyzed from the perspective of traditional linguistics, and a knowledge base was constructed that comprises the functional components (affixes) of Uyghur proverbs and conforms to Uyghur proverb rules. The knowledge base was then combined with natural language processing techniques to build a software system that recognizes Uyghur proverbs in text and translates between Chinese and Uyghur. The system also lays a new foundation for computer understanding and processing of the Uyghur language and script.