[1]JIAN Ping, ZONG Cheng-qing. A new approach to identifying Chinese maximal-length phrases using bidirectional labeling[J]. CAAI Transactions on Intelligent Systems, 2009, 4(5): 406-413. [doi:10.3969/j.issn.1673-4785.2009.05.004]

A new approach to identifying Chinese maximal-length phrases using bidirectional labeling

References:
[1]XUE Nanwen, XIA Fei, CHIOU Fudong, et al. The Penn Chinese Treebank: phrase structure annotation of a large corpus[J]. Natural Language Engineering, 2005, 11(2): 207-238.
[2]LI Wenjie, ZHOU Ming, PAN Haihua, et al. Corpus-based maximal-length Chinese noun phrases extraction[C]//Advances and Applications on Computational Linguistics. Beijing: Tsinghua University Press, 1995: 119-124. (in Chinese)
[3]ZHOU Qiang, SUN Maosong, HUANG Changning. Automatic identification of Chinese maximal noun phrases[J]. Journal of Software, 2000, 11(2): 195-201. (in Chinese)
[4]WANG Lixia, SUN Honglin. Automatic recognition of prepositional phrases in Chinese[J]. Journal of Chinese Information Processing, 2005, 19(3): 80-86. (in Chinese)
[5]GAN Junwei, HUANG Degen. Automatic identification of Chinese prepositional phrase[J]. Journal of Chinese Information Processing, 2005, 19(4): 17-23. (in Chinese)
[6]ZHOU Guodong, SU Jian, TEY Tongguan. Hybrid text chunking[C]//Proceedings of the 2nd Workshop on Learning Language in Logic and the 4th Conference on Computational Natural Language Learning. Lisbon, Portugal, 2000: 163-165.
[7]KUDO T, MATSUMOTO Y. Chunking with support vector machines[C]//Proceedings of the North American Chapter of the Association for Computational Linguistics. Pittsburgh, USA, 2001: 192-199.
[8]SHA Fei, PEREIRA F. Shallow parsing with conditional random fields[C]//Proceedings of the North American Chapter of the Association for Computational Linguistics. Edmonton, Canada, 2003: 213-220.
[9]BAI Xuemei, LI Jinji, KIM Dongil, et al. Identification of maximal-length noun phrases based on expanded chunks and classified punctuations in Chinese[C]//Proceedings of International Conference on Computer Processing of Oriental Languages. Singapore, 2006: 268-276.
[10]FENG Chong, CHEN Zhaoxiong, HUANG Heyan, et al. Recognition of complex maximal length noun phrase using conditional random fields[J]. Mini-Micro Systems, 2006, 27(6): 1134-1139. (in Chinese)
[11]TJONG KIM SANG E F. Noun phrase recognition by system combination[C]//Proceedings of the North American Chapter of the Association for Computational Linguistics. Seattle, USA, 2000: 50-55.
[12]CHEN Wenliang, ZHANG Yujie, ISAHARA H. An empirical study of Chinese chunking[C]//Proceedings of the Joint Conference of the International Committee on Computational Linguistics and the Association for Computational Linguistics. Sydney, Australia, 2006: 97-104.
[13]LEE Linshan, LIN Longji, CHEN Kehjiann. An efficient natural language processing system specially designed for the Chinese language[J]. Computational Linguistics, 1991, 17(4): 347-374.
[14]WU Yuchieh, YANG Jiechi, LEE Yueshi, et al. Efficient and robust phrase chunking using support vector machines[C]//Proceedings of Asia Information Retrieval Symposium. Singapore, 2006: 350-361.
[15]RATNAPARKHI A. A maximum entropy model for part-of-speech tagging[C]//Proceedings of the Empirical Methods in Natural Language Processing. New Brunswick, USA, 1996: 133-142.
[16]MCCALLUM A, FREITAG D, PEREIRA F. Maximum entropy Markov models for information extraction and segmentation[C]//Proceedings of the International Conference on Machine Learning. Stanford, USA, 2000: 591-598.
[17]TAN Yongmei, YAO Tianshun, CHEN Qing, et al. Applying conditional random fields to Chinese shallow parsing[C]// Proceedings of the Conference on Intelligent Text Processing and Computational Linguistics. Mexico City, Mexico, 2005: 167-176.
[18]ZONG Chengqing. Statistical natural language processing[M]. Beijing: Tsinghua University Press, 2008: 175-177, 179-181. (in Chinese)
[19]KITTLER J, HATEF M, DUIN R P W, et al. On combining classifiers[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1998, 20(3): 226-239.
[20]TJONG KIM SANG E F, VEENSTRA J. Representing text chunks[C]// Proceedings of European Chapter of the Association for Computational Linguistics. Bergen, Norway, 1999: 173-179.
[21]KUDO T. YamCha: Yet another multipurpose chunk annotator[EB/OL]. (2005-09-05)[2009-02-25]. http://www.chasen.org/~taku/software/yamcha/.
[22]KUDO T. CRF++: Yet another CRF toolkit[EB/OL]. (2007-03-07)[2009-02-25]. http://crfpp.sourceforge.net/.
[23]QIAN X. Pocket CRF[EB/OL]. (2008-08-05)[2009-02-25]. http://sourceforge.net/projects/pocket-crf-1/files/.

Memo

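About the authors:
JIAN Ping, female, born in 1982, Ph.D. candidate. Her main research interests are natural language processing and dependency parsing.
ZONG Cheng-qing, male, born in 1963, research professor and doctoral supervisor. He is deputy director of the National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences; associate editor of the international journal IEEE Intelligent Systems; invited academic advisor and chair professor at Tsinghua University; adjunct professor at the Graduate University of the Chinese Academy of Sciences; executive board member of the Asian Federation of Natural Language Processing (AFNLP); council member of the Chinese Association for Artificial Intelligence and deputy director of its Natural Language Processing Technical Committee; and council member of the Chinese Information Processing Society of China and deputy director of its Machine Translation Technical Committee. He has served as program committee chair or member for a number of international conferences. His main research interests are the theory and methods of natural language processing, machine translation, and human-machine dialogue. As principal investigator he has led more than 10 national and international cooperation projects and has filed several national invention patent applications. He has published more than 70 academic papers and one academic monograph.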

Last Update: 2009-12-29