[1]何力,谭霜,贾焰,等.基于无标记Web数据的层次式文本分类[J].智能系统学报,2014,9(3):330-335.[doi:10.3969/j.issn.1673-4785.201310014]
 HE Li,TAN Shuang,JIA Yan,et al.Hierarchical text classification with non-labeled web data[J].CAAI Transactions on Intelligent Systems,2014,9(3):330-335.[doi:10.3969/j.issn.1673-4785.201310014]
点击复制

基于无标记Web数据的层次式文本分类

参考文献/References:
[1] CHEN Y, LI Z, NIE L, et al. A semi-supervised bayesian network model for microblog topic classification[C]//Proceedings of the 24th International Conference on Computational Linguistics. Mumbai, India, 2012:561-576.
[2] HA-THUC V, RENDERS J M. Large-scale hierarchical text classification without labelled data[C]//Proceedings of the fourth ACM International Conference on Web Search and Data Mining. Hong Kong, China, 2011:685-694.
[3] WETZKER R, ALPCAN T, BAUCKHAGE C, et al. An unsupervised hierarchical approach to document categorization[C]//Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence. Silicon Valley, USA, 2007:482-486.
[4] ZHANG C, XUE G R, YU Y. Knowledge supervised text classification with no labeled documents[C]//Proceedings of the 10th Pacific Rim International Conference on Artificial Intelligence. Hanoi, Vietnam, 2008:509-520.
[5] HUANG C C, CHUANG S L, CHIEN L F. Liveclassifier:creating hierarchical text classifiers through Web corpora[C]//Proceedings of the 13th International Conference on World Wide Web. New York, USA, 2004:184-192.
[6] WANG P, DOMENICONI C. Towards a universal text classifier:transfer learning using encyclopedic knowledge[C]//Proceedings of the Ninth IEEE International Conference on Data Mining Workshops. Miami, USA, 2009:435-440.
[7] HUNG C M, CHIEN L F. Web-based text classification in the absence of manually labeled training documents[J]. Journal of the American Society for Information Science and Technology, 2007, 58(1):88-96.
[8] HUNG C M, CHIEN L F. Text classification using Web corpora and em algorithms[C]//Proceedings of the Asia Information Retrieval Symposium. Beijing, China, 2005:12-23.
[9] 刘丽珍, 宋瀚涛, 陆玉昌. 无标记训练样本的Web文本分类方法[J]. 计算机科学, 2006, 33(3):200-201.LIU Lizhen, SONG Hantao, LU Yuchang. The method of Web text classification of using non-labeled training sample[J]. Computer Science, 2006, 33(3):200-201.
[10] WEISS G M. Mining with rarity:a unifying framework[J]. ACM SIGKDD Explorations Newsletter, 2004, 6(1):7-19.
[11] CHEN S, HE H, GARCIA E A. Ramoboost:ranked minority oversampling in boosting[J]. Neural Networks, IEEE Transactions on. 2010, 21(10):1624-1642.
[12] NGUYEN H M, COOPER E W, KAMEI K. Borderline over-sampling for imbalanced data classification[J]. International Journal of Knowledge Engineering and Soft Data Paradigms, 2011, 3(1):4-21.
[13] GAO M, HONG X, CHEN S, et al. A combined smote and pso based rbf classifier for two-class imbalanced problems[J]. Neurocomputing, 2011, 74(17):3456-3466.
[14] KO Y, SEO J. Learning with unlabeled data for text categorization using bootstrapping and feature projection techniques[C]//Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics. Barcelona, Spain, 2004:255-262.
[15] VEERAMACHANENI S, SONA D, AVESANI P. Hierarchical dirichlet model for document classification[C]//Proceedings of the 22nd International Conference on Machine Learning. Bonn, Germany, 2005:928-935.
[16] CAI L, HOFMANN T. Hierarchical document categoriza-tion with support vector machines[C]//Proceedings of the thirteenth ACM International Conference on Information and Knowledge Management. Washington, DC, USA, 2004:78-87.
[17] FAN R E, CHANG K W, HSIEH C J, et al. Liblinear:a library for large linear classification[J]. Journal of Machine Learning Research, 2008, 9:1871-1874.

备注/Memo

收稿日期:2014-03-25。
基金项目:国家"863"计划资助项目(2010AA012505, 2011AA010702, 2012AA01A401, 2012AA01A402);国家重点基础研究发展计划资助项目(2013CB329601, 2013CB329602);国家自然科学基金资助项目(60933005, 91124002);国家科技支撑计划资助项目(2012BAH38B04);国家242信息安全计划资助项目(2011A010)
作者简介:谭霜,男,1984年生,博士研究生,主要研究方向为网络与信息安全、云计算,发表学术论文5篇;贾焰,女,1960年生,教授,博士生导师,主要研究方向为网络与信息安全、数据库与数据挖掘、社会网络,发表的学术论文被SCI和EI检索200余篇。
通讯作者:何力,男,1984年生,博士研究生,主要研究方向为网络与信息安全、数据库与数据挖掘,发表学术论文6篇,E-mail:lihe@nudt.edu.cn。

更新日期/Last Update: 1900-01-01
Copyright © 《 智能系统学报》 编辑部
地址:(150001)黑龙江省哈尔滨市南岗区南通大街145-1号楼 电话:0451- 82534001、82518134 邮箱:tis@vip.sina.com