[1]李春英,汤庸,陈国华,等.面向学术社区的专家推荐模型[J].智能系统学报,2012,7(4):365-369.
LI Chunying,TANG Yong,CHEN Guohua,et al.Research on an expert recommendation model based on the scholar community SCHOLAT[J].CAAI Transactions on Intelligent Systems,2012,7(4):365-369.
点击复制
《智能系统学报》[ISSN 1673-4785/CN 23-1538/TP] 卷:
7
期数:
2012年第4期
页码:
365-369
栏目:
学术论文—知识工程
出版日期:
2012-08-25
- Title:
-
Research on an expert recommendation model based on the scholar community SCHOLAT
- 文章编号:
-
1673-4785(2012)04-0365-05
- 作者:
-
李春英1,汤庸2,陈国华2 ,汤志康3
-
1.肇庆学院 计算机学院,广东 肇庆 526061;
2.华南师范大学 计算机学院,广东 广州 510631;
3.广东技术师范学院 计算机学院,广东 广州 510665
- Author(s):
-
LI Chunying1, TANG Yong2, CHEN Guohua2, TANG Zhikang3
-
1.School of Computer, Zhaoqing University, Zhaoqing 526061, China;
2.School of Computer Science, South China Normal University, Guangzhou 510631, China;
?3.School of Computer Science, Guangdong Polytechnic Normal University, Guangzhou 510665, China
-
- 关键词:
-
学术专家推荐; H参数; 概率主题模型; 查询扩展
- Keywords:
-
expert recommendation; H index; probabilistic topic model; query expansion
- 分类号:
-
TP393
- 文献标志码:
-
A
- 摘要:
-
在学术社区提供的服务中,对于研究者特别是青年研究者来说,专家推荐是一个必不可少的部分.目前提供学术信息服务的所有中文搜索引擎中,都没有提供用户感兴趣的专家推荐服务.因此,提出了一个面向学术社区的专家推荐模型.使用改进的H参数对学者n年时间内发表的论文成果进行量化,获取专家列表;使用概率主题模型从作者发表的论文中提取主题向量作为学者的研究方向;根据矩阵奇异值分解对构建的词项〖KG-*1/3〗-〖KG-*1/3〗文档矩阵进行降维,进而生成词项〖KG-*1/3〗-〖KG-*1/3〗词项关系矩阵,实现对搜索关键词的查询扩展,并计算查询扩展向量与作者主题向量之间的相关度,根据相关度大小进行排序推荐.在SCHOLAT(学者网)数据集上验证模型的有效性,实验结果表明提出的模型达到了预期的效果.
- Abstract:
-
Among the services offered by the academic community, expert recommendation is an indispensable component for researchers, especially young researchers. At present, expert recommendation services have not been offered to users on all of the Chinese search engines offering academic information services. Thus, a scholar community oriented expert recommendation model was proposed. The Hindex was improved to quantify the achievements of a scholar based on the published papers in the last n years, and then the expert list was given based on the improved Hindex. The research interests of a researcher were obtained based on the topics extracted by the probabilistic topic model. In order to carry out high recall retrieval, a query expansion strategy was used: the singular value decomposition step was applied to the termdocument matrix to reduce the dimensionality of the matrix and obtain the termterm relationship matrix, and then the highly related terms were selected to make up the expanded query. Finally, the relevance between the expanded query and the scholar’s topic vectors was calculated and the results were represented in a descending order. An experiment was conducted on the dataset collected from an existing scholar community, SCHOLAT, to verify the effectiveness of the proposed model. The experimental results demonstrate that the proposed model produces the expected results.
备注/Memo
收稿日期: 2012-05-24.
网络出版日期:2012-07-20.
基金项目:国家自然科学基金资助项目(60970044);广东省科技计划资助项目(2010B010600031);广州市科技计划资助项目(2010JD00511).
通信作者:李春英.
E_mail:zqxylcy@163.com.
作者简介:
李春英,女,1978年生,讲师,CCF会员(E200019159M),主要研究方向为学术信息检索与推荐、人工智能.
?汤庸,男,1964年生,教授,博士生导师,博士,中国计算机学会协同计算专委会副主任,中国人工智能学会网络专委会副主任,广东省计算机学会常务副理事长,广东省网络文化协会副会长.主要研究方向为数据库、协同计算、云服务软件,发表学术论文多篇.
陈国华,男,1984年生,讲师,博士,主要研究方向为学术信息检索、机器学习.
更新日期/Last Update:
2012-09-26