Journal of Guangxi Normal University(Natural Science Edition) ›› 2011, Vol. 29 ›› Issue (1): 157-161.

Previous Articles     Next Articles

Method of Chinese Expert Entity Homepage Recognition

LI Li-na1, YU Zheng-tao1,2, WANG Ya-sheng1, MAO Cun-li1,2, GUO Jian-yi1,2   

  1. 1.College of Information Engineering and Automation,Kunming University of Science and Technology,Kunming Yunnan 650051,China;
    2.Institute of Intelligent Information Processing,Computer TechnologyApplication Key Laboratory of Yunnan Province,Kunming Yunnan 650051,China
  • Received:2010-12-29 Online:2007-03-25 Published:2018-11-16

Abstract: Expert Entity Homepage Recognition is one of the keypoints in expert search.In this paper,a method based on J48 is proposed.2113 Chinese expert entities and the corresponding entity homepages are collectedby analyzing the expert resources,and the expert entity features relatedto the features of link and webpage content are defined.Besides,these features are also extractedto form a training data set;and then different learning algorithms with different features are adopted to recognize the expert homepage for finding the most effectiveclassification features and homepage recognition learning algorithm.The experiment results show that the best method is obtained by using J48 algorithm,specifically,when the features of link and webpage content are combined with,theexpert homepage recognition accuracy rate reaches 81.05%.

Key words: Chinese expert entity, homepage recognition, link feature, Webpage feature, J48

CLC Number: 

  • TP391.3
[1] 陆伟,韩曙光.组织专家的检索系统设计与实现[J].情报学报,2008,27(5):657-663.
[2] DAVENPORT T.Knowledge management at hewlett packard,center forbusiness innovation[EB/OL].(1996)[2010-06-09].http://www.businessinnovation.ey.com/research/researchf.html.
[3] BALOG K,AZZOPARDI L,de RIJKE M.Formal models for expert finding inenterprise corpora[C]//Proceedings of the 29th Annual International ACM SIGIRConference on Research and Development in Information Retrieval.New York:ACM Press,2006:43-50.
[4] CAMPBELL C S,MAGLIO P P,COZZI A,et al.Expertise identification using email communications[C]//Proceedings of the twelfth international conference on Information and knowledge management.New York:ACM Pres,2003:528-531.
[5] YI Fang,LUO Si,MATHUR A.FacFinder:search for expertise in academicinstitutions:technical report SERC-TR-294[R].West Lafayette,IN:Department of Computer Science,Purdue University,2008.
[6] DAVENPORT T H,PRUSAK L.Working knowledge:how organizations managewhat they know[M].Boston,MA:Harvard Business School Press,1998.
[7] LIN Ching-yung,GRIFFITHS-FISHER V,EHRLICH K,et al.SmallBlue:people mining for expertise search and social network analysis[J].IEEE MultimediaMagazine,2008,15(1):78-84.
[8] HAN Jia-wei,KAMBER M.Data mining:concepts and techniques[M].2nded.San Francisco:Morgan Kaufman Publishers,2000.
[1] LI Shuang-qun, XU Jiu-cheng, ZHANG Ling-jun, LI Xiao-yan. Color Image Retrieval Based on Tolerance Granules [J]. Journal of Guangxi Normal University(Natural Science Edition), 2011, 29(3): 173-178.
[2] XIA Tian. Content Extraction of Web Page Based on Extended Label Tree [J]. Journal of Guangxi Normal University(Natural Science Edition), 2011, 29(1): 133-137.
[3] CUI Lin-wei, SU Wei, GUO Wei, LI Lian. Extraction of Web Mathematical Formulas Based on Nutch [J]. Journal of Guangxi Normal University(Natural Science Edition), 2011, 29(1): 167-172.
[4] TANG Nan, YANG Zhi-hao, WU Jia-jin, WANG Yan-hua, LIN Hong-fei. Method of Predicting Protein Complex Based on Supervised Learning [J]. Journal of Guangxi Normal University(Natural Science Edition), 2011, 29(2): 174-179.
[5] LUO Xin, PAN Qiao, WANG Hong-ya, CHEN Mei, KITA Kenji. Realization of High-speed Image Search Based on SOFM [J]. Journal of Guangxi Normal University(Natural Science Edition), 2011, 29(2): 180-184.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!