Journal of Guangxi Normal University(Natural Science Edition) ›› 2011, Vol. 29 ›› Issue (2): 151-155.

Previous Articles     Next Articles

Algorithm to Cluster Search Results Based on Frequent Itemsets

SHA Bei-bei, XIE Li-cong   

  1. College of Mathematics and Computer Science,Fuzhou University,Fuzhou Fujian 350002,China
  • Received:2011-05-18 Published:2018-11-19

Abstract: Clustering method of search engines can help the users locate the relevant information quickly and efficiently.A method is proposed TS-FIC algorithm which takes the frequent itemsets mining from association rules as class label,and then organizes the initial cluster into the tree structure using thesemantic relations among frequent item sets.When the final cluster is formed,thesemantic similarity is introduced as an approach to compute the class similarity.Finally,by means of a novel ordering scheme,the ordered results can be displayed to the users.The simulation results demonstrate that the proposed algorithmis of certain feasibility and has excellent performance in terms of efficiency and accuracy.

Key words: clustering, association rule, semantic similarity, class labels

CLC Number: 

  • TP391
[1] JANSEN B J,SPINK A,BATEMAN J,et al.Real life information retrieval:a study of user queries on the web[J].SIGIR Forum,1998,32(1):5-17.
[2] ZAMIR O,ETZIONI O.Grouper:a dynamic clustering interface to websearch results[J].Computer Networks,1999,31(11/16):1361-1374.
[3] ZHANG Dell,DONG Yi-sheng.Semantic,hierarchical,online clustering of web search results[M].JEFFREY X X,LIN Xue-min,LU Hong-jun,et al.Advanced Web Technologies and Applications.Berlin Heidelberg:Springer-Verlag,2004:69-78.
[4] Vivisimo Inc.Vivisimo technology & innovation overview[EB/OL].[2011-05-18].http://vivisimo.com/technology/technology.html.
[5] 肖欣延,张东站,高君杰,等.一种新的Web检索结果聚类方法[J].计算机研究与发展,2007,44(S2):79-83.
[6] 宋擒豹,沈钧毅.基于关联规则的Web文档聚类算法[J].软件学报,2002,13(3):417-423.
[7] 宋春芳,石冰.一种基于关联规则的搜索引擎结果聚类算法[J].山东大学学报:理学版,2006,41(3):68-72.
[8] 钱功伟,倪林,田甜,等.带聚类处理的元搜索引擎的设计与实现[J].计算机工程与应用,2007,43(22):182-185.
[1] WANG Xun, LI Tinghui, PAN Xiao, TIAN Yu. Image Segmentation Method Based on Improved Fuzzy C-means Clustering and Otsu Maximum Variance [J]. Journal of Guangxi Normal University(Natural Science Edition), 2019, 37(4): 68-73.
[2] SU Lei, LI Junying. Discussion on Classification Standard of Eco-environment Quality in Counties of National Key Eco-functional Areas [J]. Journal of Guangxi Normal University(Natural Science Edition), 2019, 37(3): 196-202.
[3] LIU Jinlong,GUO Yan, YU Zhihua, LIU Yue,YU Xiaoming,CHENGXueqi. A New Method to Detect Busty Events with Different Media Data Based on Word Clustering [J]. Journal of Guangxi Normal University(Natural Science Edition), 2019, 37(1): 23-31.
[4] LIN Yue, LIU Tingzhang, HUANG Lirong, XI Xiaoye, PAN Jian. Anomalous State Detection of Power Transformer Basedon Bidirectional KL Distance Clustering Algorithm [J]. Journal of Guangxi Normal University(Natural Science Edition), 2018, 36(4): 20-26.
[5] LIN Yue. The Fault Diagnosis of Charging Piles Based on Hybrid AP-HMM Model [J]. Journal of Guangxi Normal University(Natural Science Edition), 2018, 36(1): 25-33.
[6] YAN Yan, HU Baoqing, HOU Manfu, SHI Shana. Suitability Assessment of Karst Rocky Desertification Control Patternsin Karst Counties of Guangxi, China [J]. Journal of Guangxi Normal University(Natural Science Edition), 2017, 35(4): 145-153.
[7] TANG Qiling, CHEN Zhilin, ZHOU Shanyi. Geographic Division of Chinese Ants (Hymenoptera: Formicidae) Based on Generic Category [J]. Journal of Guangxi Normal University(Natural Science Edition), 2017, 35(1): 82-91.
[8] SHI Ya-bing, HUANG Yu, QIN Xiao, YUAN Chang-an. K-Means Clustering Algorithm Based on a Novel Approach for Improved Initial Seeds [J]. Journal of Guangxi Normal University(Natural Science Edition), 2013, 31(4): 33-40.
[9] CAO Yong-chun, SHAO Ya-bin, TIAN Shuang-liang, CAI Zheng-qi. A Clustering Method Based on Immune Genetic Algorithm [J]. Journal of Guangxi Normal University(Natural Science Edition), 2013, 31(3): 59-64.
[10] MA Jing, ZOU Yan-li, LI Fu-tao, MO Yu-fang. Limited-maximum-degree LBA Network Model [J]. Journal of Guangxi Normal University(Natural Science Edition), 2011, 29(4): 21-24.
[11] ZHENG Lei, ZHU Zheng-li, HOU Ying-kun. Deployment Strategy of Wireless Sensor Network Nodes Based on Improved Particle Swarm Optimization [J]. Journal of Guangxi Normal University(Natural Science Edition), 2011, 29(4): 56-62.
[12] SHEN Ze-hao, YE Zhong-xing. Fuzzy Clustering Analysis of Customer Credit Risk of Futures Company [J]. Journal of Guangxi Normal University(Natural Science Edition), 2011, 29(3): 101-104.
[13] XU Li, DING Shi-fei, GUO Feng-feng. A Rough Kernel Clustering Algorithm Based on ImprovedAttribute Reduction [J]. Journal of Guangxi Normal University(Natural Science Edition), 2011, 29(3): 105-109.
[14] ZHOU Xin, HAO Zhi-feng, CAI Rui-chu, WEN Wen. Text Clustering with Noise and It's Application in Anti-spam Systems [J]. Journal of Guangxi Normal University(Natural Science Edition), 2011, 29(2): 156-160.
[15] GAO Shi-jian, WANG Li-zhen, FENG Ling, CHEN Hong-mei. Co-location Patterns Mining Based on Agglomerative Hierarchical Clustering [J]. Journal of Guangxi Normal University(Natural Science Edition), 2011, 29(2): 167-173.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!