广西师范大学学报(自然科学版) ›› 2011, Vol. 29 ›› Issue (1): 87-91.

• • 上一篇    下一篇

半监督聚类中成对约束的主动学习

杨洋, 王立宏   

  1. 烟台大学计算机学院,山东烟台264005
  • 收稿日期:2010-12-08 发布日期:2018-11-16
  • 通讯作者: 王立宏(1970—),女,吉林镇赉人,烟台大学教授,博士。E-mail: wanglh000@163.com
  • 基金资助:
    国家自然科学基金资助项目(61070118)

Active Learning of Pair-wise Constraints in Semi-supervised Clustering

YANG Yang, WANG Li-hong   

  1. College of Computer Science and Technology,Yantai University,Yantai Shandong 264005,China
  • Received:2010-12-08 Published:2018-11-16

摘要: 本文提出一种纠错式主动学习成对约束的方法,探讨了主动学习的停止条件,在较少的约束下可得到较好的聚类结果。通过在UCI基准数据集以及人工数据集的实验表明,在该学习策略下,半监督聚类算法的性能好于对比算法;在停止条件下,每个数据集的聚类结果都是可接受的。

关键词: 半监督聚类, 主动式学习, 监督信息

Abstract: An active learning method of pair-wise constraints based on error correction is proposed in this paper,and stopping criterion is also presented in order to get better clustering result with less pair-wise constraints.Experiments on the UCI benchmark datasets and artificial datasets show that theperformance of semi-supervised clustering algorithm with the proposed strategyis better than that of compared strategies.In addition,the clustering result of each tested dataset is acceptable under the stopping criterion.

Key words: semi-supervised clustering, active learning, supervision information

中图分类号: 

  • TP181
[1] DAN K,SEPANDAR D K,CHRISTOPHER D M.From instance level constraintsto space-level constraints:making the most of prior knowledge in data clustering[C]//Proc of the 19th International Conference on Machine Learning (ICML 2002).San Fransisco:Morgan Kaufmann Publishers,2002:307-314.
[2] BASU S,BANERJEE A.Active semi-supervised for pairwise constrained clustering[C]//Proc of the 4th SIAM International Conference on Data Mining.Philadelphia:Society for Industrial Mathematics,2004:333-344.
[3] 王娜,李霞.基于监督信息特性的主动半监督谱聚类算法[J].电子学报,2010,38(1):172-176.
[4] BURR S.Active learning literature survey[EB/OL].(2010-01-26)[2010-11-06].http://www.cs.cmu.edu/~bsettles/pub/settles.activelearning.pdf.
[5] DAVIDSON I,WAGSTAFF K.Measuring constraint-set utility for partitional clustering algorithms[M]//Lecture Notes in Computer Science Vol4213.Berlin:Springer,2006:115-125.
[6] VLACHOS A.A stopping criterion for active learning[J].Computer,Speech and Language,2008,22(3):295-312.
[7] 王玲,薄列峰,焦李成.密度敏感的半监督谱聚类[J].软件学报,2007,18(10):2412-242.
[8] 屈婉玲,耿素云,张立昂.离散数学[M].北京:高等教育出版社,2008
[1] 黎佳, 王明文, 何世柱, 柯丽. 基于特征加权的半监督聚类研究[J]. 广西师范大学学报(自然科学版), 2011, 29(1): 92-97.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!
版权所有 © 广西师范大学学报(自然科学版)编辑部
地址:广西桂林市三里店育才路15号 邮编:541004
电话:0773-5857325 E-mail: gxsdzkb@mailbox.gxnu.edu.cn
本系统由北京玛格泰克科技发展有限公司设计开发