The server is under maintenance between 08:00 to 12:00 (GMT+08:00), and please visit
later.
We apologize for any inconvenience caused
A clustering algorithm for scalable datasets based on semi-supervision technology
Author(s): SHEN Yan, SONG Shunlin, ZHU Yuquan
Pages: 372-
382
Year: 2011
Issue:
4
Journal: Journal of Nanjing University(Natural Sciences)
Keyword: scalable datasets clustering; semi-supervised clustering; mined data compression; data mining; kmeans;
Abstract: 待挖掘数据集规模的不断增长,以往的聚类算法由于需要多次扫描原始数据集而不再适用,现阶段,一遍扫描原始数据集即完成聚类的算法成为了首要的研究目标.但是,现有针对大规模数据集的算法容易受到初始化参数以及原始数据集分布的影响,聚类结果质量不高,并且也不稳定.对此,吸收半监督聚类的思想,提出了基于标记集的半监督一遍扫描K均值算法,该算法利用驻留主存的标记集指导聚类过程,使得聚类效率以及聚类结果的质量得到了...
Citations
System Exception