Representative Points and Cluster Attributes Based Incremental Sequence Clustering Algorithm

Authors

  • Di Wu Department of Information and Electronic Engineering, Hebei University of Engineering
  • Jiadong Ren College of Information Science and Engineering, Yanshan University

Keywords:

sequence clustering, incremental clustering, representative point, cluster attributes, similarity

Abstract

In order to improve the execution time and clustering quality of sequence clustering algorithm in large-scale dynamic dataset, a novel algorithm RPCAISC (Representative Points and Cluster Attributes Based Incremental Sequence Clustering) was presented. In this paper, density factor is defined. The primary representative point that has a density factor less than the prescribed threshold will be deleted directly. New representative points can be reselected from non-representative points. Moreover, the representative points of each cluster are modeled using the K-nearest neighbor method. The definition of the relevant degree (RD) between clusters was also proposed. The RD is computed by comprehensively considering the correlations of objects within a cluster and between different clusters. Then, whether the two clusters need to merge is determined. Additionally, the cluster attributes of the initial clustering are retained with this process. By calculating the matching degree between the incremental sequence and the existing cluster attributes, dynamic sequence clustering can be achieved. The theoretic experimental results and analysis prove that RPCAISC has better correct rate of clustering results and execution efficiency.

Downloads

Download data is not yet available.

Downloads

Published

2018-02-09

How to Cite

Wu, D., & Ren, J. (2018). Representative Points and Cluster Attributes Based Incremental Sequence Clustering Algorithm. COMPUTING AND INFORMATICS, 36(6), 1361–1384. Retrieved from https://www.cai.sk/ojs/index.php/cai/article/view/2017_6_1361