[1]LIU Beibei,MA Runing,DING Jundi.Density-based statistical merging clustering algorithm[J].CAAI Transactions on Intelligent Systems,2015,10(5):712-721.[doi:10.11992/tis.201410028]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
10
Number of periods:
2015 5
Page number:
712-721
Column:
学术论文—机器学习
Public date:
2015-10-25
- Title:
-
Density-based statistical merging clustering algorithm
- Author(s):
-
LIU Beibei1; MA Runing1; DING Jundi2
-
1. College of Science, Nanjing University of Aeronautics and Astronautics, Nanjing 211100, China;
2. School of Computer Science and Technology, Nanjing University of Science and Technology, Nanjing 210094, China
-
- Keywords:
-
data points; density; random variable; merging; clustering algorithm; noise
- CLC:
-
O235;TP311
- DOI:
-
10.11992/tis.201410028
- Abstract:
-
The ability of existing clustering algorithms to deal with noise is poor, and the speed is slow, instead this paper proposes a density-based statistical merging clustering algorithm (DSMC). The new algorithm takes each group of data points as a set of independent random variables, and gathers statistical criteria from the independent bounded difference inequality. Meanwhile, combined with the density information of the data points, the DSMC algorithm takes the descending order of the density as the merging order in the process of condensation, and thereby achieves statistical merging of different types of data points. The experimental results with both artificial datasets and real datasets show that the DSMC algorithm can not only deal with convex data set, and also has good clustering effects on nonconvex shaped, overlapped and noisy, data sets. This proves that the algorithm has good applicability and validity.