[1]ZHAO Yanwei,ZHU Fen,GUI Fangzhi,et al.Improved k-means algorithm based on extension distance[J].CAAI Transactions on Intelligent Systems,2020,15(2):344-351.[doi:10.11992/tis.201811020]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
15
Number of periods:
2020 2
Page number:
344-351
Column:
学术论文—人工智能基础
Public date:
2020-03-05
- Title:
-
Improved k-means algorithm based on extension distance
- Author(s):
-
ZHAO Yanwei1; ZHU Fen1; GUI Fangzhi1; REN Shedong2; XIE Zhiwei1; XU Chen1
-
1. Key Lab of Special Purpose Equipment and Advanced Manufacturing Technology, Ministry of Education & Zhejiang Province, Zhejiang University of Technology, Hangzhou 310014, China;
2. College of Computer Science and Technology, Zhejiang University of T
-
- Keywords:
-
extension distance; k-means clustering algorithm; scaling factor; initial cluster center; intensity; alienation
- CLC:
-
TP181
- DOI:
-
10.11992/tis.201811020
- Abstract:
-
An improved k -means algorithm optimizing the initial cluster centers based on extension distance was proposed to solve several problems that lead to clustering imbalance of the algorithm, such as the poor quality of initial cluster center selection or the first initial cluster center easily falling into the non-dense area of the data boundary. First, the classical distance of the sample was mapped onto the extension interval, and the extension left-side and right-side distances were obtained using the extension distance calculation method. Then, the average extension side distance was determined, and the extension left-side and right-side distances were taken as the quantitative indicators of sample density and cluster center distance, respectively. Subsequently, the selection criteria of the initial cluster center were given. Finally, compared with the traditional k-means algorithm, the improved k-means algorithm obtained higher clustering accuracy and better balance, particularly in high-dimensional data clustering.