[1]周治平,王杰锋,朱书伟,等.一种改进的自适应快速AF-DBSCAN聚类算法[J].智能系统学报编辑部,2016,11(1):93-98.[doi:10.11992/tis.201410021]
ZHOU Zhiping,WANG Jiefeng,ZHU Shuwei,et al.An improved adaptive and fast AF-DBSCAN clustering algorithm[J].CAAI Transactions on Intelligent Systems,2016,11(1):93-98.[doi:10.11992/tis.201410021]
点击复制
《智能系统学报》编辑部[ISSN 1673-4785/CN 23-1538/TP] 卷:
11
期数:
2016年第1期
页码:
93-98
栏目:
学术论文—机器学习
出版日期:
2016-02-25
- Title:
-
An improved adaptive and fast AF-DBSCAN clustering algorithm
- 作者:
-
周治平, 王杰锋, 朱书伟, 孙子文
-
江南大学物联网工程学院, 江苏无锡 214122
- Author(s):
-
ZHOU Zhiping, WANG Jiefeng, ZHU Shuwei, SUN Ziwen
-
School of Internet of Things Engineering, Jiangnan University, Wuxi 214122, China
-
- 关键词:
-
密度聚类; DBSCAN; 区域查询; 全局参数; KNN分布; 数学统计分析
- Keywords:
-
density clustering; DBSCAN; region query; global parameters; KNN distribution; mathematical statistics and analysis
- 分类号:
-
TP181
- DOI:
-
10.11992/tis.201410021
- 摘要:
-
基于密度的DBSCAN聚类算法可以识别任意形状簇,但存在全局参数Eps与MinPts的选择需人工干预,采用的区域查询方式过程复杂且易丢失对象等问题,提出了一种改进的参数自适应以及区域快速查询的密度聚类算法。根据KNN分布与数学统计分析自适应计算出最优全局参数Eps与MinPts,避免聚类过程中的人工干预,实现了聚类过程的全自动化。通过改进种子代表对象选取方式进行区域查询,无需漏检操作,有效提高了聚类的效率。对4种典型数据集的密度聚类实验结果表明,本文算法使得聚类精度提高了8.825%,聚类的平均时间减少了0.92 s。
- Abstract:
-
The density-based DBSCAN clustering algorithm can identify clusters with arbitrary shape, however, the choice of the global parameters Eps and MinPts requires manual intervention, the process of regional query is complex and loses objects easily. Therefore, an improved density clustering algorithm with adaptive parameter for fast regional queries is proposed. Using KNN distribution and mathematical statistical analysis, the optimal global parameters Eps and MinPts are adaptively calculated, so as to avoid manual intervention and enable full automation of the clustering process. The regional query is conducted by improving the selection manner of the object, which is represented by a seed and thus avoiding manual intervention, and so the clustering efficiency is effectively increased. The experiment results looking at density clustering of four typical data sets show that the proposed method effectively improves clustering accuracy by 8.825% and reduces the average time of clustering by 0.92 s.
备注/Memo
收稿日期:2014-10-13;改回日期:。
基金项目:国家自然科学基金资助项目(61373126);江苏省产学研联合创新资金-前瞻性联合研究基金资助项目(BY2013015-33).
作者简介:周治平,男,1962年生,教授,博士,主要研究方向为检测技术与自动化装置、信息安全等;王杰锋,男,1989年生,硕士研究生,主要研究方向为智能信息处理;朱书伟,男,1990年生,硕士研究生,主要研究方向为数据挖掘与人工智能。
通讯作者:王杰锋.E-mail:18352513420@163.com.
更新日期/Last Update:
1900-01-01