[1]GAO Xiaofang,JIA Zonghan,LIANG Jiye.Contrastive clustering algorithm based on trend consistency learning[J].CAAI Transactions on Intelligent Systems,2026,21(2):389-398.[doi:10.11992/tis.202506027]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
21
Number of periods:
2026 2
Page number:
389-398
Column:
学术论文—机器学习
Public date:
2026-05-16
- Title:
-
Contrastive clustering algorithm based on trend consistency learning
- Author(s):
-
GAO Xiaofang1; JIA Zonghan1; LIANG Jiye1; 2
-
1. School of Computer and Information Technology, Shanxi University, Taiyuan 030006, China;
2. Laboratory of Computational Intelligence and Chinese Processing, Ministry of Education, Taiyuan 030006, China
-
- Keywords:
-
contrast clustering; contrastive learning; false negatives; trend consistency; pseudo labels; semantic information; inter-class distinguishability; mask matrix
- CLC:
-
TP18
- DOI:
-
10.11992/tis.202506027
- Abstract:
-
In recent years, contrastive clustering has become a research hotspot in the fields of data mining and machine learning, aiming to enhance clustering performance by leveraging the powerful feature representation capabilities of contrastive learning. However, the use of contrastive learning often introduces the problem of false negative examples due to category conflicts, thereby reducing the performance of contrastive clustering. To address this issue, this paper proposes a contrastive clustering algorithm based on a trend consistency constraint strategy (CCTC). By marking high-confidence sample pairs with consistent category information in the trend consistency array and using this semantic information to calculate the trend constraint matrix to assist in selecting positive samples, the algorithm achieves dynamic interaction between cluster-level and instance-level sample information through the combination of instance-level and cluster-level consistency loss functions, thereby enhancing sample consistency and inter-class distinguishability. Compared with other contrastive clustering algorithms, this method can utilize the pseudo-label change trends in the multi-round training process to obtain sample pairs with high-confidence category trend consistency, thus improving the clustering performance of the model. Experiments have demonstrated the effectiveness of the algorithm.