[1]ZHAO Yafeng,YU Jichao,SUN Qian,et al.Unsupervised framework for Amur tiger re-identification with Cluster Contrast and ViT[J].CAAI Transactions on Intelligent Systems,2026,21(2):435-443.[doi:10.11992/tis.202507012]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
21
Number of periods:
2026 2
Page number:
435-443
Column:
学术论文—机器感知与模式识别
Public date:
2026-05-16
- Title:
-
Unsupervised framework for Amur tiger re-identification with Cluster Contrast and ViT
- Author(s):
-
ZHAO Yafeng1; YU Jichao1; SUN Qian2; KANG Jialu1; WANG Zicheng1
-
1. College of Computer and Control Engineering, Northeast Forestry University, Harbin 150040, China;
2. College of Information And Communication Engineering, Harbin Engineering University, Harbin 150001, China
-
- Keywords:
-
Amur tiger; re-identification; deep learning; unsupervised learning; ATRW dataset; Cluster Contrast mechanism; vision Transformer; coordinate attention
- CLC:
-
TP391.4
- DOI:
-
10.11992/tis.202507012
- Abstract:
-
To address challenges like difficult annotation of wild data and sample imbalance in Amur tiger individual re-identification, a collaborative framework based on the ATRW dataset was developed for unsupervised re-identification. The ViT self-attention mechanism was used to capture global long-distance dependent features of stripes. Combined with the coordinate attention mechanism, it enhanced spatial position analysis of stripes, making up for missing feature correlations caused by the locality of traditional CNNs. The Cluster Contrast mechanism was introduced to build a cluster-level memory dictionary. Momentum update balanced feature optimization rates of individuals with different sample sizes, alleviating feature biases from sample imbalance in unsupervised learning. Experiments show the model achieves an mAP of86.4% on the ATRW (r+i) dataset, outperforming the original feature extraction models (ViT and Resnet50_ibn). It exhibits good generalization ability across different data distributions and data volumes, and is suitable for the demand of collaborative monitoring with visible light/infrared multi-devices in the wild. The method proposed in this study provides a technical solution with both accuracy and robustness for Amur tiger individual identification.