[1]赵亚凤,于继超,孙骞,等.基于Cluster Contrast和ViT的东北虎个体重识别无监督学习框架研究[J].智能系统学报,2026,21(2):435-443.[doi:10.11992/tis.202507012]
ZHAO Yafeng,YU Jichao,SUN Qian,et al.Unsupervised framework for Amur tiger re-identification with Cluster Contrast and ViT[J].CAAI Transactions on Intelligent Systems,2026,21(2):435-443.[doi:10.11992/tis.202507012]
点击复制
《智能系统学报》[ISSN 1673-4785/CN 23-1538/TP] 卷:
21
期数:
2026年第2期
页码:
435-443
栏目:
学术论文—机器感知与模式识别
出版日期:
2026-03-05
- Title:
-
Unsupervised framework for Amur tiger re-identification with Cluster Contrast and ViT
- 作者:
-
赵亚凤1, 于继超1, 孙骞2, 康嘉璐1, 王梓丞1
-
1. 东北林业大学 计算机与控制工程学院, 黑龙江 哈尔滨 150040;
2. 哈尔滨工程大学 信息与通信工程学院, 黑龙江 哈尔滨 150001
- Author(s):
-
ZHAO Yafeng1, YU Jichao1, SUN Qian2, KANG Jialu1, WANG Zicheng1
-
1. College of Computer and Control Engineering, Northeast Forestry University, Harbin 150040, China;
2. College of Information And Communication Engineering, Harbin Engineering University, Harbin 150001, China
-
- 关键词:
-
东北虎; 重识别; 深度学习; 无监督学习; ATRW 数据集; Cluster Contrast机制; vision Transformer; 坐标注意力
- Keywords:
-
Amur tiger; re-identification; deep learning; unsupervised learning; ATRW dataset; Cluster Contrast mechanism; vision Transformer; coordinate attention
- 分类号:
-
TP391.4
- DOI:
-
10.11992/tis.202507012
- 摘要:
-
针对东北虎个体重识别中野外数据标注困难、样本失衡等挑战,以野外东北虎 (Amur tiger re-identification in the wild,ATRW)数据集为基础提出一种“全局特征提取-空间位置强化-无监督均衡训练”的协同框架,完成无监督重识别。使用vision Transformer(ViT)自注意力机制捕捉东北虎条纹的全局长距离的依赖特征,通过坐标注意力机制加强对条纹空间位置的精确解析,解决传统卷积神经网络局部性导致的特征关联缺失问题。引入Cluster Contrast机制构建簇级内存字典,通过动量更新平衡不同样本量东北虎的特征优化速率,避免无监督学习中样本失衡导致的特征偏差。实验表明,模型在ATRW(r+i)数据集上平均精度的值为86.4%,高于原有的特征提取ViT和Resnet50_ibn模型,对不同数据分布和数据量具有良好泛化能力,适配野外可见光/红外多设备协同监测需求。本文所提方法为东北虎个体识别提供了兼具准确性与鲁棒性的技术方案。
- Abstract:
-
To address challenges like difficult annotation of wild data and sample imbalance in Amur tiger individual re-identification, a collaborative framework based on the ATRW dataset was developed for unsupervised re-identification. The ViT self-attention mechanism was used to capture global long-distance dependent features of stripes. Combined with the coordinate attention mechanism, it enhanced spatial position analysis of stripes, making up for missing feature correlations caused by the locality of traditional CNNs. The Cluster Contrast mechanism was introduced to build a cluster-level memory dictionary. Momentum update balanced feature optimization rates of individuals with different sample sizes, alleviating feature biases from sample imbalance in unsupervised learning. Experiments show the model achieves an mAP of86.4% on the ATRW (r+i) dataset, outperforming the original feature extraction models (ViT and Resnet50_ibn). It exhibits good generalization ability across different data distributions and data volumes, and is suitable for the demand of collaborative monitoring with visible light/infrared multi-devices in the wild. The method proposed in this study provides a technical solution with both accuracy and robustness for Amur tiger individual identification.
备注/Memo
收稿日期:2025-7-12。
基金项目:国家自然科学基金项目(32371864).
作者简介:赵亚凤,副教授,博士,主要研究方向为计算机视觉与模式识别、人工智能与智能控制、无线传感器网络。主持和参与国家自然科学基金、黑龙江省自然科学基金等项目10余项,获发明专利及实用新型专利授权10余项,发表学术论文20余篇,作为主编及副主编出版著作2部。E-mail:zyf@nefu.edu.cn。;于继超,硕士研究生,主要研究方向为计算机视觉。E-mail:jichaoyu@nefu.edu.cn。;王梓丞,副教授,博士后。黑龙江省光学学会会员,国际期刊《Photonics》特邀编辑,《半导体光电》编委。主要研究方向为高密度集成光学器件设计、智能光纤传感系统开发及多物理场耦合仿真技术。主持或参与省部级及横向项目10余项,获黑龙江省科学技术发明二等奖,指导学生获国家级、省部级竞赛奖项多项。获发明专利及软件著作授权12项。近5年第一作者/通信作者发表论文共13篇。E-mail:wangzicheng1992@163.com。
通讯作者:王梓丞. E-mail:wangzicheng1992@163.com
更新日期/Last Update:
1900-01-01