[1]彭晨阳,何立风,王梦溪,等.SDA U-Mamba: 基于频域动态特征融合与双极路由注意力的医学图像分割[J].智能系统学报,2026,21(1):284-294.[doi:10.11992/tis.202508032]
PENG Chenyang,HE Lifeng,WANG Mengxi,et al.SDA U-Mamba: spectral-domain dynamic fusion and bipolar routing attention for medical image segmentation[J].CAAI Transactions on Intelligent Systems,2026,21(1):284-294.[doi:10.11992/tis.202508032]
点击复制
《智能系统学报》[ISSN 1673-4785/CN 23-1538/TP] 卷:
21
期数:
2026年第1期
页码:
284-294
栏目:
人工智能院长论坛
出版日期:
2026-03-05
- Title:
-
SDA U-Mamba: spectral-domain dynamic fusion and bipolar routing attention for medical image segmentation
- 作者:
-
彭晨阳1,2, 何立风1, 王梦溪1,2, 杜晓刚1,2, 王营博1,2, 路艳3, 雷涛1,2
-
1. 陕西科技大学 电子信息与人工智能学院, 陕西 西安 710021;
2. 陕西科技大学 陕西省人工智能联合实验室, 陕西 西安 710021;
3. 陕西科技大学 轻工科学与工程学院, 陕西 西安 710021
- Author(s):
-
PENG Chenyang1,2, HE Lifeng1, WANG Mengxi1,2, DU Xiaogang1,2, WANG Yingbo1,2, LU Yan3, LEI Tao1,2
-
1. School of Electronic Information and Artificial Intelligence, Shaanxi University of Science and Technology, Xi’an 710021, China;
2. Shaanxi Provincial Joint Laboratory of Artificial Intelligence, Shaanxi University of Science and Technology, Xi’an 710021, China;
3. School of Light Industry Science and Engineering, Shaanxi University of Science and Technology, Xi’an 710021, China
-
- 关键词:
-
Mamba; 医学图像分割; 状态空间模型; 谱动态特征融合; 傅里叶变换; 注意力; U-Net架构; 多尺度特征建模
- Keywords:
-
Mamba; medical image segmentation; state space model; spectral-dynamic feature fusion; Fourier transform; attention mechanism; U-Net architecture; multi-scale feature modeling.
- 分类号:
-
TP391.4
- DOI:
-
10.11992/tis.202508032
- 摘要:
-
以Mamba为代表的状态空间模型(state space models,SSM)凭借其出色的长程依赖建模能力与较低的线性计算复杂度,在医学图像分割领域展现出广阔的应用前景。但该类方法对图像逐像素展平的处理方式会破坏图像空间结构,导致局部细节提取不足,且缺乏对器官与病灶的聚焦机制,在复杂背景下易引入冗余背景信息。为解决上述问题,本文提出了一种融合谱域动态特征与注意力机制的医学图像分割网络——频域动态注意力U型Mamba(spectral dynamic attention U-Mamba,SDA U-Mamba)。该网络采用分层U型结构设计,分别从空间连续性建模和区域聚焦能力两方面对Mamba进行优化。在网络浅层设计Mamba空频注意力模块,通过融合空域卷积、频域变换和金字塔自注意力结构以提升模型的局部信息感知能力与多尺度上下文建模效果;在网络深层引入双极路由注意力模块,通过动态路由选择与稀疏激活机制增强模型对医学图像器官或病灶的表征。实验结果表明,SDA U-Mamba在BUSI、CVC-ClinicDB与CHAOS-Liver这3个公开医学数据集上的分割性能显著优于当前主流方法,平均交并比(intersection over union,IoU)提升2.61%。本文所提算法可用于临床医学图像分割。
- Abstract:
-
State space models(SSM) represented by Mamba have demonstrated broad application prospects in medical image segmentation owing to their excellent long-range dependency modeling and low linear computational complexity. However, the pixel-wise flattening operation used in these methods destroys the spatial structure of images, leading to insufficient extraction of local details. Additionally, they lack a mechanism to focus on organs and lesions, making them prone to introducing redundant background information in medical images with complex contexts. To address these issues,this study proposes spectral dynamic attention U-Mamba(SDA U-Mamba), a medical image segmentation network that integrates spectral-domain dynamic features with attention mechanisms. The network adopts a hierarchical U-shaped architecture, optimizing Mamba in two aspects: spatial continuity modeling and regional focusing capability. Specifically, a Mamba Spatial-Frequency Attention module is introduced into the network’s shallow layers. This module enhances the model’s local information perception and multi-scale context modeling performance by fusing spatial convolution, frequency-domain transformation, and a pyramid self-attention structure. Furthermore, a Bipolar Routing Attention module is introduced in the deep layers, strengthening the network’s representation of organs and lesions in medical images through dynamic routing selection and sparse activation mechanisms. Experimental results show that SDA U-Mamba achieved significantly better segmentation performance than current state-of-the-art methods across three public medical datasets (BUSI, CVC-ClinicDB, and CHAOS-Liver), with an average improvement in intersection over union(IoU) of 2.61%. The results indicate that the proposed algorithm can be applied to clinical medical image segmentation.
备注/Memo
收稿日期:2025-8-28。
基金项目:国家自然科学基金项目( 62271296;62201334);陕西省创新能力支撑计划项目(2025RS-CXTD-012);陕西省重点研发计划项目(2024GX-YBXM-121);西安市中青年科技创新领军人才项目(25ZQRC00019);陕西省教育厅科学研究计划项目(23JP014; 23JP022).
作者简介:彭晨阳,硕士研究生,主要研究方向为计算机视觉、机器学习。E-mail:2046591497@qq.com。;何立风,教授,博士,主要研究方向为数字图像处理、机器学习。 主持国家自然科学基金2项,发表学术论文60余篇,申请国家发明专利10余项。E-mail:helifeng@ist.aichipu.ac.jp。;雷涛,教授,博士,陕西科技大学电子信息与人工智能学院副院长,主要研究方向为人工智能、计算机视觉。主持国家自然基金5项,发表学术论文100余篇,其中12篇论文入选ESI高被引论文。E-mail:leitao@sust.edu.cn。
通讯作者:雷涛. E-mail:leitao@sust.edu.cn
更新日期/Last Update:
2026-01-05