Mamba Based Feature Extraction and Adaptive Multilevel Feature Fusion for 3D Tumor Segmentation from Multi-modal Medical Image

  • Zexin Ji
  • , Beiji Zou
  • , Xiaoyan Kui
  • , Hua Li
  • , Pierre Vera
  • , Su Ruan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Multi-modal 3D medical image segmentation aims to accurately identify tumor regions across different modalities, facing challenges from variations in image intensity and tumor morphology. Traditional convolutional neural network (CNN)-based methods struggle with capturing global features, while Transformers-based methods, despite effectively capturing global context, encounter high computational costs in 3D medical image segmentation. The Mamba model combines linear scalability with long-distance modeling, making it a promising approach for visual representation learning. However, Mamba-based 3D multi-modal segmentation still struggles to leverage modality-specific features and fuse complementary information effectively. In this paper, we propose a Mamba based feature extraction and adaptive multilevel feature fusion for 3D tumor segmentation using multi-modal medical image. We first develop the specific modality Mamba encoder to efficiently extract long-range relevant features that represent anatomical and pathological structures present in each modality. Moreover, we design a bi-level synergistic integration block that dynamically merges multi-modal and multi-level complementary features by the modality attention and channel attention learning. Lastly, the decoder combines deep semantic in- formation with fine-grained details to generate the tumor segmentation map. Experimental results on medical image datasets (PET/CT and MRI multi-sequence) show that our approach achieve competitive performance compared to the state-of-the-art CNN, Transformer, and Mamba-based approaches.

Original languageEnglish
Title of host publicationAdvanced Intelligent Computing Technology and Applications - 21st International Conference, ICIC 2025, Proceedings
EditorsDe-Shuang Huang, Qinhu Zhang, Chuanlei Zhang, Wei Chen
PublisherSpringer Science and Business Media Deutschland GmbH
Pages342-352
Number of pages11
ISBN (Print)9789819500352
DOIs
StatePublished - 2025
Event21st International Conference on Intelligent Computing, ICIC 2025 - Ningbo, China
Duration: Jul 26 2025Jul 29 2025

Publication series

NameLecture Notes in Computer Science
Volume15869 LNBI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference21st International Conference on Intelligent Computing, ICIC 2025
Country/TerritoryChina
CityNingbo
Period07/26/2507/29/25

Keywords

  • 3D Medical Image
  • Mamba
  • Multi-modal feature fusion
  • Tumor Segmentation

Fingerprint

Dive into the research topics of 'Mamba Based Feature Extraction and Adaptive Multilevel Feature Fusion for 3D Tumor Segmentation from Multi-modal Medical Image'. Together they form a unique fingerprint.

Cite this