← Back to Research Radar
Academic Publication Academic Publication

VM-UNet: Vision Mamba UNet for Medical Image Segmentation

340
Citations
September 16, 2025
Published Date

Research Abstract & Technology Focus

In the realm of medical image segmentation, both CNN-based and Transformer-based models have been extensively explored. However, CNNs exhibit limitations in long-range modeling capabilities, whereas Transformers are hampered by their quadratic computational complexity. Recently, State Space Models (SSMs), exemplified by Mamba, have emerged as a promising approach. They not only excel in modeling long-range interactions but also maintain a linear computational complexity. In this paper, leveraging state space models, we propose a U-shape architecture model for medical image segmentation, named Vision Mamba UNet (VM-UNet). Specifically, the Visual State Space (VSS) block is introduced as the foundation block to capture extensive contextual information, and an asymmetrical encoder-decoder structure is constructed with fewer convolution layers to save calculation cost. We conduct comprehensive experiments on the ISIC17, ISIC18, and Synapse datasets, and the results indicate that VM-UNet performs competitively in medical image segmentation tasks, e.g. obtaining 89.03, 89.71 and 81.08 in terms of DSC score on three datasets respectively. To our best knowledge, this is the first medical image segmentation model constructed based on the pure SSM-based model. We aim to establish a baseline and provide valuable insights for the future development of more efficient and effective SSM-based segmentation systems. Our code is available at
https://github.com/JCruan519/VM-UNet
.
vm-unet vision mamba medical image segmentation
Read Full Literature

Correlated Market Trend: Biomedical Engineering

Bridging academia to market: The 60-day public search velocity mapping directly to the core technology of this paper. Dashed line represents 7-day moving average.

Commercial Realization

Startups and Open Source tools heavily associated with the concepts explored in this paper.

  • Product Hunt
    Zzzappy
    Science-backed breaks to protect your vision & prevent RSI
  • Product Hunt
    GLM-5V-Turbo
    Vision-to-code foundation model for real GUI automation

Associated Media Narrative