Title: HRMedSeg: Unlocking High-resolution Medical Image segmentation via Memory-efficient Attention Modeling

Apr 08, 2025

Qing Xu, Zhenye Lou, Chenxin Li, Xiangjian He, Rong Qu, Tesema Fiseha Berhanu, Yi Wang, Wenting Duan, Zhen Chen

Abstract: High-resolution segmentation is critical for precise disease diagnosis, as it extracts micro-imaging information from medical images. Existing transformer-based encoder-decoder frameworks have demonstrated remarkable versatility and zero-shot performance in medical segmentation. However, they usually incur huge memory costs when predicting large segmentation masks, which makes them expensive to apply in real-world scenarios. To address this limitation, we propose a memory-efficient framework for high-resolution medical image segmentation, called HRMedSeg. Specifically, we first devise a lightweight gated vision transformer (LGViT) as our image encoder to model long-range dependencies with linear complexity. Then, we design an efficient cross-multiscale decoder (ECM-Decoder) to generate high-resolution segmentation masks. Moreover, we use feature distillation during pretraining to unleash the potential of our proposed model. Extensive experiments show that HRMedSeg outperforms state-of-the-art methods on diverse high-resolution medical image segmentation tasks. In particular, HRMedSeg uses only 0.59 GB of GPU memory per batch during fine-tuning, demonstrating low training costs. In addition, when combined with the Segment Anything Model (SAM), our HRMedSegSAM uses only 0.61% of the parameters of SAM-H. The code is available at https://github.com/xq141839/HRMedSeg.
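To make the "linear complexity" claim in the abstract concrete, the sketch below shows one generic way attention can avoid the N x N score matrix: kernelized (linear) attention with an elu(x) + 1 feature map. This is an illustrative assumption, not the LGViT design, which the abstract does not specify; the function names and the feature map are hypothetical choices made for this example only.

```python
import numpy as np

def feature_map(x):
    # elu(x) + 1: a positive feature map often used in kernelized attention.
    # Assumed here for illustration; not taken from the HRMedSeg paper.
    return np.where(x > 0, x + 1.0, np.exp(x))

def quadratic_kernel_attention(q, k, v):
    # Reference form: materializes the full (N, N) score matrix,
    # so memory grows quadratically with the number of tokens N.
    phi_q, phi_k = feature_map(q), feature_map(k)
    scores = phi_q @ phi_k.T                          # (N, N)
    weights = scores / scores.sum(axis=1, keepdims=True)
    return weights @ v                                # (N, d_v)

def linear_kernel_attention(q, k, v):
    # Same result by associativity: compute phi(K)^T V first,
    # so the largest intermediate is (d, d_v) instead of (N, N).
    phi_q, phi_k = feature_map(q), feature_map(k)
    kv = phi_k.T @ v                                  # (d, d_v)
    k_sum = phi_k.sum(axis=0)                         # (d,)
    numerator = phi_q @ kv                            # (N, d_v)
    denominator = phi_q @ k_sum                       # (N,)
    return numerator / denominator[:, None]

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n_tokens, dim = 1024, 16
    q = rng.standard_normal((n_tokens, dim))
    k = rng.standard_normal((n_tokens, dim))
    v = rng.standard_normal((n_tokens, dim))
    out_quadratic = quadratic_kernel_attention(q, k, v)
    out_linear = linear_kernel_attention(q, k, v)
    print(np.allclose(out_quadratic, out_linear))
```

Both functions compute the same output; the difference is only in the order of the matrix products. For high-resolution inputs the token count N grows with image area, which is why avoiding the (N, N) intermediate matters for memory.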

* Under Review
