Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:DiN: Diffusion Model for Robust Medical VQA with Semantic Noisy Labels

Mar 24, 2025

Erjian Guo, Zhen Zhao, Zicheng Wang, Tong Chen, Yunyi Liu, Luping Zhou

Figure 1 for DiN: Diffusion Model for Robust Medical VQA with Semantic Noisy Labels

Figure 2 for DiN: Diffusion Model for Robust Medical VQA with Semantic Noisy Labels

Figure 3 for DiN: Diffusion Model for Robust Medical VQA with Semantic Noisy Labels

Figure 4 for DiN: Diffusion Model for Robust Medical VQA with Semantic Noisy Labels

Share this with someone who'll enjoy it:

Abstract:Medical Visual Question Answering (Med-VQA) systems benefit the interpretation of medical images containing critical clinical information. However, the challenge of noisy labels and limited high-quality datasets remains underexplored. To address this, we establish the first benchmark for noisy labels in Med-VQA by simulating human mislabeling with semantically designed noise types. More importantly, we introduce the DiN framework, which leverages a diffusion model to handle noisy labels in Med-VQA. Unlike the dominant classification-based VQA approaches that directly predict answers, our Answer Diffuser (AD) module employs a coarse-to-fine process, refining answer candidates with a diffusion model for improved accuracy. The Answer Condition Generator (ACG) further enhances this process by generating task-specific conditional information via integrating answer embeddings with fused image-question features. To address label noise, our Noisy Label Refinement(NLR) module introduces a robust loss function and dynamic answer adjustment to further boost the performance of the AD module.

View paper on

Share this with someone who'll enjoy it:

Title:DiN: Diffusion Model for Robust Medical VQA with Semantic Noisy Labels

Paper and Code