Title: Masks and Manuscripts: Advancing Medical Pre-training with End-to-End Masking and Narrative Structuring

Jul 23, 2024

Shreyank N Gowda, David A. Clifton


Abstract: Contemporary medical contrastive learning faces challenges from inconsistent semantics and sample pair morphology, leading to dispersed and converging semantic shifts. The variability in text reports, due to multiple authors, complicates semantic consistency. To tackle these issues, we propose a two-step approach. Initially, text reports are converted into a standardized triplet format, laying the groundwork for our novel concept of "observations" and "verdicts". This approach refines the {Entity, Position, Exist} triplet into binary questions, guiding towards a clear "verdict". We also innovate in visual pre-training with Meijering-based masking, focusing on features representative of medical images' local context. By integrating this with our text conversion method, our model advances cross-modal representation in a multimodal contrastive learning framework, setting new benchmarks in medical image analysis.

* Accepted in MICCAI-24
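
The text-structuring step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the `Triplet` class, the question template, and the helper names are hypothetical, and treating the question as the "observation" and the yes/no answer as the "verdict" is one reading of the abstract, not something it states in detail.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Triplet:
    """Hypothetical container for an {Entity, Position, Exist} triplet."""
    entity: str    # finding named in the report, e.g. "effusion"
    position: str  # anatomical location, e.g. "left pleural space"
    exist: bool    # True if the report asserts the finding is present


def to_observation(t: Triplet) -> str:
    # Assumed template: phrase entity and position as a binary question.
    return f"Is there {t.entity} in the {t.position}?"


def to_verdict(t: Triplet) -> str:
    # Assumed mapping: the Exist flag becomes a yes/no answer.
    return "yes" if t.exist else "no"


def structure_report(triplets: List[Triplet]) -> List[Tuple[str, str]]:
    """Convert a report's triplets into (observation, verdict) pairs."""
    return [(to_observation(t), to_verdict(t)) for t in triplets]


# Illustrative input; these triplets are made up, not taken from any dataset.
report = [
    Triplet("effusion", "left pleural space", True),
    Triplet("consolidation", "right lower lobe", False),
]
pairs = structure_report(report)
for question, answer in pairs:
    print(question, answer)
```

Reducing every report to the same question template is what removes author-specific phrasing: two radiologists who describe the same finding differently would produce identical (observation, verdict) pairs under this scheme.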
