Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Pablo Meseguer

Exploring visual language models as a powerful tool in the diagnosis of Ewing Sarcoma

Jan 14, 2025

Alvaro Pastor-Naranjo, Pablo Meseguer, Rocío del Amor, Jose Antonio Lopez-Guerrero, Samuel Navarro, Katia Scotlandi, Antonio Llombart-Bosch, Isidro Machado, Valery Naranjo

Figure 1 for Exploring visual language models as a powerful tool in the diagnosis of Ewing Sarcoma

Figure 2 for Exploring visual language models as a powerful tool in the diagnosis of Ewing Sarcoma

Figure 3 for Exploring visual language models as a powerful tool in the diagnosis of Ewing Sarcoma

Figure 4 for Exploring visual language models as a powerful tool in the diagnosis of Ewing Sarcoma

Abstract:Ewing's sarcoma (ES), characterized by a high density of small round blue cells without structural organization, presents a significant health concern, particularly among adolescents aged 10 to 19. Artificial intelligence-based systems for automated analysis of histopathological images are promising to contribute to an accurate diagnosis of ES. In this context, this study explores the feature extraction ability of different pre-training strategies for distinguishing ES from other soft tissue or bone sarcomas with similar morphology in digitized tissue microarrays for the first time, as far as we know. Vision-language supervision (VLS) is compared to fully-supervised ImageNet pre-training within a multiple instance learning paradigm. Our findings indicate a substantial improvement in diagnostic accuracy with the adaption of VLS using an in-domain dataset. Notably, these models not only enhance the accuracy of predicted classes but also drastically reduce the number of trainable parameters and computational costs.

* 11 pages, 5 figures, 2 tables. Oral presentation at KES-InMed 2024 held in Madeira, Portugal

Via

Access Paper or Ask Questions

Enhancing Whole Slide Image Classification through Supervised Contrastive Domain Adaptation

Dec 05, 2024

Ilán Carretero, Pablo Meseguer, Rocío del Amor, Valery Naranjo

Figure 1 for Enhancing Whole Slide Image Classification through Supervised Contrastive Domain Adaptation

Figure 2 for Enhancing Whole Slide Image Classification through Supervised Contrastive Domain Adaptation

Figure 3 for Enhancing Whole Slide Image Classification through Supervised Contrastive Domain Adaptation

Figure 4 for Enhancing Whole Slide Image Classification through Supervised Contrastive Domain Adaptation

Abstract:Domain shift in the field of histopathological imaging is a common phenomenon due to the intra- and inter-hospital variability of staining and digitization protocols. The implementation of robust models, capable of creating generalized domains, represents a need to be solved. In this work, a new domain adaptation method to deal with the variability between histopathological images from multiple centers is presented. In particular, our method adds a training constraint to the supervised contrastive learning approach to achieve domain adaptation and improve inter-class separability. Experiments performed on domain adaptation and classification of whole-slide images of six skin cancer subtypes from two centers demonstrate the method's usefulness. The results reflect superior performance compared to not using domain adaptation after feature extraction or staining normalization.

* Accepted in CASEIB 2024

Via

Access Paper or Ask Questions

MI-VisionShot: Few-shot adaptation of vision-language models for slide-level classification of histopathological images

Oct 21, 2024

Pablo Meseguer, Rocío del Amor, Valery Naranjo

Figure 1 for MI-VisionShot: Few-shot adaptation of vision-language models for slide-level classification of histopathological images

Figure 2 for MI-VisionShot: Few-shot adaptation of vision-language models for slide-level classification of histopathological images

Figure 3 for MI-VisionShot: Few-shot adaptation of vision-language models for slide-level classification of histopathological images

Figure 4 for MI-VisionShot: Few-shot adaptation of vision-language models for slide-level classification of histopathological images

Abstract:Vision-language supervision has made remarkable strides in learning visual representations from textual guidance. In digital pathology, vision-language models (VLM), pre-trained on curated datasets of histological image-captions, have been adapted to downstream tasks, such as region of interest classification. Zero-shot transfer for slide-level prediction has been formulated by MI-Zero, but it exhibits high variability depending on the textual prompts. Inspired by prototypical learning, we propose MI-VisionShot, a training-free adaptation method on top of VLMs to predict slide-level labels in few-shot learning scenarios. Our framework takes advantage of the excellent representation learning of VLM to create prototype-based classifiers under a multiple-instance setting by retrieving the most discriminative patches within each slide. Experimentation through different settings shows the ability of MI-VisionShot to surpass zero-shot transfer with lower variability, even in low-shot scenarios. Code coming soon at thttps://github.com/cvblab/MIVisionShot.

* Manuscript accepted for oral presentation at KES-InnovationInMedicine 2024 held on Madeira, Portugal

Via

Access Paper or Ask Questions

Foundation Models for Slide-level Cancer Subtyping in Digital Pathology

Oct 21, 2024

Pablo Meseguer, Rocío del Amor, Adrian Colomer, Valery Naranjo

Figure 1 for Foundation Models for Slide-level Cancer Subtyping in Digital Pathology

Figure 2 for Foundation Models for Slide-level Cancer Subtyping in Digital Pathology

Figure 3 for Foundation Models for Slide-level Cancer Subtyping in Digital Pathology

Figure 4 for Foundation Models for Slide-level Cancer Subtyping in Digital Pathology

Abstract:Since the emergence of the ImageNet dataset, the pretraining and fine-tuning approach has become widely adopted in computer vision due to the ability of ImageNet-pretrained models to learn a wide variety of visual features. However, a significant challenge arises when adapting these models to domain-specific fields, such as digital pathology, due to substantial gaps between domains. To address this limitation, foundation models (FM) have been trained on large-scale in-domain datasets to learn the intricate features of histopathology images. In cancer diagnosis, whole-slide image (WSI) prediction is essential for patient prognosis, and multiple instance learning (MIL) has been implemented to handle the giga-pixel size of WSI. As MIL frameworks rely on patch-level feature aggregation, this work aims to compare the performance of various feature extractors developed under different pretraining strategies for cancer subtyping on WSI under a MIL framework. Results demonstrate the ability of foundation models to surpass ImageNet-pretrained models for the prediction of six skin cancer subtypes

* Manuscript accepted for oral presentation at Decision Science Allieance -INternational Summer Conference (DSA-ISC) 2024 held on Valencia, Spain

Via

Access Paper or Ask Questions