Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Thomas Walter

Efficient Fine-Tuning of DINOv3 Pretrained on Natural Images for Atypical Mitotic Figure Classification in MIDOG 2025

Aug 28, 2025

Guillaume Balezo, Raphaël Bourgade, Thomas Walter

Abstract:Atypical mitotic figures (AMFs) are markers of abnormal cell division associated with poor prognosis, yet their detection remains difficult due to low prevalence, subtle morphology, and inter-observer variability. The MIDOG 2025 challenge introduces a benchmark for AMF classification across multiple domains. In this work, we evaluate the recently published DINOv3-H+ vision transformer, pretrained on natural images, which we fine-tuned using low-rank adaptation (LoRA, 650k trainable parameters) and extensive augmentation. Despite the domain gap, DINOv3 transfers effectively to histopathology, achieving a balanced accuracy of 0.8871 on the preliminary test set. These results highlight the robustness of DINOv3 pretraining and show that, when combined with parameter-efficient fine-tuning, it provides a strong baseline for atypical mitosis classification in MIDOG 2025.

* 3 pages. Challenge report for MIDOG 2025 (Task 2: Atypical Mitotic Figure Classification)

Via

Access Paper or Ask Questions

MIPHEI-ViT: Multiplex Immunofluorescence Prediction from H&E Images using ViT Foundation Models

May 15, 2025

Guillaume Balezo, Roger Trullo, Albert Pla Planas, Etienne Decenciere, Thomas Walter

Abstract:Histopathological analysis is a cornerstone of cancer diagnosis, with Hematoxylin and Eosin (H&E) staining routinely acquired for every patient to visualize cell morphology and tissue architecture. On the other hand, multiplex immunofluorescence (mIF) enables more precise cell type identification via proteomic markers, but has yet to achieve widespread clinical adoption due to cost and logistical constraints. To bridge this gap, we introduce MIPHEI (Multiplex Immunofluorescence Prediction from H&E), a U-Net-inspired architecture that integrates state-of-the-art ViT foundation models as encoders to predict mIF signals from H&E images. MIPHEI targets a comprehensive panel of markers spanning nuclear content, immune lineages (T cells, B cells, myeloid), epithelium, stroma, vasculature, and proliferation. We train our model using the publicly available ORION dataset of restained H&E and mIF images from colorectal cancer tissue, and validate it on two independent datasets. MIPHEI achieves accurate cell-type classification from H&E alone, with F1 scores of 0.88 for Pan-CK, 0.57 for CD3e, 0.56 for SMA, 0.36 for CD68, and 0.30 for CD20, substantially outperforming both a state-of-the-art baseline and a random classifier for most markers. Our results indicate that our model effectively captures the complex relationships between nuclear morphologies in their tissue context, as visible in H&E images and molecular markers defining specific cell types. MIPHEI offers a promising step toward enabling cell-type-aware analysis of large-scale H&E datasets, in view of uncovering relationships between spatial cellular organization and patient outcomes.

Via

Access Paper or Ask Questions

UWB Anchor Based Localization of a Planetary Rover

Apr 10, 2025

Andreas Nüchter, Lennart Werner, Martin Hesse, Dorit Borrmann, Thomas Walter, Sergio Montenegro, Gernot Grömer

Figure 1 for UWB Anchor Based Localization of a Planetary Rover

Figure 2 for UWB Anchor Based Localization of a Planetary Rover

Figure 3 for UWB Anchor Based Localization of a Planetary Rover

Figure 4 for UWB Anchor Based Localization of a Planetary Rover

Abstract:Localization of an autonomous mobile robot during planetary exploration is challenging due to the unknown terrain, the difficult lighting conditions and the lack of any global reference such as satellite navigation systems. We present a novel approach for robot localization based on ultra-wideband (UWB) technology. The robot sets up its own reference coordinate system by distributing UWB anchor nodes in the environment via a rocket-propelled launcher system. This allows the creation of a localization space in which UWB measurements are employed to supplement traditional SLAM-based techniques. The system was developed for our involvement in the ESA-ESRIC challenge 2021 and the AMADEE-24, an analog Mars simulation in Armenia by the Austrian Space Forum (\"OWF).

* International Symposium on Artificial Intelligence, Robotics and Automation in Space (i-SAIRAS '24)

Via

Access Paper or Ask Questions

Interpretable Embeddings for Segmentation-Free Single-Cell Analysis in Multiplex Imaging

Nov 02, 2024

Simon Gutwein, Daria Lazic, Thomas Walter, Sabine Taschner-Mandl, Roxane Licandro

Figure 1 for Interpretable Embeddings for Segmentation-Free Single-Cell Analysis in Multiplex Imaging

Figure 2 for Interpretable Embeddings for Segmentation-Free Single-Cell Analysis in Multiplex Imaging

Figure 3 for Interpretable Embeddings for Segmentation-Free Single-Cell Analysis in Multiplex Imaging

Figure 4 for Interpretable Embeddings for Segmentation-Free Single-Cell Analysis in Multiplex Imaging

Abstract:Multiplex Imaging (MI) enables the simultaneous visualization of multiple biological markers in separate imaging channels at subcellular resolution, providing valuable insights into cell-type heterogeneity and spatial organization. However, current computational pipelines rely on cell segmentation algorithms, which require laborious fine-tuning and can introduce downstream errors due to inaccurate single-cell representations. We propose a segmentation-free deep learning approach that leverages grouped convolutions to learn interpretable embedded features from each imaging channel, enabling robust cell-type identification without manual feature selection. Validated on an Imaging Mass Cytometry dataset of 1.8 million cells from neuroblastoma patients, our method enables the accurate identification of known cell types, showcasing its scalability and suitability for high-dimensional MI data.

* 5 Pages, 5 Figures, Submitted to ISBI 2025

Via

Access Paper or Ask Questions

Simple and Efficient Confidence Score for Grading Whole Slide Images

Mar 08, 2023

Mélanie Lubrano, Yaëlle Bellahsen-Harrar, Rutger Fick, Cécile Badoual, Thomas Walter

Figure 1 for Simple and Efficient Confidence Score for Grading Whole Slide Images

Figure 2 for Simple and Efficient Confidence Score for Grading Whole Slide Images

Figure 3 for Simple and Efficient Confidence Score for Grading Whole Slide Images

Figure 4 for Simple and Efficient Confidence Score for Grading Whole Slide Images

Abstract:Grading precancerous lesions on whole slide images is a challenging task: the continuous space of morphological phenotypes makes clear-cut decisions between different grades often difficult, leading to low inter- and intra-rater agreements. More and more Artificial Intelligence (AI) algorithms are developed to help pathologists perform and standardize their diagnosis. However, those models can render their prediction without consideration of the ambiguity of the classes and can fail without notice which prevent their wider acceptance in a clinical context. In this paper, we propose a new score to measure the confidence of AI models in grading tasks. Our confidence score is specifically adapted to ordinal output variables, is versatile and does not require extra training or additional inferences nor particular architecture changes. Comparison to other popular techniques such as Monte Carlo Dropout and deep ensembles shows that our method provides state-of-the art results, while being simpler, more versatile and less computationally intensive. The score is also easily interpretable and consistent with real life hesitations of pathologists. We show that the score is capable of accurately identifying mispredicted slides and that accuracy for high confidence decisions is significantly higher than for low-confidence decisions (gap in AUC of 17.1% on the test set). We believe that the proposed confidence score could be leveraged by pathologists directly in their workflow and assist them on difficult tasks such as grading precancerous lesions.

Via

Access Paper or Ask Questions

PointFISH -- learning point cloud representations for RNA localization patterns

Feb 21, 2023

Arthur Imbert, Florian Mueller, Thomas Walter

Abstract:Subcellular RNA localization is a critical mechanism for the spatial control of gene expression. Its mechanism and precise functional role is not yet very well understood. Single Molecule Fluorescence in Situ Hybridization (smFISH) images allow for the detection of individual RNA molecules with subcellular accuracy. In return, smFISH requires robust methods to quantify and classify RNA spatial distribution. Here, we present PointFISH, a novel computational approach for the recognition of RNA localization patterns. PointFISH is an attention-based network for computing continuous vector representations of RNA point clouds. Trained on simulations only, it can directly process extracted coordinates from experimental smFISH images. The resulting embedding allows scalable and flexible spatial transcriptomics analysis and matches performance of hand-crafted pipelines.

Via

Access Paper or Ask Questions

Learning with minimal effort: leveraging in silico labeling for cell and nucleus segmentation

Jan 10, 2023

Thomas Bonte, Maxence Philbert, Emeline Coleno, Edouard Bertrand, Arthur Imbert, Thomas Walter

Figure 1 for Learning with minimal effort: leveraging in silico labeling for cell and nucleus segmentation

Figure 2 for Learning with minimal effort: leveraging in silico labeling for cell and nucleus segmentation

Figure 3 for Learning with minimal effort: leveraging in silico labeling for cell and nucleus segmentation

Figure 4 for Learning with minimal effort: leveraging in silico labeling for cell and nucleus segmentation

Abstract:Deep learning provides us with powerful methods to perform nucleus or cell segmentation with unprecedented quality. However, these methods usually require large training sets of manually annotated images, which are tedious and expensive to generate. In this paper we propose to use In Silico Labeling (ISL) as a pretraining scheme for segmentation tasks. The strategy is to acquire label-free microscopy images (such as bright-field or phase contrast) along fluorescently labeled images (such as DAPI or CellMask). We then train a model to predict the fluorescently labeled images from the label-free microscopy images. By comparing segmentation performance across several training set sizes, we show that such a scheme can dramatically reduce the number of required annotations.

Via

Access Paper or Ask Questions

Giga-SSL: Self-Supervised Learning for Gigapixel Images

Dec 06, 2022

Tristan Lazard, Marvin Lerousseau, Etienne Decencière, Thomas Walter

Figure 1 for Giga-SSL: Self-Supervised Learning for Gigapixel Images

Figure 2 for Giga-SSL: Self-Supervised Learning for Gigapixel Images

Figure 3 for Giga-SSL: Self-Supervised Learning for Gigapixel Images

Figure 4 for Giga-SSL: Self-Supervised Learning for Gigapixel Images

Abstract:Whole slide images (WSI) are microscopy images of stained tissue slides routinely prepared for diagnosis and treatment selection in medical practice. WSI are very large (gigapixel size) and complex (made of up to millions of cells). The current state-of-the-art (SoTA) approach to classify WSI subdivides them into tiles, encodes them by pre-trained networks and applies Multiple Instance Learning (MIL) to train for specific downstream tasks. However, annotated datasets are often small, typically a few hundred to a few thousand WSI, which may cause overfitting and underperforming models. Conversely, the number of unannotated WSI is ever increasing, with datasets of tens of thousands (soon to be millions) of images available. While it has been previously proposed to use these unannotated data to identify suitable tile representations by self-supervised learning (SSL), downstream classification tasks still require full supervision because parts of the MIL architecture is not trained during tile level SSL pre-training. Here, we propose a strategy of slide level SSL to leverage the large number of WSI without annotations to infer powerful slide representations. Applying our method to The Cancer-Genome Atlas, one of the most widely used data resources in cancer research (16 TB image data), we are able to downsize the dataset to 23 MB without any loss in predictive power: we show that a linear classifier trained on top of these embeddings maintains or improves previous SoTA performances on various benchmark WSI classification tasks. Finally, we observe that training a classifier on these representations with tiny datasets (e.g. 50 slides) improved performances over SoTA by an average of +6.3 AUC points over all downstream tasks.

Via

Access Paper or Ask Questions

Assessment of algorithms for mitosis detection in breast cancer histopathology images

Nov 21, 2014

Mitko Veta, Paul J. van Diest, Stefan M. Willems, Haibo Wang, Anant Madabhushi, Angel Cruz-Roa, Fabio Gonzalez, Anders B. L. Larsen, Jacob S. Vestergaard, Anders B. Dahl(+19 more)

Figure 1 for Assessment of algorithms for mitosis detection in breast cancer histopathology images

Figure 2 for Assessment of algorithms for mitosis detection in breast cancer histopathology images

Figure 3 for Assessment of algorithms for mitosis detection in breast cancer histopathology images

Figure 4 for Assessment of algorithms for mitosis detection in breast cancer histopathology images

Abstract:The proliferative activity of breast tumors, which is routinely estimated by counting of mitotic figures in hematoxylin and eosin stained histology sections, is considered to be one of the most important prognostic markers. However, mitosis counting is laborious, subjective and may suffer from low inter-observer agreement. With the wider acceptance of whole slide images in pathology labs, automatic image analysis has been proposed as a potential solution for these issues. In this paper, the results from the Assessment of Mitosis Detection Algorithms 2013 (AMIDA13) challenge are described. The challenge was based on a data set consisting of 12 training and 11 testing subjects, with more than one thousand annotated mitotic figures by multiple observers. Short descriptions and results from the evaluation of eleven methods are presented. The top performing method has an error rate that is comparable to the inter-observer agreement among pathologists.

* 23 pages, 5 figures, accepted for publication in the journal Medical Image Analysis

Via

Access Paper or Ask Questions