Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Hassan Rivaz

Lightweight Physics-Informed Zero-Shot Ultrasound Plane Wave Denoising

Jun 26, 2025

Hojat Asgariandehkordi, Mostafa Sharifzadeh, Hassan Rivaz

Abstract:Ultrasound Coherent Plane Wave Compounding (CPWC) enhances image contrast by combining echoes from multiple steered transmissions. While increasing the number of angles generally improves image quality, it drastically reduces the frame rate and can introduce blurring artifacts in fast-moving targets. Moreover, compounded images remain susceptible to noise, particularly when acquired with a limited number of transmissions. We propose a zero-shot denoising framework tailored for low-angle CPWC acquisitions, which enhances contrast without relying on a separate training dataset. The method divides the available transmission angles into two disjoint subsets, each used to form compound images that include higher noise levels. The new compounded images are then used to train a deep model via a self-supervised residual learning scheme, enabling it to suppress incoherent noise while preserving anatomical structures. Because angle-dependent artifacts vary between the subsets while the underlying tissue response is similar, this physics-informed pairing allows the network to learn to disentangle the inconsistent artifacts from the consistent tissue signal. Unlike supervised methods, our model requires no domain-specific fine-tuning or paired data, making it adaptable across anatomical regions and acquisition setups. The entire pipeline supports efficient training with low computational cost due to the use of a lightweight architecture, which comprises only two convolutional layers. Evaluations on simulation, phantom, and in vivo data demonstrate superior contrast enhancement and structure preservation compared to both classical and deep learning-based denoising methods.

Via

Access Paper or Ask Questions

Medical Image Classification with KAN-Integrated Transformers and Dilated Neighborhood Attention

Feb 19, 2025

Omid Nejati Manzari, Hojat Asgariandehkordi, Taha Koleilat, Yiming Xiao, Hassan Rivaz

Abstract:Convolutional networks, transformers, hybrid models, and Mamba-based architectures have demonstrated strong performance across various medical image classification tasks. However, these methods were primarily designed to classify clean images using labeled data. In contrast, real-world clinical data often involve image corruptions that are unique to multi-center studies and stem from variations in imaging equipment across manufacturers. In this paper, we introduce the Medical Vision Transformer (MedViTV2), a novel architecture incorporating Kolmogorov-Arnold Network (KAN) layers into the transformer architecture for the first time, aiming for generalized medical image classification. We have developed an efficient KAN block to reduce computational load while enhancing the accuracy of the original MedViT. Additionally, to counteract the fragility of our MedViT when scaled up, we propose an enhanced Dilated Neighborhood Attention (DiNA), an adaptation of the efficient fused dot-product attention kernel capable of capturing global context and expanding receptive fields to scale the model effectively and addressing feature collapse issues. Moreover, a hierarchical hybrid strategy is introduced to stack our Local Feature Perception and Global Feature Perception blocks in an efficient manner, which balances local and global feature perceptions to boost performance. Extensive experiments on 17 medical image classification datasets and 12 corrupted medical image datasets demonstrate that MedViTV2 achieved state-of-the-art results in 27 out of 29 experiments with reduced computational complexity. MedViTV2 is 44\% more computationally efficient than the previous version and significantly enhances accuracy, achieving improvements of 4.6\% on MedMNIST, 5.8\% on NonMNIST, and 13.4\% on the MedMNIST-C benchmark.

Via

Access Paper or Ask Questions

Ultrasound Image Generation using Latent Diffusion Models

Feb 12, 2025

Benoit Freiche, Anthony El-Khoury, Ali Nasiri-Sarvi, Mahdi S. Hosseini, Damien Garcia, Adrian Basarab, Mathieu Boily, Hassan Rivaz

Abstract:Diffusion models for image generation have been a subject of increasing interest due to their ability to generate diverse, high-quality images. Image generation has immense potential in medical imaging because open-source medical images are difficult to obtain compared to natural images, especially for rare conditions. The generated images can be used later to train classification and segmentation models. In this paper, we propose simulating realistic ultrasound (US) images by successive fine-tuning of large diffusion models on different publicly available databases. To do so, we fine-tuned Stable Diffusion, a state-of-the-art latent diffusion model, on BUSI (Breast US Images) an ultrasound breast image dataset. We successfully generated high-quality US images of the breast using simple prompts that specify the organ and pathology, which appeared realistic to three experienced US scientists and a US radiologist. Additionally, we provided user control by conditioning the model with segmentations through ControlNet. We will release the source code at http://code.sonography.ai/ to allow fast US image generation to the scientific community.

* 6 pages conference paper for SPIE medical imaging

Via

Access Paper or Ask Questions

Constrained and Regularized Quantitative Ultrasound Parameter Estimation using ADMM

Jan 07, 2025

Ali K. Z. Tehrani, Hassan Rivaz, Ivan M. Rosado-Mendez

Figure 1 for Constrained and Regularized Quantitative Ultrasound Parameter Estimation using ADMM

Figure 2 for Constrained and Regularized Quantitative Ultrasound Parameter Estimation using ADMM

Figure 3 for Constrained and Regularized Quantitative Ultrasound Parameter Estimation using ADMM

Figure 4 for Constrained and Regularized Quantitative Ultrasound Parameter Estimation using ADMM

Abstract:Regularized estimation of quantitative ultrasound (QUS) parameters, such as attenuation and backscatter coefficients, has gained research interest. Recently, the alternating direction method of multipliers (ADMM) has been applied successfully to estimate these parameters, by utilizing L2 and L1 norms for attenuation and backscatter coefficient regularization, respectively. While this method improves upon previous approaches, it does not fully leverage the prior knowledge of minimum physically feasible parameter values, sometimes yielding values outside the realistic range. This work addresses this limitation by incorporating minimum QUS parameter values as constraints to enhance ADMM estimation. The proposed method is validated using experimental phantom data.

* accepted in ISBI 2025

Via

Access Paper or Ask Questions

CAMLD: Contrast-Agnostic Medical Landmark Detection with Consistency-Based Regularization

Nov 26, 2024

Soorena Salari, Arash Harirpoush, Hassan Rivaz, Yiming Xiao

Abstract:Anatomical landmark detection in medical images is essential for various clinical and research applications, including disease diagnosis and surgical planning. However, manual landmark annotation is time-consuming and requires significant expertise. Existing deep learning (DL) methods often require large amounts of well-annotated data, which are costly to acquire. In this paper, we introduce CAMLD, a novel self-supervised DL framework for anatomical landmark detection in unlabeled scans with varying contrasts by using only a single reference example. To achieve this, we employed an inter-subject landmark consistency loss with an image registration loss while introducing a 3D convolution-based contrast augmentation strategy to promote model generalization to new contrasts. Additionally, we utilize an adaptive mixed loss function to schedule the contributions of different sub-tasks for optimal outcomes. We demonstrate the proposed method with the intricate task of MRI-based 3D brain landmark detection. With comprehensive experiments on four diverse clinical and public datasets, including both T1w and T2w MRI scans at different MRI field strengths, we demonstrate that CAMLD outperforms the state-of-the-art methods in terms of mean radial errors (MREs) and success detection rates (SDRs). Our framework provides a robust and accurate solution for anatomical landmark detection, reducing the need for extensively annotated datasets and generalizing well across different imaging contrasts. Our code will be publicly available at: https://github.com/HealthX-Lab/CAMLD.

* 14 pages, 6 figures, 3 tables

Via

Access Paper or Ask Questions

Reliability of deep learning models for anatomical landmark detection: The role of inter-rater variability

Nov 26, 2024

Soorena Salari, Hassan Rivaz, Yiming Xiao

Figure 1 for Reliability of deep learning models for anatomical landmark detection: The role of inter-rater variability

Figure 2 for Reliability of deep learning models for anatomical landmark detection: The role of inter-rater variability

Figure 3 for Reliability of deep learning models for anatomical landmark detection: The role of inter-rater variability

Figure 4 for Reliability of deep learning models for anatomical landmark detection: The role of inter-rater variability

Abstract:Automated detection of anatomical landmarks plays a crucial role in many diagnostic and surgical applications. Progresses in deep learning (DL) methods have resulted in significant performance enhancement in tasks related to anatomical landmark detection. While current research focuses on accurately localizing these landmarks in medical scans, the importance of inter-rater annotation variability in building DL models is often overlooked. Understanding how inter-rater variability impacts the performance and reliability of the resulting DL algorithms, which are crucial for clinical deployment, can inform the improvement of training data construction and boost DL models' outcomes. In this paper, we conducted a thorough study of different annotation-fusion strategies to preserve inter-rater variability in DL models for anatomical landmark detection, aiming to boost the performance and reliability of the resulting algorithms. Additionally, we explored the characteristics and reliability of four metrics, including a novel Weighted Coordinate Variance metric to quantify landmark detection uncertainty/inter-rater variability. Our research highlights the crucial connection between inter-rater variability, DL-models performances, and uncertainty, revealing how different approaches for multi-rater landmark annotation fusion can influence these factors.

* Accepted to SPIE Medical Imaging 2025

Via

Access Paper or Ask Questions

Comparative Analysis of Diffusion Generative Models in Computational Pathology

Nov 24, 2024

Denisha Thakkar, Vincent Quoc-Huy Trinh, Sonal Varma, Samira Ebrahimi Kahou, Hassan Rivaz, Mahdi S. Hosseini

Figure 1 for Comparative Analysis of Diffusion Generative Models in Computational Pathology

Figure 2 for Comparative Analysis of Diffusion Generative Models in Computational Pathology

Figure 3 for Comparative Analysis of Diffusion Generative Models in Computational Pathology

Figure 4 for Comparative Analysis of Diffusion Generative Models in Computational Pathology

Abstract:Diffusion Generative Models (DGM) have rapidly surfaced as emerging topics in the field of computer vision, garnering significant interest across a wide array of deep learning applications. Despite their high computational demand, these models are extensively utilized for their superior sample quality and robust mode coverage. While research in diffusion generative models is advancing, exploration within the domain of computational pathology and its large-scale datasets has been comparatively gradual. Bridging the gap between the high-quality generation capabilities of Diffusion Generative Models and the intricate nature of pathology data, this paper presents an in-depth comparative analysis of diffusion methods applied to a pathology dataset. Our analysis extends to datasets with varying Fields of View (FOV), revealing that DGMs are highly effective in producing high-quality synthetic data. An ablative study is also conducted, followed by a detailed discussion on the impact of various methods on the synthesized histopathology images. One striking observation from our experiments is how the adjustment of image size during data generation can simulate varying fields of view. These findings underscore the potential of DGMs to enhance the quality and diversity of synthetic pathology data, especially when used with real data, ultimately increasing accuracy of deep learning models in histopathology. Code is available from https://github.com/AtlasAnalyticsLab/Diffusion4Path

* Submitted paper under review

Via

Access Paper or Ask Questions

BiomedCoOp: Learning to Prompt for Biomedical Vision-Language Models

Nov 21, 2024

Taha Koleilat, Hojat Asgariandehkordi, Hassan Rivaz, Yiming Xiao

Figure 1 for BiomedCoOp: Learning to Prompt for Biomedical Vision-Language Models

Figure 2 for BiomedCoOp: Learning to Prompt for Biomedical Vision-Language Models

Figure 3 for BiomedCoOp: Learning to Prompt for Biomedical Vision-Language Models

Figure 4 for BiomedCoOp: Learning to Prompt for Biomedical Vision-Language Models

Abstract:Recent advancements in vision-language models (VLMs), such as CLIP, have demonstrated substantial success in self-supervised representation learning for vision tasks. However, effectively adapting VLMs to downstream applications remains challenging, as their accuracy often depends on time-intensive and expertise-demanding prompt engineering, while full model fine-tuning is costly. This is particularly true for biomedical images, which, unlike natural images, typically suffer from limited annotated datasets, unintuitive image contrasts, and nuanced visual features. Recent prompt learning techniques, such as Context Optimization (CoOp) intend to tackle these issues, but still fall short in generalizability. Meanwhile, explorations in prompt learning for biomedical image analysis are still highly limited. In this work, we propose BiomedCoOp, a novel prompt learning framework that enables efficient adaptation of BiomedCLIP for accurate and highly generalizable few-shot biomedical image classification. Our approach achieves effective prompt context learning by leveraging semantic consistency with average prompt ensembles from Large Language Models (LLMs) and knowledge distillation with a statistics-based prompt selection strategy. We conducted comprehensive validation of our proposed framework on 11 medical datasets across 9 modalities and 10 organs against existing state-of-the-art methods, demonstrating significant improvements in both accuracy and generalizability. The code will be publicly available at https://github.com/HealthX-Lab/BiomedCoOp.

* 18 pages, 5 figures, 10 tables

Via

Access Paper or Ask Questions

Ensemble Learning for Microbubble Localization in Super-Resolution Ultrasound

Nov 11, 2024

Sepideh K. Gharamaleki, Brandon Helfield, Hassan Rivaz

Figure 1 for Ensemble Learning for Microbubble Localization in Super-Resolution Ultrasound

Figure 2 for Ensemble Learning for Microbubble Localization in Super-Resolution Ultrasound

Figure 3 for Ensemble Learning for Microbubble Localization in Super-Resolution Ultrasound

Figure 4 for Ensemble Learning for Microbubble Localization in Super-Resolution Ultrasound

Abstract:Super-resolution ultrasound (SR-US) is a powerful imaging technique for capturing microvasculature and blood flow at high spatial resolution. However, accurate microbubble (MB) localization remains a key challenge, as errors in localization can propagate through subsequent stages of the super-resolution process, affecting overall performance. In this paper, we explore the potential of ensemble learning techniques to enhance MB localization by increasing detection sensitivity and reducing false positives. Our study evaluates the effectiveness of ensemble methods on both in vivo and simulated outputs of a Deformable DEtection TRansformer (Deformable DETR) network. As a result of our study, we are able to demonstrate the advantages of these ensemble approaches by showing improved precision and recall in MB detection and offering insights into their application in SR-US.

Via

Access Paper or Ask Questions

Evaluating Detection Thresholds: The Impact of False Positives and Negatives on Super-Resolution Ultrasound Localization Microscopy

Nov 11, 2024

Sepideh K. Gharamaleki, Brandon Helfield, Hassan Rivaz

Figure 1 for Evaluating Detection Thresholds: The Impact of False Positives and Negatives on Super-Resolution Ultrasound Localization Microscopy

Figure 2 for Evaluating Detection Thresholds: The Impact of False Positives and Negatives on Super-Resolution Ultrasound Localization Microscopy

Figure 3 for Evaluating Detection Thresholds: The Impact of False Positives and Negatives on Super-Resolution Ultrasound Localization Microscopy

Abstract:Super-resolution ultrasound imaging with ultrasound localization microscopy (ULM) offers a high-resolution view of microvascular structures. Yet, ULM image quality heavily relies on precise microbubble (MB) detection. Despite the crucial role of localization algorithms, there has been limited focus on the practical pitfalls in MB detection tasks such as setting the detection threshold. This study examines how False Positives (FPs) and False Negatives (FNs) affect ULM image quality by systematically adding controlled detection errors to simulated data. Results indicate that while both FP and FN rates impact Peak Signal-to-Noise Ratio (PSNR) similarly, increasing FP rates from 0\% to 20\% decreases Structural Similarity Index (SSIM) by 7\%, whereas same FN rates cause a greater drop of around 45\%. Moreover, dense MB regions are more resilient to detection errors, while sparse regions show high sensitivity, showcasing the need for robust MB detection frameworks to enhance super-resolution imaging.

Via

Access Paper or Ask Questions