Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Gongbo Liang

Benchmarking Robustness of Contrastive Learning Models for Medical Image-Report Retrieval

Jan 15, 2025

Demetrio Deanda, Yuktha Priya Masupalli, Jeong Yang, Young Lee, Zechun Cao, Gongbo Liang

Figure 1 for Benchmarking Robustness of Contrastive Learning Models for Medical Image-Report Retrieval

Figure 2 for Benchmarking Robustness of Contrastive Learning Models for Medical Image-Report Retrieval

Figure 3 for Benchmarking Robustness of Contrastive Learning Models for Medical Image-Report Retrieval

Figure 4 for Benchmarking Robustness of Contrastive Learning Models for Medical Image-Report Retrieval

Abstract:Medical images and reports offer invaluable insights into patient health. The heterogeneity and complexity of these data hinder effective analysis. To bridge this gap, we investigate contrastive learning models for cross-domain retrieval, which associates medical images with their corresponding clinical reports. This study benchmarks the robustness of four state-of-the-art contrastive learning models: CLIP, CXR-RePaiR, MedCLIP, and CXR-CLIP. We introduce an occlusion retrieval task to evaluate model performance under varying levels of image corruption. Our findings reveal that all evaluated models are highly sensitive to out-of-distribution data, as evidenced by the proportional decrease in performance with increasing occlusion levels. While MedCLIP exhibits slightly more robustness, its overall performance remains significantly behind CXR-CLIP and CXR-RePaiR. CLIP, trained on a general-purpose dataset, struggles with medical image-report retrieval, highlighting the importance of domain-specific training data. The evaluation of this work suggests that more effort needs to be spent on improving the robustness of these models. By addressing these limitations, we can develop more reliable cross-domain retrieval models for medical applications.

* This work is accepted to AAAI 2025 Workshop -- the 9th International Workshop on Health Intelligence

Via

Access Paper or Ask Questions

Mutation-Based Adversarial Attacks on Neural Text Detectors

Feb 11, 2023

Gongbo Liang, Jesus Guerrero, Izzat Alsmadi

Figure 1 for Mutation-Based Adversarial Attacks on Neural Text Detectors

Figure 2 for Mutation-Based Adversarial Attacks on Neural Text Detectors

Figure 3 for Mutation-Based Adversarial Attacks on Neural Text Detectors

Abstract:Neural text detectors aim to decide the characteristics that distinguish neural (machine-generated) from human texts. To challenge such detectors, adversarial attacks can alter the statistical characteristics of the generated text, making the detection task more and more difficult. Inspired by the advances of mutation analysis in software development and testing, in this paper, we propose character- and word-based mutation operators for generating adversarial samples to attack state-of-the-art natural text detectors. This falls under white-box adversarial attacks. In such attacks, attackers have access to the original text and create mutation instances based on this original text. The ultimate goal is to confuse machine learning models and classifiers and decrease their prediction accuracy.

Via

Access Paper or Ask Questions

A Mutation-based Text Generation for Adversarial Machine Learning Applications

Dec 21, 2022

Jesus Guerrero, Gongbo Liang, Izzat Alsmadi

Abstract:Many natural language related applications involve text generation, created by humans or machines. While in many of those applications machines support humans, yet in few others, (e.g. adversarial machine learning, social bots and trolls) machines try to impersonate humans. In this scope, we proposed and evaluated several mutation-based text generation approaches. Unlike machine-based generated text, mutation-based generated text needs human text samples as inputs. We showed examples of mutation operators but this work can be extended in many aspects such as proposing new text-based mutation operators based on the nature of the application.

Via

Access Paper or Ask Questions

Benchmark Assessment for DeepSpeed Optimization Library

Feb 12, 2022

Gongbo Liang, Izzat Alsmadi

Figure 1 for Benchmark Assessment for DeepSpeed Optimization Library

Figure 2 for Benchmark Assessment for DeepSpeed Optimization Library

Abstract:Deep Learning (DL) models are widely used in machine learning due to their performance and ability to deal with large datasets while producing high accuracy and performance metrics. The size of such datasets and the complexity of DL models cause such models to be complex, consuming large amount of resources and time to train. Many recent libraries and applications are introduced to deal with DL complexity and efficiency issues. In this paper, we evaluated one example, Microsoft DeepSpeed library through classification tasks. DeepSpeed public sources reported classification performance metrics on the LeNet architecture. We extended this through evaluating the library on several modern neural network architectures, including convolutional neural networks (CNNs) and Vision Transformer (ViT). Results indicated that DeepSpeed, while can make improvements in some of those cases, it has no or negative impact on others.

Via

Access Paper or Ask Questions

Dynamic Feature Alignment for Semi-supervised Domain Adaptation

Oct 18, 2021

Yu Zhang, Gongbo Liang, Nathan Jacobs

Figure 1 for Dynamic Feature Alignment for Semi-supervised Domain Adaptation

Figure 2 for Dynamic Feature Alignment for Semi-supervised Domain Adaptation

Figure 3 for Dynamic Feature Alignment for Semi-supervised Domain Adaptation

Figure 4 for Dynamic Feature Alignment for Semi-supervised Domain Adaptation

Abstract:Most research on domain adaptation has focused on the purely unsupervised setting, where no labeled examples in the target domain are available. However, in many real-world scenarios, a small amount of labeled target data is available and can be used to improve adaptation. We address this semi-supervised setting and propose to use dynamic feature alignment to address both inter- and intra-domain discrepancy. Unlike previous approaches, which attempt to align source and target features within a mini-batch, we propose to align the target features to a set of dynamically updated class prototypes, which we use both for minimizing divergence and pseudo-labeling. By updating based on class prototypes, we avoid problems that arise in previous approaches due to class imbalances. Our approach, which doesn't require extensive tuning or adversarial training, significantly improves the state of the art for semi-supervised domain adaptation. We provide a quantitative evaluation on two standard datasets, DomainNet and Office-Home, and performance analysis.

* BMVC 2021

Via

Access Paper or Ask Questions

Optical Wavelength Guided Self-Supervised Feature Learning For Galaxy Cluster Richness Estimate

Dec 04, 2020

Gongbo Liang, Yuanyuan Su, Sheng-Chieh Lin, Yu Zhang, Yuanyuan Zhang, Nathan Jacobs

Figure 1 for Optical Wavelength Guided Self-Supervised Feature Learning For Galaxy Cluster Richness Estimate

Figure 2 for Optical Wavelength Guided Self-Supervised Feature Learning For Galaxy Cluster Richness Estimate

Abstract:Most galaxies in the nearby Universe are gravitationally bound to a cluster or group of galaxies. Their optical contents, such as optical richness, are crucial for understanding the co-evolution of galaxies and large-scale structures in modern astronomy and cosmology. The determination of optical richness can be challenging. We propose a self-supervised approach for estimating optical richness from multi-band optical images. The method uses the data properties of the multi-band optical images for pre-training, which enables learning feature representations from a large but unlabeled dataset. We apply the proposed method to the Sloan Digital Sky Survey. The result shows our estimate of optical richness lowers the mean absolute error and intrinsic scatter by 11.84% and 20.78%, respectively, while reducing the need for labeled training data by up to 60%. We believe the proposed method will benefit astronomy and cosmology, where a large number of unlabeled multi-band images are available, but acquiring image labels is costly.

* Accepted to NeurIPS 2020 Workshop on Machine Learning and the Physical Sciences

Via

Access Paper or Ask Questions

Dynamic Image for 3D MRI Image Alzheimer's Disease Classification

Nov 30, 2020

Xin Xing, Gongbo Liang, Hunter Blanton, Muhammad Usman Rafique, Chris Wang, Ai-Ling Lin, Nathan Jacobs

Figure 1 for Dynamic Image for 3D MRI Image Alzheimer's Disease Classification

Figure 2 for Dynamic Image for 3D MRI Image Alzheimer's Disease Classification

Figure 3 for Dynamic Image for 3D MRI Image Alzheimer's Disease Classification

Figure 4 for Dynamic Image for 3D MRI Image Alzheimer's Disease Classification

Abstract:We propose to apply a 2D CNN architecture to 3D MRI image Alzheimer's disease classification. Training a 3D convolutional neural network (CNN) is time-consuming and computationally expensive. We make use of approximate rank pooling to transform the 3D MRI image volume into a 2D image to use as input to a 2D CNN. We show our proposed CNN model achieves $9.5\%$ better Alzheimer's disease classification accuracy than the baseline 3D models. We also show that our method allows for efficient training, requiring only 20% of the training time compared to 3D CNN models. The code is available online: https://github.com/UkyVision/alzheimer-project.

* Accepted to ECCV2020 Workshop on BioImage Computing

Via

Access Paper or Ask Questions

Weakly-Supervised Feature Learning via Text and Image Matching

Oct 06, 2020

Gongbo Liang, Connor Greenwell, Yu Zhang, Xiaoqin Wang, Ramakanth Kavuluru, Nathan Jacobs

Figure 1 for Weakly-Supervised Feature Learning via Text and Image Matching

Figure 2 for Weakly-Supervised Feature Learning via Text and Image Matching

Figure 3 for Weakly-Supervised Feature Learning via Text and Image Matching

Figure 4 for Weakly-Supervised Feature Learning via Text and Image Matching

Abstract:When training deep neural networks for medical image classification, obtaining a sufficient number of manually annotated images is often a significant challenge. We propose to use textual findings, which are routinely written by clinicians during manual image analysis, to help overcome this problem. The key idea is to use a contrastive loss to train image and text feature extractors to recognize if a given image-finding pair is a true match. The learned image feature extractor is then fine-tuned, in a transfer learning setting, for a supervised classification task. This approach makes it possible to train using large datasets because pairs of images and textual findings are widely available in medical records. We evaluate our method on three datasets and find consistent performance improvements. The biggest gains are realized when fewer manually labeled examples are available. In some cases, our method achieves the same performance as the baseline even when using 70\%--98\% fewer labeled examples.

Via

Access Paper or Ask Questions

Improved Trainable Calibration Method for Neural Networks on Medical Imaging Classification

Sep 09, 2020

Gongbo Liang, Yu Zhang, Xiaoqin Wang, Nathan Jacobs

Figure 1 for Improved Trainable Calibration Method for Neural Networks on Medical Imaging Classification

Figure 2 for Improved Trainable Calibration Method for Neural Networks on Medical Imaging Classification

Figure 3 for Improved Trainable Calibration Method for Neural Networks on Medical Imaging Classification

Figure 4 for Improved Trainable Calibration Method for Neural Networks on Medical Imaging Classification

Abstract:Recent works have shown that deep neural networks can achieve super-human performance in a wide range of image classification tasks in the medical imaging domain. However, these works have primarily focused on classification accuracy, ignoring the important role of uncertainty quantification. Empirically, neural networks are often miscalibrated and overconfident in their predictions. This miscalibration could be problematic in any automatic decision-making system, but we focus on the medical field in which neural network miscalibration has the potential to lead to significant treatment errors. We propose a novel calibration approach that maintains the overall classification accuracy while significantly improving model calibration. The proposed approach is based on expected calibration error, which is a common metric for quantifying miscalibration. Our approach can be easily integrated into any classification task as an auxiliary loss term, thus not requiring an explicit training round for calibration. We show that our approach reduces calibration error significantly across various architectures and datasets.

* Accepted to the 31th British Machine Vision Conference (BMVC 2020)

Via

Access Paper or Ask Questions

Unsupervised Domain Adaptation for Mammogram Image Classification: A Promising Tool for Model Generalization

Mar 02, 2020

Yu Zhang, Gongbo Liang, Nathan Jacobs, Xiaoqin Wang

Figure 1 for Unsupervised Domain Adaptation for Mammogram Image Classification: A Promising Tool for Model Generalization

Figure 2 for Unsupervised Domain Adaptation for Mammogram Image Classification: A Promising Tool for Model Generalization

Abstract:Generalization is one of the key challenges in the clinical validation and application of deep learning models to medical images. Studies have shown that such models trained on publicly available datasets often do not work well on real-world clinical data due to the differences in patient population and image device configurations. Also, manually annotating clinical images is expensive. In this work, we propose an unsupervised domain adaptation (UDA) method using Cycle-GAN to improve the generalization ability of the model without using any additional manual annotations.

* Accepted by C-MIMI 2019 as a scientific abstract

Via

Access Paper or Ask Questions