Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mozhdeh Rouhsedaghat

LLMs in Biomedicine: A study on clinical Named Entity Recognition

Apr 10, 2024

Masoud Monajatipoor, Jiaxin Yang, Joel Stremmel, Melika Emami, Fazlolah Mohaghegh, Mozhdeh Rouhsedaghat, Kai-Wei Chang

Figure 1 for LLMs in Biomedicine: A study on clinical Named Entity Recognition

Figure 2 for LLMs in Biomedicine: A study on clinical Named Entity Recognition

Figure 3 for LLMs in Biomedicine: A study on clinical Named Entity Recognition

Figure 4 for LLMs in Biomedicine: A study on clinical Named Entity Recognition

Abstract:Large Language Models (LLMs) demonstrate remarkable versatility in various NLP tasks but encounter distinct challenges in biomedicine due to medical language complexities and data scarcity. This paper investigates the application of LLMs in the medical domain by exploring strategies to enhance their performance for the Named-Entity Recognition (NER) task. Specifically, our study reveals the importance of meticulously designed prompts in biomedicine. Strategic selection of in-context examples yields a notable improvement, showcasing ~15-20\% increase in F1 score across all benchmark datasets for few-shot clinical NER. Additionally, our findings suggest that integrating external resources through prompting strategies can bridge the gap between general-purpose LLM proficiency and the specialized demands of medical NER. Leveraging a medical knowledge base, our proposed method inspired by Retrieval-Augmented Generation (RAG) can boost the F1 score of LLMs for zero-shot clinical NER. We will release the code upon publication.

Via

Access Paper or Ask Questions

MetaVL: Transferring In-Context Learning Ability From Language Models to Vision-Language Models

Jun 02, 2023

Masoud Monajatipoor, Liunian Harold Li, Mozhdeh Rouhsedaghat, Lin F. Yang, Kai-Wei Chang

Figure 1 for MetaVL: Transferring In-Context Learning Ability From Language Models to Vision-Language Models

Figure 2 for MetaVL: Transferring In-Context Learning Ability From Language Models to Vision-Language Models

Figure 3 for MetaVL: Transferring In-Context Learning Ability From Language Models to Vision-Language Models

Figure 4 for MetaVL: Transferring In-Context Learning Ability From Language Models to Vision-Language Models

Abstract:Large-scale language models have shown the ability to adapt to a new task via conditioning on a few demonstrations (i.e., in-context learning). However, in the vision-language domain, most large-scale pre-trained vision-language (VL) models do not possess the ability to conduct in-context learning. How can we enable in-context learning for VL models? In this paper, we study an interesting hypothesis: can we transfer the in-context learning ability from the language domain to VL domain? Specifically, we first meta-trains a language model to perform in-context learning on NLP tasks (as in MetaICL); then we transfer this model to perform VL tasks by attaching a visual encoder. Our experiments suggest that indeed in-context learning ability can be transferred cross modalities: our model considerably improves the in-context learning capability on VL tasks and can even compensate for the size of the model significantly. On VQA, OK-VQA, and GQA, our method could outperform the baseline model while having 20 times fewer parameters.

Via

Access Paper or Ask Questions

MAGIC: Mask-Guided Image Synthesis by Inverting a Quasi-Robust Classifier

Sep 23, 2022

Mozhdeh Rouhsedaghat, Masoud Monajatipoor, Kai-Wei Chang, C. -C. Jay Kuo, Iacopo Masi

Figure 1 for MAGIC: Mask-Guided Image Synthesis by Inverting a Quasi-Robust Classifier

Figure 2 for MAGIC: Mask-Guided Image Synthesis by Inverting a Quasi-Robust Classifier

Figure 3 for MAGIC: Mask-Guided Image Synthesis by Inverting a Quasi-Robust Classifier

Figure 4 for MAGIC: Mask-Guided Image Synthesis by Inverting a Quasi-Robust Classifier

Abstract:We offer a method for one-shot image synthesis that allows controlling manipulations of a single image by inverting a quasi-robust classifier equipped with strong regularizers. Our proposed method, entitled Magic, samples structured gradients from a pre-trained quasi-robust classifier to better preserve the input semantics while preserving its classification accuracy, thereby guaranteeing credibility in the synthesis. Unlike current methods that use complex primitives to supervise the process or use attention maps as a weak supervisory signal, Magic aggregates gradients over the input, driven by a guide binary mask that enforces a strong, spatial prior. Magic implements a series of manipulations with a single framework achieving shape and location control, intense non-rigid shape deformations, and copy/move operations in the presence of repeating objects and gives users firm control over the synthesis by requiring simply specifying binary guide masks. Our study and findings are supported by various qualitative comparisons with the state-of-the-art on the same images sampled from ImageNet and quantitative analysis using machine perception along with a user survey of 100+ participants that endorse our synthesis quality.

* 12 pages, 9 figures, technical report

Via

Access Paper or Ask Questions

BERTHop: An Effective Vision-and-Language Model for Chest X-ray Disease Diagnosis

Aug 10, 2021

Masoud Monajatipoor, Mozhdeh Rouhsedaghat, Liunian Harold Li, Aichi Chien, C. -C. Jay Kuo, Fabien Scalzo, Kai-Wei Chang

Figure 1 for BERTHop: An Effective Vision-and-Language Model for Chest X-ray Disease Diagnosis

Figure 2 for BERTHop: An Effective Vision-and-Language Model for Chest X-ray Disease Diagnosis

Figure 3 for BERTHop: An Effective Vision-and-Language Model for Chest X-ray Disease Diagnosis

Figure 4 for BERTHop: An Effective Vision-and-Language Model for Chest X-ray Disease Diagnosis

Abstract:Vision-and-language(V&L) models take image and text as input and learn to capture the associations between them. Prior studies show that pre-trained V&L models can significantly improve the model performance for downstream tasks such as Visual Question Answering (VQA). However, V&L models are less effective when applied in the medical domain (e.g., on X-ray images and clinical notes) due to the domain gap. In this paper, we investigate the challenges of applying pre-trained V&L models in medical applications. In particular, we identify that the visual representation in general V&L models is not suitable for processing medical data. To overcome this limitation, we propose BERTHop, a transformer-based model based on PixelHop++ and VisualBERT, for better capturing the associations between the two modalities. Experiments on the OpenI dataset, a commonly used thoracic disease diagnosis benchmark, show that BERTHop achieves an average Area Under the Curve (AUC) of 98.12% which is 1.62% higher than state-of-the-art (SOTA) while it is trained on a 9 times smaller dataset.

* 10 pages, 8 figures, Accepted in ICCV workshop

Via

Access Paper or Ask Questions

DefakeHop: A Light-Weight High-Performance Deepfake Detector

Mar 11, 2021

Hong-Shuo Chen, Mozhdeh Rouhsedaghat, Hamza Ghani, Shuowen Hu, Suya You, C. -C. Jay Kuo

Figure 1 for DefakeHop: A Light-Weight High-Performance Deepfake Detector

Figure 2 for DefakeHop: A Light-Weight High-Performance Deepfake Detector

Figure 3 for DefakeHop: A Light-Weight High-Performance Deepfake Detector

Figure 4 for DefakeHop: A Light-Weight High-Performance Deepfake Detector

Abstract:A light-weight high-performance Deepfake detection method, called DefakeHop, is proposed in this work. State-of-the-art Deepfake detection methods are built upon deep neural networks. DefakeHop extracts features automatically using the successive subspace learning (SSL) principle from various parts of face images. The features are extracted by c/w Saab transform and further processed by our feature distillation module using spatial dimension reduction and soft classification for each channel to get a more concise description of the face. Extensive experiments are conducted to demonstrate the effectiveness of the proposed DefakeHop method. With a small model size of 42,845 parameters, DefakeHop achieves state-of-the-art performance with the area under the ROC curve (AUC) of 100%, 94.95%, and 90.56% on UADFV, Celeb-DF v1 and Celeb-DF v2 datasets, respectively.

* Accepted at ICME 2021

Via

Access Paper or Ask Questions

Successive Subspace Learning: An Overview

Feb 27, 2021

Mozhdeh Rouhsedaghat, Masoud Monajatipoor, Zohreh Azizi, C. -C. Jay Kuo

Figure 1 for Successive Subspace Learning: An Overview

Abstract:Successive Subspace Learning (SSL) offers a light-weight unsupervised feature learning method based on inherent statistical properties of data units (e.g. image pixels and points in point cloud sets). It has shown promising results, especially on small datasets. In this paper, we intuitively explain this method, provide an overview of its development, and point out some open questions and challenges for future research.

* 4 pages, 1 figure

Via

Access Paper or Ask Questions

Low-Resolution Face Recognition In Resource-Constrained Environments

Nov 23, 2020

Mozhdeh Rouhsedaghat, Yifan Wang, Shuowen Hu, Suya You, C. -C. Jay Kuo

Figure 1 for Low-Resolution Face Recognition In Resource-Constrained Environments

Figure 2 for Low-Resolution Face Recognition In Resource-Constrained Environments

Figure 3 for Low-Resolution Face Recognition In Resource-Constrained Environments

Figure 4 for Low-Resolution Face Recognition In Resource-Constrained Environments

Abstract:A non-parametric low-resolution face recognition model for resource-constrained environments with limited networking and computing is proposed in this work. Such environments often demand a small model capable of being effectively trained on a small number of labeled data samples, with low training complexity, and low-resolution input images. To address these challenges, we adopt an emerging explainable machine learning methodology called successive subspace learning (SSL).SSL offers an explainable non-parametric model that flexibly trades the model size for verification performance. Its training complexity is significantly lower since its model is trained in a one-pass feedforward manner without backpropagation. Furthermore, active learning can be conveniently incorporated to reduce the labeling cost. The effectiveness of the proposed model is demonstrated by experiments on the LFW and the CMU Multi-PIE datasets.

* 11 pages, 5 figures, under consideration at Pattern Recognition Letters

Via

Access Paper or Ask Questions

FaceHop: A Light-Weight Low-Resolution Face Gender Classification Method

Jul 21, 2020

Mozhdeh Rouhsedaghat, Yifan Wang, Xiou Ge, Shuowen Hu, Suya You, C. -C. Jay Kuo

Figure 1 for FaceHop: A Light-Weight Low-Resolution Face Gender Classification Method

Figure 2 for FaceHop: A Light-Weight Low-Resolution Face Gender Classification Method

Figure 3 for FaceHop: A Light-Weight Low-Resolution Face Gender Classification Method

Figure 4 for FaceHop: A Light-Weight Low-Resolution Face Gender Classification Method

Abstract:A light-weight low-resolution face gender classification method, called FaceHop, is proposed in this research. We have witnessed a rapid progress in face gender classification accuracy due to the adoption of deep learning (DL) technology. Yet, DL-based systems are not suitable for resource-constrained environments with limited networking and computing. FaceHop offers an interpretable non-parametric machine learning solution. It has desired characteristics such as a small model size, a small training data amount, low training complexity, and low resolution input images. FaceHop is developed with the successive subspace learning (SSL) principle and built upon the foundation of PixelHop++. The effectiveness of the FaceHop method is demonstrated by experiments. For gray-scale face images of resolution $32 \times 32$ in the LFW and the CMU Multi-PIE datasets, FaceHop achieves correct gender classification rates of 94.63\% and 95.12\% with model sizes of 16.9K and 17.6K parameters, respectively. It outperforms LeNet-5 in classification accuracy while LeNet-5 has a model size of 75.8K parameters.

Via

Access Paper or Ask Questions

PixelHop++: A Small Successive-Subspace-Learning-Based (SSL-based) Model for Image Classification

Feb 08, 2020

Yueru Chen, Mozhdeh Rouhsedaghat, Suya You, Raghuveer Rao, C. -C. Jay Kuo

Figure 1 for PixelHop++: A Small Successive-Subspace-Learning-Based (SSL-based) Model for Image Classification

Figure 2 for PixelHop++: A Small Successive-Subspace-Learning-Based (SSL-based) Model for Image Classification

Figure 3 for PixelHop++: A Small Successive-Subspace-Learning-Based (SSL-based) Model for Image Classification

Figure 4 for PixelHop++: A Small Successive-Subspace-Learning-Based (SSL-based) Model for Image Classification

Abstract:The successive subspace learning (SSL) principle was developed and used to design an interpretable learning model, known as the PixelHop method,for image classification in our prior work. Here, we propose an improved PixelHop method and call it PixelHop++. First, to make the PixelHop model size smaller, we decouple a joint spatial-spectral input tensor to multiple spatial tensors (one for each spectral component) under the spatial-spectral separability assumption and perform the Saab transform in a channel-wise manner, called the channel-wise (c/w) Saab transform.Second, by performing this operation from one hop to another successively, we construct a channel-decomposed feature tree whose leaf nodes contain features of one dimension (1D). Third, these 1D features are ranked according to their cross-entropy values, which allows us to select a subset of discriminant features for image classification. In PixelHop++, one can control the learning model size of fine-granularity,offering a flexible tradeoff between the model size and the classification performance. We demonstrate the flexibility of PixelHop++ on MNIST, Fashion MNIST, and CIFAR-10 three datasets.

* 5 pages, 5 figures, 4 tables, Submitted to ICIP 2020

Via

Access Paper or Ask Questions