Xiaoyan Cai

Instruction-Aligned Visual Attention for Mitigating Hallucinations in Large Vision-Language Models

Mar 24, 2025

CoF: Coarse to Fine-Grained Image Understanding for Multi-modal Large Language Models

Dec 22, 2024

Enhancing Fine-Grained Vision-Language Pretraining with Negative Augmented Samples

Dec 13, 2024

MoDULA: Mixture of Domain-Specific and Universal LoRA for Multi-Task Learning

Dec 10, 2024

MLoRA: Multi-Domain Low-Rank Adaptive Network for CTR Prediction

Aug 14, 2024

General2Specialized LLMs Translation for E-commerce

Mar 06, 2024

ChatRadio-Valuer: A Chat Large Language Model for Generalizable Radiology Report Generation Based on Multi-institution and Multi-system Data

Oct 10, 2023

Evaluating Large Language Models for Radiology Natural Language Processing

Jul 27, 2023

ImpressionGPT: An Iterative Optimizing Framework for Radiology Report Summarization with ChatGPT

May 03, 2023

A Skeleton-Based Model for Promoting Coherence Among Sentences in Narrative Story Generation

Aug 27, 2018