Yang Bai

Enhancing Community Vision Screening -- AI Driven Retinal Photography for Early Disease Detection and Patient Trust

Oct 27, 2024
Via arXiv

From Generalist to Specialist: Adapting Vision Language Models via Task-Specific Visual Instruction Tuning

Oct 09, 2024
Via arXiv

Length Desensitization in Directed Preference Optimization

Sep 10, 2024
Via arXiv

UrFound: Towards Universal Retinal Foundation Models via Knowledge-Guided Masked Modeling

Aug 10, 2024
Via arXiv

Video Watermarking: Safeguarding Your Video from (Unauthorized) Annotations by Video-based LLMs

Jul 02, 2024
Via arXiv

Adversarial Robustness for Visual Grounding of Multimodal Large Language Models

May 16, 2024
Via arXiv

Special Characters Attack: Toward Scalable Training Data Extraction From Large Language Models

May 09, 2024
Via arXiv

Semi-supervised Text-based Person Search

Apr 28, 2024
Via arXiv

Energy-Latency Manipulation of Multi-modal Large Language Models via Verbose Samples

Apr 25, 2024
Via arXiv

MedRG: Medical Report Grounding with Multi-modal Large Language Model

Apr 10, 2024
Via arXiv