Yeyuan Wang

CoF: Coarse to Fine-Grained Image Understanding for Multi-modal Large Language Models

Dec 22, 2024

Enhancing Fine-Grained Vision-Language Pretraining with Negative Augmented Samples

Dec 13, 2024