Picture for Mianzhi Pan

Mianzhi Pan

Food-500 Cap: A Fine-Grained Food Caption Benchmark for Evaluating Vision-Language Models

Add code
Aug 06, 2023
Viaarxiv icon

Probing Cross-modal Semantics Alignment Capability from the Textual Perspective

Add code
Oct 18, 2022
Figure 1 for Probing Cross-modal Semantics Alignment Capability from the Textual Perspective
Figure 2 for Probing Cross-modal Semantics Alignment Capability from the Textual Perspective
Figure 3 for Probing Cross-modal Semantics Alignment Capability from the Textual Perspective
Figure 4 for Probing Cross-modal Semantics Alignment Capability from the Textual Perspective
Viaarxiv icon