Picture for Zili Li

Zili Li

Augment the Pairs: Semantics-Preserving Image-Caption Pair Augmentation for Grounding-Based Vision and Language Models

Add code
Nov 05, 2023
Viaarxiv icon