Zhihuan Yu

EventLens: Leveraging Event-Aware Pretraining and Cross-modal Linking Enhances Visual Commonsense Reasoning

Apr 22, 2024
Via arXiv