Picture for An-Lan Wang

An-Lan Wang

MCTBench: Multimodal Cognition towards Text-Rich Visual Scenes Benchmark

Add code
Oct 15, 2024
Viaarxiv icon

ParGo: Bridging Vision-Language with Partial and Global Views

Add code
Aug 23, 2024
Viaarxiv icon

EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding

Add code
Jun 13, 2024
Viaarxiv icon

Event-Guided Procedure Planning from Instructional Videos with Text Supervision

Add code
Aug 17, 2023
Viaarxiv icon