Picture for Yingxuan Li

Yingxuan Li

Zero-Shot Character Identification and Speaker Prediction in Comics via Iterative Multimodal Fusion

Add code
Apr 24, 2024
Viaarxiv icon

On the Road with GPT-4V: Early Explorations of Visual-Language Model on Autonomous Driving

Add code
Nov 28, 2023
Viaarxiv icon

Manga109Dialog A Large-scale Dialogue Dataset for Comics Speaker Detection

Add code
Jun 30, 2023
Viaarxiv icon