Picture for Danhuai Zhao

Danhuai Zhao

MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding

Add code
Oct 15, 2024
Viaarxiv icon