Picture for Wenhui Hu

Wenhui Hu

Code-Vision: Evaluating Multimodal LLMs Logic Understanding and Code Generation Capabilities

Add code
Feb 17, 2025
Viaarxiv icon

Bridging Modality Gap for Visual Grounding with Effecitve Cross-modal Distillation

Add code
Dec 29, 2023
Viaarxiv icon