Picture for Xuange Zhang

Xuange Zhang

HiMix: Reducing Computational Complexity in Large Vision-Language Models

Add code
Jan 17, 2025
Viaarxiv icon

UCF-Crime Annotation: A Benchmark for Surveillance Video-and-Language Understanding

Add code
Sep 25, 2023
Viaarxiv icon