Picture for Jiale Yuan

Jiale Yuan

Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration

Add code
Jan 09, 2025
Viaarxiv icon

LongDocURL: a Comprehensive Multimodal Long Document Benchmark Integrating Understanding, Reasoning, and Locating

Add code
Dec 24, 2024
Viaarxiv icon

ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models

Add code
May 24, 2024
Viaarxiv icon