Picture for Conghui He

Conghui He

Token Pruning in Multimodal Large Language Models: Are We Solving the Right Problem?

Add code
Feb 17, 2025
Viaarxiv icon

Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More

Add code
Feb 17, 2025
Viaarxiv icon

A Comprehensive Survey on Imbalanced Data Learning

Add code
Feb 13, 2025
Viaarxiv icon

BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models

Add code
Feb 11, 2025
Viaarxiv icon

Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT

Add code
Feb 10, 2025
Viaarxiv icon

Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluation

Add code
Feb 10, 2025
Viaarxiv icon

GRAIT: Gradient-Driven Refusal-Aware Instruction Tuning for Effective Hallucination Mitigation

Add code
Feb 09, 2025
Viaarxiv icon

WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages

Add code
Jan 24, 2025
Figure 1 for WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages
Figure 2 for WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages
Figure 3 for WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages
Figure 4 for WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages
Viaarxiv icon

OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?

Add code
Jan 09, 2025
Viaarxiv icon

Accelerating Diffusion Transformers with Dual Feature Caching

Add code
Dec 25, 2024
Viaarxiv icon