Picture for Zheng Liu

Zheng Liu

STI-Bench: Are MLLMs Ready for Precise Spatial-Temporal World Understanding?

Add code
Mar 31, 2025
Viaarxiv icon

Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models

Add code
Mar 27, 2025
Viaarxiv icon

MMCR: Benchmarking Cross-Source Reasoning in Scientific Papers

Add code
Mar 21, 2025
Viaarxiv icon

Memory-enhanced Retrieval Augmentation for Long Video Understanding

Add code
Mar 12, 2025
Viaarxiv icon

An Empirical Study on Eliciting and Improving R1-like Reasoning Models

Add code
Mar 06, 2025
Viaarxiv icon

Efficient and Distributed Large-Scale Point Cloud Bundle Adjustment via Majorization-Minimization

Add code
Feb 26, 2025
Viaarxiv icon

MMTEB: Massive Multilingual Text Embedding Benchmark

Add code
Feb 19, 2025
Viaarxiv icon

HawkBench: Investigating Resilience of RAG Methods on Stratified Information-Seeking Tasks

Add code
Feb 19, 2025
Viaarxiv icon

MomentSeeker: A Comprehensive Benchmark and A Strong Baseline For Moment Retrieval Within Long Videos

Add code
Feb 18, 2025
Viaarxiv icon

Reinforced Information Retrieval

Add code
Feb 17, 2025
Viaarxiv icon