Picture for Zhen Li

Zhen Li

LMO, CELESTE, HEC Paris

HyPerNav: Hybrid Perception for Object-Oriented Navigation in Unknown Environment

Add code
Oct 27, 2025
Viaarxiv icon

Posterior Collapse as a Phase Transition in Variational Autoencoders

Add code
Oct 02, 2025
Viaarxiv icon

InfiR2: A Comprehensive FP8 Training Recipe for Reasoning-Enhanced Language Models

Add code
Sep 26, 2025
Viaarxiv icon

Deep learning for 3D point cloud processing -- from approaches, tasks to its implications on urban and environmental applications

Add code
Sep 15, 2025
Viaarxiv icon

MDK12-Bench: A Comprehensive Evaluation of Multimodal Large Language Models on Multidisciplinary Exams

Add code
Aug 09, 2025
Viaarxiv icon

Oedipus and the Sphinx: Benchmarking and Improving Visual Language Models for Complex Graphic Reasoning

Add code
Aug 01, 2025
Viaarxiv icon

T2VParser: Adaptive Decomposition Tokens for Partial Alignment in Text to Video Retrieval

Add code
Jul 28, 2025
Viaarxiv icon

Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling

Add code
Jul 23, 2025
Viaarxiv icon

Yume: An Interactive World Generation Model

Add code
Jul 23, 2025
Figure 1 for Yume: An Interactive World Generation Model
Figure 2 for Yume: An Interactive World Generation Model
Figure 3 for Yume: An Interactive World Generation Model
Figure 4 for Yume: An Interactive World Generation Model
Viaarxiv icon

Bradley-Terry and Multi-Objective Reward Modeling Are Complementary

Add code
Jul 10, 2025
Viaarxiv icon