Picture for Peng Li

Peng Li

DJI Innovations Inc

Inference-Time Scaling for Generalist Reward Modeling

Add code
Apr 03, 2025
Viaarxiv icon

1-Tb/s/λ Transmission over Record 10714-km AR-HCF

Add code
Apr 02, 2025
Viaarxiv icon

AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization

Add code
Mar 31, 2025
Viaarxiv icon

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Add code
Mar 27, 2025
Viaarxiv icon

Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks

Add code
Mar 27, 2025
Viaarxiv icon

CoSpace: Benchmarking Continuous Space Perception Ability for Vision-Language Models

Add code
Mar 18, 2025
Viaarxiv icon

LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents

Add code
Mar 13, 2025
Viaarxiv icon

How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape Game

Add code
Mar 13, 2025
Viaarxiv icon

Gradient-guided Attention Map Editing: Towards Efficient Contextual Hallucination Mitigation

Add code
Mar 11, 2025
Viaarxiv icon

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Add code
Mar 11, 2025
Viaarxiv icon