Bo Li

Beijing Key Laboratory of Digital Media, School of Computer Science and Engineering, Beihang University, Beijing, China

MMSearch-R1: Incentivizing LMMs to Search

Jun 25, 2025 (via arXiv)

RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories

Jun 18, 2025 (via arXiv)

Seewo's Submission to MLC-SLM: Lessons learned from Speech Reasoning Language Models

Jun 16, 2025 (via arXiv)

Reliable Reasoning Path: Distilling Effective Guidance for LLM Reasoning with Knowledge Graphs

Jun 12, 2025 (via arXiv)

Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model

Jun 10, 2025 (via arXiv)

CDR-Agent: Intelligent Selection and Execution of Clinical Decision Rules Using Large Language Model Agents

May 29, 2025 (via arXiv)

HyperMotion: DiT-Based Pose-Guided Human Image Animation of Complex Motions

May 29, 2025 (via arXiv)

MagicTryOn: Harnessing Diffusion Transformer for Garment-Preserving Video Virtual Try-on

May 28, 2025 (via arXiv)

Photography Perspective Composition: Towards Aesthetic Perspective Recommendation

May 27, 2025 (via arXiv)

SOSBENCH: Benchmarking Safety Alignment on Scientific Knowledge

May 27, 2025 (via arXiv)