Picture for Lei Bai

Lei Bai

Squeeze the Soaked Sponge: Efficient Off-policy Reinforcement Finetuning for Large Language Model

Add code
Jul 09, 2025
Viaarxiv icon

Position: Intelligent Science Laboratory Requires the Integration of Cognitive and Embodied AI

Add code
Jun 24, 2025
Viaarxiv icon

ReconMOST: Multi-Layer Sea Temperature Reconstruction with Observations-Guided Diffusion

Add code
Jun 12, 2025
Viaarxiv icon

Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning

Add code
Jun 12, 2025
Viaarxiv icon

VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning

Add code
Jun 10, 2025
Viaarxiv icon

OmniEarth-Bench: Towards Holistic Evaluation of Earth's Six Spheres and Cross-Spheres Interactions with Multimodal Observational Earth Data

Add code
May 29, 2025
Viaarxiv icon

ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering

Add code
May 29, 2025
Viaarxiv icon

LabUtopia: High-Fidelity Simulation and Hierarchical Benchmark for Scientific Embodied Agents

Add code
May 28, 2025
Viaarxiv icon

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Add code
May 28, 2025
Viaarxiv icon

Align-DA: Align Score-based Atmospheric Data Assimilation with Multiple Preferences

Add code
May 28, 2025
Viaarxiv icon