Picture for Zhe Li

Zhe Li

RoboForge: Physically Optimized Text-guided Whole-Body Locomotion for Humanoids

Add code
Mar 19, 2026
Viaarxiv icon

VisBrowse-Bench: Benchmarking Visual-Native Search for Multimodal Browsing Agents

Add code
Mar 17, 2026
Viaarxiv icon

YOLO-NAS-Bench: A Surrogate Benchmark with Self-Evolving Predictors for YOLO Architecture Search

Add code
Mar 10, 2026
Viaarxiv icon

AoE: Always-on Egocentric Human Video Collection for Embodied AI

Add code
Mar 02, 2026
Viaarxiv icon

Beyond Next-Token Alignment: Distilling Multimodal Large Language Models via Token Interactions

Add code
Feb 10, 2026
Viaarxiv icon

MOSAIC: Bridging the Sim-to-Real Gap in Generalist Humanoid Motion Tracking and Teleoperation with Rapid Residual Adaptation

Add code
Feb 09, 2026
Viaarxiv icon

ShapePuri: Shape Guided and Appearance Generalized Adversarial Purification

Add code
Feb 05, 2026
Viaarxiv icon

Latent Reasoning VLA: Latent Thinking and Prediction for Vision-Language-Action Models

Add code
Feb 01, 2026
Viaarxiv icon

RoboBrain 2.5: Depth in Sight, Time in Mind

Add code
Jan 20, 2026
Viaarxiv icon

HOSL: Hybrid-Order Split Learning for Memory-Constrained Edge Training

Add code
Jan 16, 2026
Viaarxiv icon