Lei Yang

SOUP: Token-level Single-sample Mix-policy Reinforcement Learning for Large Language Models

Jan 29, 2026

Meta-Cognitive Reinforcement Learning with Self-Doubt and Recovery

Jan 28, 2026

Training instability in deep learning follows low-dimensional dynamical principles

Jan 19, 2026

Learning to Trust Experience: A Monitor-Trust-Regulator Framework for Learning under Unobservable Feedback Reliability

Jan 14, 2026

Radiance-Field Reinforced Pretraining: Scaling Localization Models with Unlabeled Wireless Signals

Dec 08, 2025

Scaling Spatial Intelligence with Multimodal Foundation Models

Nov 17, 2025

DGFusion: Dual-guided Fusion for Robust Multi-Modal 3D Object Detection

Nov 13, 2025

Cross-Modal Unlearning via Influential Neuron Path Editing in Multimodal Large Language Models

Nov 10, 2025

Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals

Oct 31, 2025

The Quest for Generalizable Motion Generation: Data, Model, and Evaluation

Oct 30, 2025