Yang Cai

ABot-N0: Technical Report on the VLA Foundation Model for Versatile Embodied Navigation

Feb 12, 2026

Is Online Linear Optimization Sufficient for Strategic Robustness?

Feb 12, 2026

Asymptotic Universal Alignment: A New Alignment Framework via Test-Time Scaling

Jan 13, 2026

Smooth Operator: Smooth Verifiable Reward Activates Spatial Reasoning Ability of Vision-Language Model

Jan 12, 2026

DiffVL: Diffusion-Based Visual Localization on 2D Maps via BEV-Conditioned GPS Denoising

Sep 18, 2025

What Makes Treatment Effects Identifiable? Characterizations and Estimators Beyond Unconfoundedness

Jun 04, 2025

On Separation Between Best-Iterate, Random-Iterate, and Last-Iterate Convergence of Learning in Games

Mar 04, 2025

Convergence of the Min-Max Langevin Dynamics and Algorithm for Zero-Sum Games

Dec 29, 2024

Provable Partially Observable Reinforcement Learning with Privileged Information

Dec 01, 2024

COMAL: A Convergent Meta-Algorithm for Aligning LLMs with General Preferences

Oct 30, 2024