Yi Wu

How Far Are We from Optimal Reasoning Efficiency?

Jun 08, 2025

AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning

May 30, 2025

What Can RL Bring to VLA Generalization? An Empirical Study

May 26, 2025

StyleAR: Customizing Multimodal Autoregressive Model for Style-Aligned Text-to-Image Generation

May 26, 2025

Toward Real-World Cooperative and Competitive Soccer with Quadrupedal Robot Teams

May 20, 2025

Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps

May 15, 2025

PaRT: Enhancing Proactive Social Chatbots with Personalized Real-Time Retrieval

Apr 29, 2025

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Mar 14, 2025

Proxy-Tuning: Tailoring Multimodal Autoregressive Models for Subject-Driven Image Generation

Mar 13, 2025

Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization

Feb 07, 2025