Picture for Chao Yu

Chao Yu

Hefei National Laboratory for Physical Sciences at Microscale and Department of Modern Physics, University of Science and Technology of China, Hefei, China, Shanghai Branch, CAS Center for Excellence in Quantum Information and Quantum Physics, University of Science and Technology of China, Shanghai, China, Shanghai Research Center for Quantum Sciences, Shanghai, China

Policy-to-Language: Train LLMs to Explain Decisions with Flow-Matching Generated Rewards

Add code
Feb 18, 2025
Viaarxiv icon

Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization

Add code
Feb 07, 2025
Figure 1 for Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization
Figure 2 for Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization
Figure 3 for Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization
Figure 4 for Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization
Viaarxiv icon

VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play

Add code
Feb 04, 2025
Viaarxiv icon

Rapid Learning in Constrained Minimax Games with Negative Momentum

Add code
Dec 31, 2024
Viaarxiv icon

An Experimental Study of Passive UAV Tracking with Digital Arrays and Cellular Downlink Signals

Add code
Dec 30, 2024
Viaarxiv icon

Hierarchical Multi-agent Meta-Reinforcement Learning for Cross-channel Bidding

Add code
Dec 26, 2024
Viaarxiv icon

What Matters in Learning A Zero-Shot Sim-to-Real RL Policy for Quadrotor Control? A Comprehensive Study

Add code
Dec 17, 2024
Viaarxiv icon

Neural Internal Model Control: Learning a Robust Control Policy via Predictive Error Feedback

Add code
Nov 20, 2024
Figure 1 for Neural Internal Model Control: Learning a Robust Control Policy via Predictive Error Feedback
Figure 2 for Neural Internal Model Control: Learning a Robust Control Policy via Predictive Error Feedback
Figure 3 for Neural Internal Model Control: Learning a Robust Control Policy via Predictive Error Feedback
Figure 4 for Neural Internal Model Control: Learning a Robust Control Policy via Predictive Error Feedback
Viaarxiv icon

Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent

Add code
Nov 05, 2024
Figure 1 for Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Figure 2 for Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Figure 3 for Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Figure 4 for Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Viaarxiv icon

SleepNetZero: Zero-Burden Zero-Shot Reliable Sleep Staging With Neural Networks Based on Ballistocardiograms

Add code
Oct 30, 2024
Viaarxiv icon