Chao Yu

Hefei National Laboratory for Physical Sciences at Microscale and Department of Modern Physics, University of Science and Technology of China, Hefei, China; Shanghai Branch, CAS Center for Excellence in Quantum Information and Quantum Physics, University of Science and Technology of China, Shanghai, China; Shanghai Research Center for Quantum Sciences, Shanghai, China

Rapid Learning in Constrained Minimax Games with Negative Momentum

Dec 31, 2024, via arXiv

An Experimental Study of Passive UAV Tracking with Digital Arrays and Cellular Downlink Signals

Dec 30, 2024, via arXiv

Hierarchical Multi-agent Meta-Reinforcement Learning for Cross-channel Bidding

Dec 26, 2024, via arXiv

What Matters in Learning A Zero-Shot Sim-to-Real RL Policy for Quadrotor Control? A Comprehensive Study

Dec 17, 2024, via arXiv

Neural Internal Model Control: Learning a Robust Control Policy via Predictive Error Feedback

Nov 20, 2024, via arXiv

Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent

Nov 05, 2024, via arXiv

SleepNetZero: Zero-Burden Zero-Shot Reliable Sleep Staging With Neural Networks Based on Ballistocardiograms

Oct 30, 2024, via arXiv

Multi-UAV Behavior-based Formation with Static and Dynamic Obstacles Avoidance via Reinforcement Learning

Oct 24, 2024, via arXiv

Few-shot In-Context Preference Learning Using Large Language Models

Oct 22, 2024, via arXiv

Diverse Policies Recovering via Pointwise Mutual Information Weighted Imitation Learning

Oct 21, 2024, via arXiv