Picture for Xinqi Wang

Xinqi Wang

Multi-Agent Reinforcement Learning from Human Feedback: Data Coverage and Algorithmic Techniques

Add code
Sep 04, 2024
Viaarxiv icon

USV-AUV Collaboration Framework for Underwater Tasks under Extreme Sea Conditions

Add code
Sep 04, 2024
Viaarxiv icon

CSANet: Channel Spatial Attention Network for Robust 3D Face Alignment and Reconstruction

Add code
May 30, 2024
Viaarxiv icon

Transferable Reinforcement Learning via Generalized Occupancy Models

Add code
Mar 10, 2024
Viaarxiv icon

Tree-of-Mixed-Thought: Combining Fast and Slow Thinking for Multi-hop Visual Reasoning

Add code
Aug 21, 2023
Viaarxiv icon

On Gap-dependent Bounds for Offline Reinforcement Learning

Add code
Jun 01, 2022
Figure 1 for On Gap-dependent Bounds for Offline Reinforcement Learning
Figure 2 for On Gap-dependent Bounds for Offline Reinforcement Learning
Viaarxiv icon