Tian Xu

Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation

Nov 01, 2024

Collaborative motion planning for multi-manipulator systems through Reinforcement Learning and Dynamic Movement Primitives

Oct 01, 2024

Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity

Aug 29, 2024

AI-driven platform for systematic nomenclature and intelligent knowledge acquisition of natural medicinal materials

Dec 27, 2023

Policy Optimization in RLHF: The Impact of Out-of-preference Data

Dec 17, 2023

ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models

Oct 17, 2023

Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning

Oct 09, 2023

Provably Efficient Adversarial Imitation Learning with Unknown Transitions

Jun 11, 2023

Theoretical Analysis of Offline Imitation With Supplementary Dataset

Jan 27, 2023

Understanding Adversarial Imitation Learning in Small Sample Regime: A Stage-coupled Analysis

Aug 03, 2022