Chengxing Jia

Q-Adapter: Training Your LLM Adapter as a Residual Q-Function

Jul 04, 2024
Via arXiv

Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning

May 27, 2024
Via arXiv

BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation

May 27, 2024
Via arXiv

Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation

Mar 12, 2024
Via arXiv

Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics

Feb 17, 2024
Via arXiv

Empowering Language Models with Active Inquiry for Deeper Understanding

Feb 06, 2024
Via arXiv

Model Generation with Provable Coverability for Offline Reinforcement Learning

Jun 08, 2022
Via arXiv