Gokul Swamy

A Smooth Sea Never Made a Skilled $\texttt{SAILOR}$: Robust Imitation via Learning to Search

Jun 05, 2025

Scaling Offline RL via Efficient and Expressive Shortcut Models

May 28, 2025

Efficient Imitation Under Misspecification

Mar 17, 2025

All Roads Lead to Likelihood: The Value of Reinforcement Learning in Fine-Tuning

Mar 03, 2025

From Foresight to Forethought: VLM-In-the-Loop Policy Steering via Latent Alignment

Feb 03, 2025

Your Learned Constraint is Secretly a Backward Reachable Tube

Jan 26, 2025

Diffusing States and Matching Scores: A New Framework for Imitation Learning

Oct 17, 2024

Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF

Oct 06, 2024

EvIL: Evolution Strategies for Generalisable Imitation Learning

Jun 15, 2024

Multi-Agent Imitation Learning: Value is Easy, Regret is Hard

Jun 06, 2024