Imitation Learning


Imitation learning is a framework for learning a behavior policy from demonstrations. Usually, demonstrations are presented in the form of state-action trajectories, with each pair indicating the action to take at the state being visited. In order to learn the behavior policy, the demonstrated actions are usually utilized in two ways. The first, known as Behavior Cloning (BC), treats the action as the target label for each state, and then learns a generalized mapping from states to actions in a supervised manner. Another way, known as Inverse Reinforcement Learning (IRL), views the demonstrated actions as a sequence of decisions, and aims at finding a reward/cost function under which the demonstrated decisions are optimal.

From Imitation to Discrimination: Progressive Curriculum Learning for Robust Web Navigation

Add code
Apr 14, 2026
Viaarxiv icon

WM-DAgger: Enabling Efficient Data Aggregation for Imitation Learning with World Models

Add code
Apr 13, 2026
Viaarxiv icon

Active Imitation Learning for Thermal- and Kernel-Aware LFM Inference on 3D S-NUCA Many-Cores

Add code
Apr 13, 2026
Viaarxiv icon

AffordSim: A Scalable Data Generator and Benchmark for Affordance-Aware Robotic Manipulation

Add code
Apr 13, 2026
Viaarxiv icon

ScoRe-Flow: Complete Distributional Control via Score-Based Reinforcement Learning for Flow Matching

Add code
Apr 13, 2026
Viaarxiv icon

LIDEA: Human-to-Robot Imitation Learning via Implicit Feature Distillation and Explicit Geometry Alignment

Add code
Apr 12, 2026
Viaarxiv icon

AffordGen: Generating Diverse Demonstrations for Generalizable Object Manipulation with Afford Correspondence

Add code
Apr 12, 2026
Viaarxiv icon

MimicLM: Zero-Shot Voice Imitation through Autoregressive Modeling of Pseudo-Parallel Speech Corpora

Add code
Apr 13, 2026
Viaarxiv icon

MoRI: Mixture of RL and IL Experts for Long-Horizon Manipulation Tasks

Add code
Apr 11, 2026
Viaarxiv icon

Beyond Compliance: A Resistance-Informed Motivation Reasoning Framework for Challenging Psychological Client Simulation

Add code
Apr 12, 2026
Viaarxiv icon