Picture for Ishan Durugkar

Ishan Durugkar

N-Agent Ad Hoc Teamwork

Add code
Apr 16, 2024
Viaarxiv icon

$f$-Policy Gradients: A General Framework for Goal Conditioned RL using $f$-Divergences

Add code
Oct 10, 2023
Viaarxiv icon

ABC: Adversarial Behavioral Cloning for Offline Mode-Seeking Imitation Learning

Add code
Nov 08, 2022
Viaarxiv icon

DM$^2$: Distributed Multi-Agent Reinforcement Learning for Distribution Matching

Add code
Jun 01, 2022
Figure 1 for DM$^2$: Distributed Multi-Agent Reinforcement Learning for Distribution Matching
Figure 2 for DM$^2$: Distributed Multi-Agent Reinforcement Learning for Distribution Matching
Figure 3 for DM$^2$: Distributed Multi-Agent Reinforcement Learning for Distribution Matching
Figure 4 for DM$^2$: Distributed Multi-Agent Reinforcement Learning for Distribution Matching
Viaarxiv icon

Wasserstein Distance Maximizing Intrinsic Control

Add code
Oct 28, 2021
Figure 1 for Wasserstein Distance Maximizing Intrinsic Control
Figure 2 for Wasserstein Distance Maximizing Intrinsic Control
Figure 3 for Wasserstein Distance Maximizing Intrinsic Control
Viaarxiv icon

Adversarial Intrinsic Motivation for Reinforcement Learning

Add code
May 30, 2021
Figure 1 for Adversarial Intrinsic Motivation for Reinforcement Learning
Figure 2 for Adversarial Intrinsic Motivation for Reinforcement Learning
Figure 3 for Adversarial Intrinsic Motivation for Reinforcement Learning
Figure 4 for Adversarial Intrinsic Motivation for Reinforcement Learning
Viaarxiv icon

Reducing Sampling Error in Batch Temporal Difference Learning

Add code
Aug 15, 2020
Viaarxiv icon

An Imitation from Observation Approach to Sim-to-Real Transfer

Add code
Aug 04, 2020
Figure 1 for An Imitation from Observation Approach to Sim-to-Real Transfer
Figure 2 for An Imitation from Observation Approach to Sim-to-Real Transfer
Figure 3 for An Imitation from Observation Approach to Sim-to-Real Transfer
Figure 4 for An Imitation from Observation Approach to Sim-to-Real Transfer
Viaarxiv icon

Multi-Preference Actor Critic

Add code
Apr 05, 2019
Figure 1 for Multi-Preference Actor Critic
Figure 2 for Multi-Preference Actor Critic
Figure 3 for Multi-Preference Actor Critic
Figure 4 for Multi-Preference Actor Critic
Viaarxiv icon

Go for a Walk and Arrive at the Answer: Reasoning Over Paths in Knowledge Bases using Reinforcement Learning

Add code
Nov 15, 2017
Figure 1 for Go for a Walk and Arrive at the Answer: Reasoning Over Paths in Knowledge Bases using Reinforcement Learning
Figure 2 for Go for a Walk and Arrive at the Answer: Reasoning Over Paths in Knowledge Bases using Reinforcement Learning
Figure 3 for Go for a Walk and Arrive at the Answer: Reasoning Over Paths in Knowledge Bases using Reinforcement Learning
Figure 4 for Go for a Walk and Arrive at the Answer: Reasoning Over Paths in Knowledge Bases using Reinforcement Learning
Viaarxiv icon