Picture for Li-Cheng Lan

Li-Cheng Lan

Expert Proximity as Surrogate Rewards for Single Demonstration Imitation Learning

Add code
Feb 01, 2024
Figure 1 for Expert Proximity as Surrogate Rewards for Single Demonstration Imitation Learning
Figure 2 for Expert Proximity as Surrogate Rewards for Single Demonstration Imitation Learning
Figure 3 for Expert Proximity as Surrogate Rewards for Single Demonstration Imitation Learning
Figure 4 for Expert Proximity as Surrogate Rewards for Single Demonstration Imitation Learning
Viaarxiv icon

Can Agents Run Relay Race with Strangers? Generalization of RL to Out-of-Distribution Trajectories

Add code
Apr 26, 2023
Viaarxiv icon

Are AlphaZero-like Agents Robust to Adversarial Perturbations?

Add code
Nov 07, 2022
Viaarxiv icon

Learning to Stop: Dynamic Simulation Monte-Carlo Tree Search

Add code
Dec 14, 2020
Figure 1 for Learning to Stop: Dynamic Simulation Monte-Carlo Tree Search
Figure 2 for Learning to Stop: Dynamic Simulation Monte-Carlo Tree Search
Figure 3 for Learning to Stop: Dynamic Simulation Monte-Carlo Tree Search
Figure 4 for Learning to Stop: Dynamic Simulation Monte-Carlo Tree Search
Viaarxiv icon

How much progress have we made in neural network training? A New Evaluation Protocol for Benchmarking Optimizers

Add code
Oct 19, 2020
Figure 1 for How much progress have we made in neural network training? A New Evaluation Protocol for Benchmarking Optimizers
Figure 2 for How much progress have we made in neural network training? A New Evaluation Protocol for Benchmarking Optimizers
Figure 3 for How much progress have we made in neural network training? A New Evaluation Protocol for Benchmarking Optimizers
Figure 4 for How much progress have we made in neural network training? A New Evaluation Protocol for Benchmarking Optimizers
Viaarxiv icon

Multiple Policy Value Monte Carlo Tree Search

Add code
May 31, 2019
Figure 1 for Multiple Policy Value Monte Carlo Tree Search
Figure 2 for Multiple Policy Value Monte Carlo Tree Search
Figure 3 for Multiple Policy Value Monte Carlo Tree Search
Figure 4 for Multiple Policy Value Monte Carlo Tree Search
Viaarxiv icon

Multi-Labelled Value Networks for Computer Go

Add code
May 30, 2017
Figure 1 for Multi-Labelled Value Networks for Computer Go
Figure 2 for Multi-Labelled Value Networks for Computer Go
Figure 3 for Multi-Labelled Value Networks for Computer Go
Figure 4 for Multi-Labelled Value Networks for Computer Go
Viaarxiv icon