Picture for Li-Cheng Lan

Li-Cheng Lan

Expert Proximity as Surrogate Rewards for Single Demonstration Imitation Learning

Add code
Feb 01, 2024
Viaarxiv icon

Can Agents Run Relay Race with Strangers? Generalization of RL to Out-of-Distribution Trajectories

Add code
Apr 26, 2023
Viaarxiv icon

Are AlphaZero-like Agents Robust to Adversarial Perturbations?

Add code
Nov 07, 2022
Viaarxiv icon

Learning to Stop: Dynamic Simulation Monte-Carlo Tree Search

Add code
Dec 14, 2020
Figure 1 for Learning to Stop: Dynamic Simulation Monte-Carlo Tree Search
Figure 2 for Learning to Stop: Dynamic Simulation Monte-Carlo Tree Search
Figure 3 for Learning to Stop: Dynamic Simulation Monte-Carlo Tree Search
Figure 4 for Learning to Stop: Dynamic Simulation Monte-Carlo Tree Search
Viaarxiv icon

How much progress have we made in neural network training? A New Evaluation Protocol for Benchmarking Optimizers

Add code
Oct 19, 2020
Figure 1 for How much progress have we made in neural network training? A New Evaluation Protocol for Benchmarking Optimizers
Figure 2 for How much progress have we made in neural network training? A New Evaluation Protocol for Benchmarking Optimizers
Figure 3 for How much progress have we made in neural network training? A New Evaluation Protocol for Benchmarking Optimizers
Figure 4 for How much progress have we made in neural network training? A New Evaluation Protocol for Benchmarking Optimizers
Viaarxiv icon

Multiple Policy Value Monte Carlo Tree Search

Add code
May 31, 2019
Figure 1 for Multiple Policy Value Monte Carlo Tree Search
Figure 2 for Multiple Policy Value Monte Carlo Tree Search
Figure 3 for Multiple Policy Value Monte Carlo Tree Search
Figure 4 for Multiple Policy Value Monte Carlo Tree Search
Viaarxiv icon

Multi-Labelled Value Networks for Computer Go

Add code
May 30, 2017
Figure 1 for Multi-Labelled Value Networks for Computer Go
Figure 2 for Multi-Labelled Value Networks for Computer Go
Figure 3 for Multi-Labelled Value Networks for Computer Go
Figure 4 for Multi-Labelled Value Networks for Computer Go
Viaarxiv icon