Alexander Novikov

Amplifying human performance in combinatorial competitive programming

Nov 29, 2024
Via arXiv

Quantum Circuit Optimization with AlphaTensor

Mar 05, 2024
Via arXiv

A Generalist Agent

May 19, 2022
Via arXiv

Benchmarks for Deep Off-Policy Evaluation

Mar 30, 2021
Via arXiv

Automatic differentiation for Riemannian optimization on low-rank matrix and tensor-train manifolds

Mar 27, 2021
Via arXiv

Semi-supervised reward learning for offline reinforcement learning

Dec 12, 2020
Via arXiv

Offline Learning from Demonstrations and Unlabeled Experience

Nov 27, 2020
Via arXiv

Hyperparameter Selection for Offline Reinforcement Learning

Jul 17, 2020
Via arXiv

RL Unplugged: Benchmarks for Offline Reinforcement Learning

Jul 02, 2020
Via arXiv

Critic Regularized Regression

Jun 26, 2020
Via arXiv