Picture for Anssi Kanervisto

Anssi Kanervisto

Diffusion for World Modeling: Visual Details Matter in Atari

Add code
May 20, 2024
Figure 1 for Diffusion for World Modeling: Visual Details Matter in Atari
Figure 2 for Diffusion for World Modeling: Visual Details Matter in Atari
Figure 3 for Diffusion for World Modeling: Visual Details Matter in Atari
Figure 4 for Diffusion for World Modeling: Visual Details Matter in Atari
Viaarxiv icon

Toward Human-AI Alignment in Large-Scale Multi-Player Games

Add code
Feb 05, 2024
Viaarxiv icon

BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks

Add code
Dec 05, 2023
Figure 1 for BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks
Figure 2 for BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks
Figure 3 for BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks
Figure 4 for BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks
Viaarxiv icon

Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games

Add code
Dec 04, 2023
Viaarxiv icon

Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition

Add code
Mar 23, 2023
Figure 1 for Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition
Figure 2 for Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition
Figure 3 for Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition
Figure 4 for Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition
Viaarxiv icon

Imitating Human Behaviour with Diffusion Models

Add code
Jan 25, 2023
Viaarxiv icon

A2C is a special case of PPO

Add code
May 18, 2022
Figure 1 for A2C is a special case of PPO
Viaarxiv icon

GAN-Aimbots: Using Machine Learning for Cheating in First Person Shooters

Add code
May 14, 2022
Figure 1 for GAN-Aimbots: Using Machine Learning for Cheating in First Person Shooters
Figure 2 for GAN-Aimbots: Using Machine Learning for Cheating in First Person Shooters
Figure 3 for GAN-Aimbots: Using Machine Learning for Cheating in First Person Shooters
Figure 4 for GAN-Aimbots: Using Machine Learning for Cheating in First Person Shooters
Viaarxiv icon

Retrospective on the 2021 BASALT Competition on Learning from Human Feedback

Add code
Apr 14, 2022
Figure 1 for Retrospective on the 2021 BASALT Competition on Learning from Human Feedback
Figure 2 for Retrospective on the 2021 BASALT Competition on Learning from Human Feedback
Figure 3 for Retrospective on the 2021 BASALT Competition on Learning from Human Feedback
Figure 4 for Retrospective on the 2021 BASALT Competition on Learning from Human Feedback
Viaarxiv icon

Insights From the NeurIPS 2021 NetHack Challenge

Add code
Mar 22, 2022
Figure 1 for Insights From the NeurIPS 2021 NetHack Challenge
Figure 2 for Insights From the NeurIPS 2021 NetHack Challenge
Figure 3 for Insights From the NeurIPS 2021 NetHack Challenge
Figure 4 for Insights From the NeurIPS 2021 NetHack Challenge
Viaarxiv icon