Picture for Artem Zholus

Artem Zholus

TRecViT: A Recurrent Video Transformer

Add code
Dec 18, 2024
Viaarxiv icon

Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation

Add code
Dec 09, 2024
Figure 1 for Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation
Figure 2 for Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation
Figure 3 for Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation
Figure 4 for Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation
Viaarxiv icon

Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent

Add code
Jul 16, 2024
Viaarxiv icon

IDAT: A Multi-Modal Dataset and Toolkit for Building and Evaluating Interactive Task-Solving Agents

Add code
Jul 12, 2024
Figure 1 for IDAT: A Multi-Modal Dataset and Toolkit for Building and Evaluating Interactive Task-Solving Agents
Figure 2 for IDAT: A Multi-Modal Dataset and Toolkit for Building and Evaluating Interactive Task-Solving Agents
Figure 3 for IDAT: A Multi-Modal Dataset and Toolkit for Building and Evaluating Interactive Task-Solving Agents
Figure 4 for IDAT: A Multi-Modal Dataset and Toolkit for Building and Evaluating Interactive Task-Solving Agents
Viaarxiv icon

BindGPT: A Scalable Framework for 3D Molecular Design via Language Modeling and Reinforcement Learning

Add code
Jun 06, 2024
Figure 1 for BindGPT: A Scalable Framework for 3D Molecular Design via Language Modeling and Reinforcement Learning
Figure 2 for BindGPT: A Scalable Framework for 3D Molecular Design via Language Modeling and Reinforcement Learning
Figure 3 for BindGPT: A Scalable Framework for 3D Molecular Design via Language Modeling and Reinforcement Learning
Figure 4 for BindGPT: A Scalable Framework for 3D Molecular Design via Language Modeling and Reinforcement Learning
Viaarxiv icon

Mastering Memory Tasks with World Models

Add code
Mar 07, 2024
Viaarxiv icon

Transforming Human-Centered AI Collaboration: Redefining Embodied Agents Capabilities through Interactive Grounded Language Instructions

Add code
May 18, 2023
Figure 1 for Transforming Human-Centered AI Collaboration: Redefining Embodied Agents Capabilities through Interactive Grounded Language Instructions
Figure 2 for Transforming Human-Centered AI Collaboration: Redefining Embodied Agents Capabilities through Interactive Grounded Language Instructions
Figure 3 for Transforming Human-Centered AI Collaboration: Redefining Embodied Agents Capabilities through Interactive Grounded Language Instructions
Figure 4 for Transforming Human-Centered AI Collaboration: Redefining Embodied Agents Capabilities through Interactive Grounded Language Instructions
Viaarxiv icon

Collecting Interactive Multi-modal Datasets for Grounded Language Understanding

Add code
Nov 18, 2022
Figure 1 for Collecting Interactive Multi-modal Datasets for Grounded Language Understanding
Figure 2 for Collecting Interactive Multi-modal Datasets for Grounded Language Understanding
Figure 3 for Collecting Interactive Multi-modal Datasets for Grounded Language Understanding
Viaarxiv icon

Learning to Solve Voxel Building Embodied Tasks from Pixels and Natural Language Instructions

Add code
Nov 01, 2022
Viaarxiv icon

IGLU Gridworld: Simple and Fast Environment for Embodied Dialog Agents

Add code
May 31, 2022
Figure 1 for IGLU Gridworld: Simple and Fast Environment for Embodied Dialog Agents
Figure 2 for IGLU Gridworld: Simple and Fast Environment for Embodied Dialog Agents
Figure 3 for IGLU Gridworld: Simple and Fast Environment for Embodied Dialog Agents
Viaarxiv icon