Denis Tarasov

Yes, Q-learning Helps Offline In-Context RL

Feb 24, 2025

Vintix: Action Model via In-Context Reinforcement Learning

Jan 31, 2025

The Role of Deep Learning Regularizations on Actors in Offline RL

Sep 11, 2024

Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning?

Jun 10, 2024

Distilling LLMs' Decomposition Abilities into Compact Language Models

Feb 02, 2024

Katakomba: Tools and Benchmarks for Data-Driven NetHack

Jun 14, 2023

Revisiting the Minimalist Approach to Offline Reinforcement Learning

May 16, 2023

Anti-Exploration by Random Network Distillation

Jan 31, 2023

Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size

Nov 20, 2022

Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows

Nov 20, 2022