Andrei Polubarov

Yes, Q-learning Helps Offline In-Context RL

Feb 24, 2025
Via arXiv

N-Gram Induction Heads for In-Context RL: Improving Stability and Reducing Data Needs

Nov 04, 2024
Via arXiv