Picture for András György

András György

Non-Stationary Learning of Neural Networks with Automatic Soft Parameter Reset

Add code
Nov 06, 2024
Viaarxiv icon

Toward Understanding In-context vs. In-weight Learning

Add code
Oct 30, 2024
Viaarxiv icon

Learning Continually by Spectral Regularization

Add code
Jun 10, 2024
Viaarxiv icon

Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits

Add code
Feb 08, 2024
Viaarxiv icon

Online RL in Linearly $q^π$-Realizable MDPs Is as Easy as in Linear MDPs If You Learn What to Ignore

Add code
Oct 11, 2023
Viaarxiv icon

Optimistic Natural Policy Gradient: a Simple Efficient Policy Optimization Framework for Online RL

Add code
May 18, 2023
Figure 1 for Optimistic Natural Policy Gradient: a Simple Efficient Policy Optimization Framework for Online RL
Viaarxiv icon

A Second-Order Method for Stochastic Bandit Convex Optimisation

Add code
Feb 10, 2023
Viaarxiv icon

Optimistic Meta-Gradients

Add code
Jan 09, 2023
Viaarxiv icon

Generalization Bounds for Transfer Learning with Pretrained Classifiers

Add code
Dec 23, 2022
Viaarxiv icon

Understanding Self-Predictive Learning for Reinforcement Learning

Add code
Dec 06, 2022
Viaarxiv icon