Edoardo Cetin

Large Language Models to Diffusion Finetuning

Jan 27, 2025

$\text{Transformer}^2$: Self-adaptive LLMs

Jan 14, 2025

Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting

Dec 05, 2024

An Evolved Universal Transformer Memory

Oct 17, 2024

Simple Ingredients for Offline Reinforcement Learning

Mar 19, 2024

Policy Gradient With Serial Markov Chain Reasoning

Oct 13, 2022

Hyperbolic Deep Reinforcement Learning

Oct 04, 2022

Stabilizing Off-Policy Deep Reinforcement Learning from Pixels

Jul 03, 2022

Learning Pessimism for Robust and Efficient Off-Policy Reinforcement Learning

Oct 07, 2021

Learning Routines for Effective Off-Policy Reinforcement Learning

Jun 05, 2021