Edoardo Cetin

Large Language Models to Diffusion Finetuning

Jan 27, 2025
Via arXiv

$\text{Transformer}^2$: Self-adaptive LLMs

Jan 14, 2025
Via arXiv

Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting

Dec 05, 2024
Via arXiv

An Evolved Universal Transformer Memory

Oct 17, 2024
Via arXiv

Simple Ingredients for Offline Reinforcement Learning

Mar 19, 2024
Via arXiv

Policy Gradient With Serial Markov Chain Reasoning

Oct 13, 2022
Via arXiv

Hyperbolic Deep Reinforcement Learning

Oct 04, 2022
Via arXiv

Stabilizing Off-Policy Deep Reinforcement Learning from Pixels

Jul 03, 2022
Via arXiv

Learning Pessimism for Robust and Efficient Off-Policy Reinforcement Learning

Oct 07, 2021
Via arXiv

Learning Routines for Effective Off-Policy Reinforcement Learning

Jun 05, 2021
Via arXiv