Sarah Perrin

Diversity-Rewarded CFG Distillation

Oct 08, 2024

Gemma 2: Improving Open Language Models at a Practical Size

Aug 02, 2024

BOND: Aligning LLMs with Best-of-N Distillation

Jul 19, 2024

Learning Correlated Equilibria in Mean-Field Games

Aug 22, 2022

Learning Mean Field Games: A Survey

May 25, 2022

Scalable Deep Reinforcement Learning Algorithms for Mean Field Games

Mar 22, 2022

Generalization in Mean Field Games by Learning Master Policies

Sep 20, 2021

Concave Utility Reinforcement Learning: the Mean-field Game viewpoint

Jun 09, 2021

Mean Field Games Flock! The Reinforcement Learning Way

May 17, 2021

Scaling up Mean Field Games with Online Mirror Descent

Feb 28, 2021