Picture for Amélie Héliou

Amélie Héliou

Pixtral 12B

Add code
Oct 09, 2024
Viaarxiv icon

BOND: Aligning LLMs with Best-of-N Distillation

Add code
Jul 19, 2024
Figure 1 for BOND: Aligning LLMs with Best-of-N Distillation
Figure 2 for BOND: Aligning LLMs with Best-of-N Distillation
Figure 3 for BOND: Aligning LLMs with Best-of-N Distillation
Figure 4 for BOND: Aligning LLMs with Best-of-N Distillation
Viaarxiv icon

Gemma: Open Models Based on Gemini Research and Technology

Add code
Mar 13, 2024
Figure 1 for Gemma: Open Models Based on Gemini Research and Technology
Figure 2 for Gemma: Open Models Based on Gemini Research and Technology
Figure 3 for Gemma: Open Models Based on Gemini Research and Technology
Figure 4 for Gemma: Open Models Based on Gemini Research and Technology
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Zeroth-order non-convex learning via hierarchical dual averaging

Add code
Sep 13, 2021
Figure 1 for Zeroth-order non-convex learning via hierarchical dual averaging
Figure 2 for Zeroth-order non-convex learning via hierarchical dual averaging
Figure 3 for Zeroth-order non-convex learning via hierarchical dual averaging
Figure 4 for Zeroth-order non-convex learning via hierarchical dual averaging
Viaarxiv icon

Online non-convex optimization with imperfect feedback

Add code
Oct 16, 2020
Figure 1 for Online non-convex optimization with imperfect feedback
Figure 2 for Online non-convex optimization with imperfect feedback
Figure 3 for Online non-convex optimization with imperfect feedback
Viaarxiv icon

Individual Treatment Effect Estimation in a Low Compliance Setting

Add code
Aug 07, 2020
Figure 1 for Individual Treatment Effect Estimation in a Low Compliance Setting
Figure 2 for Individual Treatment Effect Estimation in a Low Compliance Setting
Figure 3 for Individual Treatment Effect Estimation in a Low Compliance Setting
Figure 4 for Individual Treatment Effect Estimation in a Low Compliance Setting
Viaarxiv icon

Exponentially fast convergence to (strict) equilibrium via hedging

Add code
Jul 29, 2016
Viaarxiv icon