Picture for Alexandre Ramé

Alexandre Ramé

Diversity-Rewarded CFG Distillation

Add code
Oct 08, 2024
Viaarxiv icon

Gemma 2: Improving Open Language Models at a Practical Size

Add code
Aug 02, 2024
Figure 1 for Gemma 2: Improving Open Language Models at a Practical Size
Figure 2 for Gemma 2: Improving Open Language Models at a Practical Size
Figure 3 for Gemma 2: Improving Open Language Models at a Practical Size
Figure 4 for Gemma 2: Improving Open Language Models at a Practical Size
Viaarxiv icon

Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning

Add code
Jul 22, 2024
Figure 1 for Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning
Figure 2 for Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning
Figure 3 for Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning
Figure 4 for Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning
Viaarxiv icon

BOND: Aligning LLMs with Best-of-N Distillation

Add code
Jul 19, 2024
Figure 1 for BOND: Aligning LLMs with Best-of-N Distillation
Figure 2 for BOND: Aligning LLMs with Best-of-N Distillation
Figure 3 for BOND: Aligning LLMs with Best-of-N Distillation
Figure 4 for BOND: Aligning LLMs with Best-of-N Distillation
Viaarxiv icon

WARP: On the Benefits of Weight Averaged Rewarded Policies

Add code
Jun 24, 2024
Figure 1 for WARP: On the Benefits of Weight Averaged Rewarded Policies
Figure 2 for WARP: On the Benefits of Weight Averaged Rewarded Policies
Figure 3 for WARP: On the Benefits of Weight Averaged Rewarded Policies
Figure 4 for WARP: On the Benefits of Weight Averaged Rewarded Policies
Viaarxiv icon

WARM: On the Benefits of Weight Averaged Reward Models

Add code
Jan 22, 2024
Viaarxiv icon

Recycling diverse models for out-of-distribution generalization

Add code
Dec 20, 2022
Figure 1 for Recycling diverse models for out-of-distribution generalization
Figure 2 for Recycling diverse models for out-of-distribution generalization
Figure 3 for Recycling diverse models for out-of-distribution generalization
Figure 4 for Recycling diverse models for out-of-distribution generalization
Viaarxiv icon

Towards efficient feature sharing in MIMO architectures

Add code
May 20, 2022
Figure 1 for Towards efficient feature sharing in MIMO architectures
Figure 2 for Towards efficient feature sharing in MIMO architectures
Figure 3 for Towards efficient feature sharing in MIMO architectures
Figure 4 for Towards efficient feature sharing in MIMO architectures
Viaarxiv icon

DyTox: Transformers for Continual Learning with DYnamic TOken eXpansion

Add code
Nov 22, 2021
Figure 1 for DyTox: Transformers for Continual Learning with DYnamic TOken eXpansion
Figure 2 for DyTox: Transformers for Continual Learning with DYnamic TOken eXpansion
Figure 3 for DyTox: Transformers for Continual Learning with DYnamic TOken eXpansion
Figure 4 for DyTox: Transformers for Continual Learning with DYnamic TOken eXpansion
Viaarxiv icon

Leveraging Weakly Annotated Data for Fashion Image Retrieval and Label Prediction

Add code
Sep 27, 2017
Figure 1 for Leveraging Weakly Annotated Data for Fashion Image Retrieval and Label Prediction
Figure 2 for Leveraging Weakly Annotated Data for Fashion Image Retrieval and Label Prediction
Figure 3 for Leveraging Weakly Annotated Data for Fashion Image Retrieval and Label Prediction
Figure 4 for Leveraging Weakly Annotated Data for Fashion Image Retrieval and Label Prediction
Viaarxiv icon