Picture for Melanie Kambadur

Melanie Kambadur

Jack

A Systematic Examination of Preference Learning through the Lens of Instruction-Following

Add code
Dec 18, 2024
Viaarxiv icon

Self-Generated Critiques Boost Reward Modeling for Language Models

Add code
Nov 25, 2024
Figure 1 for Self-Generated Critiques Boost Reward Modeling for Language Models
Figure 2 for Self-Generated Critiques Boost Reward Modeling for Language Models
Figure 3 for Self-Generated Critiques Boost Reward Modeling for Language Models
Figure 4 for Self-Generated Critiques Boost Reward Modeling for Language Models
Viaarxiv icon

Multi-IF: Benchmarking LLMs on Multi-Turn and Multilingual Instructions Following

Add code
Oct 21, 2024
Viaarxiv icon

Law of the Weakest Link: Cross Capabilities of Large Language Models

Add code
Sep 30, 2024
Figure 1 for Law of the Weakest Link: Cross Capabilities of Large Language Models
Figure 2 for Law of the Weakest Link: Cross Capabilities of Large Language Models
Figure 3 for Law of the Weakest Link: Cross Capabilities of Large Language Models
Figure 4 for Law of the Weakest Link: Cross Capabilities of Large Language Models
Viaarxiv icon

The Llama 3 Herd of Models

Add code
Jul 31, 2024
Viaarxiv icon

Llama 2: Open Foundation and Fine-Tuned Chat Models

Add code
Jul 19, 2023
Figure 1 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Figure 2 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Figure 3 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Figure 4 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Viaarxiv icon

The HCI Aspects of Public Deployment of Research Chatbots: A User Study, Design Recommendations, and Open Challenges

Add code
Jun 07, 2023
Figure 1 for The HCI Aspects of Public Deployment of Research Chatbots: A User Study, Design Recommendations, and Open Challenges
Figure 2 for The HCI Aspects of Public Deployment of Research Chatbots: A User Study, Design Recommendations, and Open Challenges
Viaarxiv icon

A Theory on Adam Instability in Large-Scale Machine Learning

Add code
Apr 25, 2023
Figure 1 for A Theory on Adam Instability in Large-Scale Machine Learning
Figure 2 for A Theory on Adam Instability in Large-Scale Machine Learning
Figure 3 for A Theory on Adam Instability in Large-Scale Machine Learning
Figure 4 for A Theory on Adam Instability in Large-Scale Machine Learning
Viaarxiv icon

BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage

Add code
Aug 10, 2022
Figure 1 for BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage
Figure 2 for BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage
Figure 3 for BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage
Figure 4 for BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage
Viaarxiv icon