Jimmy Ba

Report Cards: Qualitative Evaluation of Language Models Using Natural Language Summaries

Sep 01, 2024
Via arXiv

Decomposed Prompting to Answer Questions on a Course Discussion Board

Jul 30, 2024
Via arXiv

The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

Mar 06, 2024
Via arXiv

Using Large Language Models for Hyperparameter Optimization

Dec 07, 2023
Via arXiv

OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text

Oct 10, 2023
Via arXiv

Identifying the Risks of LM Agents with an LM-Emulated Sandbox

Sep 25, 2023
Via arXiv

STEVE-1: A Generative Model for Text-to-Behavior in Minecraft

Jun 05, 2023
Via arXiv

Training on Thin Air: Improve Image Classification with Generated Data

May 24, 2023
Via arXiv

AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback

May 22, 2023
Via arXiv

Clinical Camel: An Open-Source Expert-Level Medical Language Model with Dialogue-Based Knowledge Encoding

May 19, 2023
Via arXiv