Eric Hambro

Know When To Stop: A Study of Semantic Drift in Text Generation

Apr 08, 2024

Teaching Large Language Models to Reason with Reinforcement Learning

Mar 07, 2024

Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts

Feb 26, 2024

GLoRe: When, Where, and How to Improve LLM Reasoning via Global and Local Refinements

Feb 13, 2024

Generalization to New Sequential Decision Making Tasks with In-Context Learning

Dec 06, 2023

Understanding the Effects of RLHF on LLM Generalisation and Diversity

Oct 10, 2023

LLaMA: Open and Efficient Foundation Language Models

Feb 27, 2023

Dungeons and Data: A Large-Scale NetHack Dataset

Nov 22, 2022

Insights From the NeurIPS 2021 NetHack Challenge

Mar 22, 2022

MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research

Sep 27, 2021