Picture for Jannis Bulian

Jannis Bulian

How Susceptible are LLMs to Influence in Prompts?

Add code
Aug 17, 2024
Figure 1 for How Susceptible are LLMs to Influence in Prompts?
Figure 2 for How Susceptible are LLMs to Influence in Prompts?
Figure 3 for How Susceptible are LLMs to Influence in Prompts?
Figure 4 for How Susceptible are LLMs to Influence in Prompts?
Viaarxiv icon

On scalable oversight with weak LLMs judging strong LLMs

Add code
Jul 05, 2024
Figure 1 for On scalable oversight with weak LLMs judging strong LLMs
Figure 2 for On scalable oversight with weak LLMs judging strong LLMs
Figure 3 for On scalable oversight with weak LLMs judging strong LLMs
Figure 4 for On scalable oversight with weak LLMs judging strong LLMs
Viaarxiv icon

Assessing Large Language Models on Climate Information

Add code
Oct 04, 2023
Figure 1 for Assessing Large Language Models on Climate Information
Figure 2 for Assessing Large Language Models on Climate Information
Figure 3 for Assessing Large Language Models on Climate Information
Figure 4 for Assessing Large Language Models on Climate Information
Viaarxiv icon

Scaling Up Models and Data with $\texttt{t5x}$ and $\texttt{seqio}$

Add code
Mar 31, 2022
Figure 1 for Scaling Up Models and Data with $\texttt{t5x}$ and $\texttt{seqio}$
Figure 2 for Scaling Up Models and Data with $\texttt{t5x}$ and $\texttt{seqio}$
Viaarxiv icon

Tomayto, Tomahto. Beyond Token-level Answer Equivalence for Question Answering Evaluation

Add code
Feb 15, 2022
Viaarxiv icon

Fool Me Twice: Entailment from Wikipedia Gamification

Add code
Apr 10, 2021
Figure 1 for Fool Me Twice: Entailment from Wikipedia Gamification
Figure 2 for Fool Me Twice: Entailment from Wikipedia Gamification
Figure 3 for Fool Me Twice: Entailment from Wikipedia Gamification
Figure 4 for Fool Me Twice: Entailment from Wikipedia Gamification
Viaarxiv icon

CLIMATE-FEVER: A Dataset for Verification of Real-World Climate Claims

Add code
Jan 02, 2021
Figure 1 for CLIMATE-FEVER: A Dataset for Verification of Real-World Climate Claims
Figure 2 for CLIMATE-FEVER: A Dataset for Verification of Real-World Climate Claims
Figure 3 for CLIMATE-FEVER: A Dataset for Verification of Real-World Climate Claims
Figure 4 for CLIMATE-FEVER: A Dataset for Verification of Real-World Climate Claims
Viaarxiv icon

Meta Answering for Machine Reading

Add code
Nov 11, 2019
Figure 1 for Meta Answering for Machine Reading
Figure 2 for Meta Answering for Machine Reading
Figure 3 for Meta Answering for Machine Reading
Figure 4 for Meta Answering for Machine Reading
Viaarxiv icon

Learning to Coordinate Multiple Reinforcement Learning Agents for Diverse Query Reformulation

Add code
Sep 27, 2018
Figure 1 for Learning to Coordinate Multiple Reinforcement Learning Agents for Diverse Query Reformulation
Figure 2 for Learning to Coordinate Multiple Reinforcement Learning Agents for Diverse Query Reformulation
Figure 3 for Learning to Coordinate Multiple Reinforcement Learning Agents for Diverse Query Reformulation
Figure 4 for Learning to Coordinate Multiple Reinforcement Learning Agents for Diverse Query Reformulation
Viaarxiv icon

Ask the Right Questions: Active Question Reformulation with Reinforcement Learning

Add code
Mar 02, 2018
Figure 1 for Ask the Right Questions: Active Question Reformulation with Reinforcement Learning
Figure 2 for Ask the Right Questions: Active Question Reformulation with Reinforcement Learning
Figure 3 for Ask the Right Questions: Active Question Reformulation with Reinforcement Learning
Viaarxiv icon