Ori Yoran

LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations

Sep 03, 2025

The KoLMogorov Test: Compression by Code Generation

Mar 18, 2025

Preventing Rogue Agents Improves Multi-Agent Collaboration

Feb 09, 2025

The BrowserGym Ecosystem for Web Agent Research

Dec 10, 2024

AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?

Jul 22, 2024

From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty

Jul 08, 2024

Making Retrieval-Augmented Language Models Robust to Irrelevant Context

Oct 02, 2023

Evaluating the Ripple Effects of Knowledge Editing in Language Models

Jul 24, 2023

Answering Questions by Meta-Reasoning over Multiple Chains of Thought

Apr 25, 2023

QAMPARI: An Open-domain Question Answering Benchmark for Questions with Many Answers from Multiple Paragraphs

May 26, 2022