Dan Roth

Shammie

On Reference (In-)Determinacy in Natural Language Inference

Feb 09, 2025
Via arXiv

Self-supervised Analogical Learning using Language Models

Feb 03, 2025
Via arXiv

Can we Retrieve Everything All at Once? ARM: An Alignment-Oriented LLM-based Retrieval Method

Jan 30, 2025
Via arXiv

ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding

Jan 09, 2025
Via arXiv

NAVCON: A Cognitively Inspired and Linguistically Grounded Corpus for Vision and Language Navigation

Dec 17, 2024
Via arXiv

DiverseAgentEntropy: Quantifying Black-Box LLM Uncertainty through Diverse Perspectives and Multi-Agent Interaction

Dec 12, 2024
Via arXiv

Contextualized Evaluations: Taking the Guesswork Out of Language Model Evaluations

Nov 11, 2024
Via arXiv

Benchmarking LLM Guardrails in Handling Multilingual Toxicity

Oct 29, 2024
Via arXiv

ReasonAgain: Using Extractable Symbolic Programs to Evaluate Mathematical Reasoning

Oct 24, 2024
Via arXiv

Open Domain Question Answering with Conflicting Contexts

Oct 16, 2024
Via arXiv