Benjamin Newman

ArxivDIGESTables: Synthesizing Scientific Literature into Tables using Language Models

Oct 25, 2024

WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries

Jul 24, 2024

Assessment of Sports Concussion in Female Athletes: A Role for Neuroinformatics?

Jan 23, 2024

The Generative AI Paradox: "What It Can Create, It May Not Understand"

Oct 31, 2023

A Controllable QA-based Framework for Decontextualization

May 24, 2023

Comparing Sentence-Level Suggestions to Message-Level Suggestions in AI-Mediated Communication

Feb 26, 2023

Holistic Evaluation of Language Models

Nov 16, 2022

P-Adapters: Robustly Extracting Factual Information from Language Models with Diverse Prompts

Oct 14, 2021

Refining Targeted Syntactic Evaluation of Language Models

Apr 19, 2021

Optimal Assistance for Object-Rearrangement Tasks in Augmented Reality

Oct 14, 2020