Picture for Avi Caciularu

Avi Caciularu

MDCure: A Scalable Pipeline for Multi-Document Instruction-Following

Add code
Oct 30, 2024
Viaarxiv icon

CoverBench: A Challenging Benchmark for Complex Claim Verification

Add code
Aug 06, 2024
Viaarxiv icon

SEAM: A Stochastic Benchmark for Multi-Document Tasks

Add code
Jun 23, 2024
Viaarxiv icon

Identifying User Goals from UI Trajectories

Add code
Jun 20, 2024
Viaarxiv icon

Can Few-shot Work in Long-Context? Recycling the Context to Generate Demonstrations

Add code
Jun 19, 2024
Viaarxiv icon

TACT: Advancing Complex Aggregative Reasoning with Information Extraction Tools

Add code
Jun 05, 2024
Viaarxiv icon

Unpacking Tokenization: Evaluating Text Compression and its Correlation with Model Performance

Add code
Mar 10, 2024
Viaarxiv icon

Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models

Add code
Jan 12, 2024
Figure 1 for Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models
Figure 2 for Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models
Figure 3 for Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models
Figure 4 for Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models
Viaarxiv icon

Optimizing Retrieval-augmented Reader Models via Token Elimination

Add code
Oct 20, 2023
Figure 1 for Optimizing Retrieval-augmented Reader Models via Token Elimination
Figure 2 for Optimizing Retrieval-augmented Reader Models via Token Elimination
Figure 3 for Optimizing Retrieval-augmented Reader Models via Token Elimination
Figure 4 for Optimizing Retrieval-augmented Reader Models via Token Elimination
Viaarxiv icon

The Curious Case of Hallucinatory Unanswerablity: Finding Truths in the Hidden States of Over-Confident Large Language Models

Add code
Oct 18, 2023
Viaarxiv icon