Picture for Alexander R. Fabbri

Alexander R. Fabbri

Evaluating Cultural and Social Awareness of LLM Web Agents

Add code
Oct 30, 2024
Viaarxiv icon

ReIFE: Re-evaluating Instruction-Following Evaluation

Add code
Oct 09, 2024
Figure 1 for ReIFE: Re-evaluating Instruction-Following Evaluation
Figure 2 for ReIFE: Re-evaluating Instruction-Following Evaluation
Figure 3 for ReIFE: Re-evaluating Instruction-Following Evaluation
Figure 4 for ReIFE: Re-evaluating Instruction-Following Evaluation
Viaarxiv icon

Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems

Add code
Jul 01, 2024
Figure 1 for Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems
Figure 2 for Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems
Figure 3 for Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems
Figure 4 for Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems
Viaarxiv icon

Investigating the prompt leakage effect and black-box defenses for multi-turn LLM interactions

Add code
Apr 26, 2024
Figure 1 for Investigating the prompt leakage effect and black-box defenses for multi-turn LLM interactions
Figure 2 for Investigating the prompt leakage effect and black-box defenses for multi-turn LLM interactions
Figure 3 for Investigating the prompt leakage effect and black-box defenses for multi-turn LLM interactions
Figure 4 for Investigating the prompt leakage effect and black-box defenses for multi-turn LLM interactions
Viaarxiv icon

Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable Summarization

Add code
Nov 15, 2023
Viaarxiv icon

Lexical Repetitions Lead to Rote Learning: Unveiling the Impact of Lexical Overlap in Train and Test Reference Summaries

Add code
Nov 15, 2023
Viaarxiv icon

Embrace Divergence for Richer Insights: A Multi-document Summarization Benchmark and a Case Study on Summarizing Diverse Information from News Articles

Add code
Sep 17, 2023
Viaarxiv icon

Generating EDU Extracts for Plan-Guided Summary Re-Ranking

Add code
May 28, 2023
Figure 1 for Generating EDU Extracts for Plan-Guided Summary Re-Ranking
Figure 2 for Generating EDU Extracts for Plan-Guided Summary Re-Ranking
Figure 3 for Generating EDU Extracts for Plan-Guided Summary Re-Ranking
Figure 4 for Generating EDU Extracts for Plan-Guided Summary Re-Ranking
Viaarxiv icon

LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond

Add code
May 23, 2023
Figure 1 for LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond
Figure 2 for LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond
Figure 3 for LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond
Figure 4 for LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond
Viaarxiv icon

On Learning to Summarize with Large Language Models as References

Add code
May 23, 2023
Viaarxiv icon