Picture for Nick Craswell

Nick Craswell

Initial Nugget Evaluation Results for the TREC 2024 RAG Track with the AutoNuggetizer Framework

Add code
Nov 14, 2024
Figure 1 for Initial Nugget Evaluation Results for the TREC 2024 RAG Track with the AutoNuggetizer Framework
Figure 2 for Initial Nugget Evaluation Results for the TREC 2024 RAG Track with the AutoNuggetizer Framework
Figure 3 for Initial Nugget Evaluation Results for the TREC 2024 RAG Track with the AutoNuggetizer Framework
Figure 4 for Initial Nugget Evaluation Results for the TREC 2024 RAG Track with the AutoNuggetizer Framework
Viaarxiv icon

A Large-Scale Study of Relevance Assessments with Large Language Models: An Initial Look

Add code
Nov 13, 2024
Viaarxiv icon

SynDL: A Large-Scale Synthetic Test Collection for Passage Retrieval

Add code
Aug 30, 2024
Figure 1 for SynDL: A Large-Scale Synthetic Test Collection for Passage Retrieval
Figure 2 for SynDL: A Large-Scale Synthetic Test Collection for Passage Retrieval
Figure 3 for SynDL: A Large-Scale Synthetic Test Collection for Passage Retrieval
Figure 4 for SynDL: A Large-Scale Synthetic Test Collection for Passage Retrieval
Viaarxiv icon

LLMJudge: LLMs for Relevance Judgments

Add code
Aug 09, 2024
Figure 1 for LLMJudge: LLMs for Relevance Judgments
Figure 2 for LLMJudge: LLMs for Relevance Judgments
Viaarxiv icon

Report on the 1st Workshop on Large Language Model for Evaluation in Information Retrieval (LLM4Eval 2024) at SIGIR 2024

Add code
Aug 09, 2024
Figure 1 for Report on the 1st Workshop on Large Language Model for Evaluation in Information Retrieval (LLM4Eval 2024) at SIGIR 2024
Figure 2 for Report on the 1st Workshop on Large Language Model for Evaluation in Information Retrieval (LLM4Eval 2024) at SIGIR 2024
Viaarxiv icon

Ragnarök: A Reusable RAG Framework and Baselines for TREC 2024 Retrieval-Augmented Generation Track

Add code
Jun 24, 2024
Viaarxiv icon

UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance Assessor

Add code
Jun 10, 2024
Viaarxiv icon

Synthetic Test Collections for Retrieval Evaluation

Add code
May 13, 2024
Figure 1 for Synthetic Test Collections for Retrieval Evaluation
Figure 2 for Synthetic Test Collections for Retrieval Evaluation
Figure 3 for Synthetic Test Collections for Retrieval Evaluation
Figure 4 for Synthetic Test Collections for Retrieval Evaluation
Viaarxiv icon

MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels

Add code
May 13, 2024
Figure 1 for MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels
Figure 2 for MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels
Figure 3 for MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels
Figure 4 for MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels
Viaarxiv icon

Towards Group-aware Search Success

Add code
Apr 26, 2024
Figure 1 for Towards Group-aware Search Success
Figure 2 for Towards Group-aware Search Success
Figure 3 for Towards Group-aware Search Success
Figure 4 for Towards Group-aware Search Success
Viaarxiv icon