Picture for Ronak Pradeep

Ronak Pradeep

Michael Pokorny

The Great Nugget Recall: Automating Fact Extraction and RAG Evaluation with Large Language Models

Add code
Apr 21, 2025
Viaarxiv icon

Support Evaluation for the TREC 2024 RAG Track: Comparing Human versus LLM Judges

Add code
Apr 21, 2025
Viaarxiv icon

Humanity's Last Exam

Add code
Jan 24, 2025
Viaarxiv icon

Initial Nugget Evaluation Results for the TREC 2024 RAG Track with the AutoNuggetizer Framework

Add code
Nov 14, 2024
Figure 1 for Initial Nugget Evaluation Results for the TREC 2024 RAG Track with the AutoNuggetizer Framework
Figure 2 for Initial Nugget Evaluation Results for the TREC 2024 RAG Track with the AutoNuggetizer Framework
Figure 3 for Initial Nugget Evaluation Results for the TREC 2024 RAG Track with the AutoNuggetizer Framework
Figure 4 for Initial Nugget Evaluation Results for the TREC 2024 RAG Track with the AutoNuggetizer Framework
Viaarxiv icon

A Large-Scale Study of Relevance Assessments with Large Language Models: An Initial Look

Add code
Nov 13, 2024
Viaarxiv icon

An Early FIRST Reproduction and Improvements to Single-Token Decoding for Fast Listwise Reranking

Add code
Nov 08, 2024
Viaarxiv icon

ConvKGYarn: Spinning Configurable and Scalable Conversational Knowledge Graph QA datasets with Large Language Models

Add code
Aug 12, 2024
Viaarxiv icon

Ragnarök: A Reusable RAG Framework and Baselines for TREC 2024 Retrieval-Augmented Generation Track

Add code
Jun 24, 2024
Viaarxiv icon

Prompts as Auto-Optimized Training Hyperparameters: Training Best-in-Class IR Models from Scratch with 10 Gold Labels

Add code
Jun 17, 2024
Viaarxiv icon

UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance Assessor

Add code
Jun 10, 2024
Viaarxiv icon