Picture for Charles L. A. Clarke

Charles L. A. Clarke

Beyond Utility: Evaluating LLM as Recommender

Add code
Nov 01, 2024
Figure 1 for Beyond Utility: Evaluating LLM as Recommender
Figure 2 for Beyond Utility: Evaluating LLM as Recommender
Figure 3 for Beyond Utility: Evaluating LLM as Recommender
Figure 4 for Beyond Utility: Evaluating LLM as Recommender
Viaarxiv icon

Report on the 1st Workshop on Large Language Model for Evaluation in Information Retrieval (LLM4Eval 2024) at SIGIR 2024

Add code
Aug 09, 2024
Viaarxiv icon

LLMJudge: LLMs for Relevance Judgments

Add code
Aug 09, 2024
Viaarxiv icon

Assessing and Verifying Task Utility in LLM-Powered Applications

Add code
May 03, 2024
Viaarxiv icon

Generative Information Retrieval Evaluation

Add code
Apr 11, 2024
Viaarxiv icon

A Comparison of Methods for Evaluating Generative IR

Add code
Apr 09, 2024
Viaarxiv icon

Fréchet Distance for Offline Evaluation of Information Retrieval Systems with Sparse Labels

Add code
Jan 31, 2024
Viaarxiv icon

Adapting Standard Retrieval Benchmarks to Evaluate Generated Answers

Add code
Jan 09, 2024
Viaarxiv icon

Retrieving Supporting Evidence for Generative Question Answering

Add code
Sep 20, 2023
Viaarxiv icon

Retrieving Supporting Evidence for LLMs Generated Answers

Add code
Jun 23, 2023
Viaarxiv icon