Picture for Charles L. A. Clarke

Charles L. A. Clarke

LLM-based relevance assessment still can't replace human relevance assessment

Add code
Dec 22, 2024
Viaarxiv icon

EMPRA: Embedding Perturbation Rank Attack against Neural Ranking Models

Add code
Dec 20, 2024
Viaarxiv icon

Annotative Indexing

Add code
Nov 20, 2024
Viaarxiv icon

Beyond Utility: Evaluating LLM as Recommender

Add code
Nov 01, 2024
Figure 1 for Beyond Utility: Evaluating LLM as Recommender
Figure 2 for Beyond Utility: Evaluating LLM as Recommender
Figure 3 for Beyond Utility: Evaluating LLM as Recommender
Figure 4 for Beyond Utility: Evaluating LLM as Recommender
Viaarxiv icon

Report on the 1st Workshop on Large Language Model for Evaluation in Information Retrieval (LLM4Eval 2024) at SIGIR 2024

Add code
Aug 09, 2024
Figure 1 for Report on the 1st Workshop on Large Language Model for Evaluation in Information Retrieval (LLM4Eval 2024) at SIGIR 2024
Figure 2 for Report on the 1st Workshop on Large Language Model for Evaluation in Information Retrieval (LLM4Eval 2024) at SIGIR 2024
Viaarxiv icon

LLMJudge: LLMs for Relevance Judgments

Add code
Aug 09, 2024
Figure 1 for LLMJudge: LLMs for Relevance Judgments
Figure 2 for LLMJudge: LLMs for Relevance Judgments
Viaarxiv icon

Assessing and Verifying Task Utility in LLM-Powered Applications

Add code
May 03, 2024
Figure 1 for Assessing and Verifying Task Utility in LLM-Powered Applications
Figure 2 for Assessing and Verifying Task Utility in LLM-Powered Applications
Figure 3 for Assessing and Verifying Task Utility in LLM-Powered Applications
Figure 4 for Assessing and Verifying Task Utility in LLM-Powered Applications
Viaarxiv icon

Generative Information Retrieval Evaluation

Add code
Apr 11, 2024
Viaarxiv icon

A Comparison of Methods for Evaluating Generative IR

Add code
Apr 09, 2024
Viaarxiv icon

Fréchet Distance for Offline Evaluation of Information Retrieval Systems with Sparse Labels

Add code
Jan 31, 2024
Viaarxiv icon