Picture for Clemencia Siro

Clemencia Siro

Shammie

Judging the Judges: A Collection of LLM-Generated Relevance Judgements

Add code
Feb 19, 2025
Viaarxiv icon

Multi-Turn Multi-Modal Question Clarification for Enhanced Conversational Understanding

Add code
Feb 17, 2025
Viaarxiv icon

AGENT-CQ: Automatic Generation and Evaluation of Clarifying Questions for Conversational Search with LLMs

Add code
Oct 25, 2024
Figure 1 for AGENT-CQ: Automatic Generation and Evaluation of Clarifying Questions for Conversational Search with LLMs
Figure 2 for AGENT-CQ: Automatic Generation and Evaluation of Clarifying Questions for Conversational Search with LLMs
Figure 3 for AGENT-CQ: Automatic Generation and Evaluation of Clarifying Questions for Conversational Search with LLMs
Figure 4 for AGENT-CQ: Automatic Generation and Evaluation of Clarifying Questions for Conversational Search with LLMs
Viaarxiv icon

LLMJudge: LLMs for Relevance Judgments

Add code
Aug 09, 2024
Figure 1 for LLMJudge: LLMs for Relevance Judgments
Figure 2 for LLMJudge: LLMs for Relevance Judgments
Viaarxiv icon

Report on the 1st Workshop on Large Language Model for Evaluation in Information Retrieval (LLM4Eval 2024) at SIGIR 2024

Add code
Aug 09, 2024
Figure 1 for Report on the 1st Workshop on Large Language Model for Evaluation in Information Retrieval (LLM4Eval 2024) at SIGIR 2024
Figure 2 for Report on the 1st Workshop on Large Language Model for Evaluation in Information Retrieval (LLM4Eval 2024) at SIGIR 2024
Viaarxiv icon

Rethinking the Evaluation of Dialogue Systems: Effects of User Feedback on Crowdworkers and LLMs

Add code
Apr 19, 2024
Viaarxiv icon

Context Does Matter: Implications for Crowdsourced Evaluation Labels in Task-Oriented Dialogue Systems

Add code
Apr 15, 2024
Figure 1 for Context Does Matter: Implications for Crowdsourced Evaluation Labels in Task-Oriented Dialogue Systems
Figure 2 for Context Does Matter: Implications for Crowdsourced Evaluation Labels in Task-Oriented Dialogue Systems
Figure 3 for Context Does Matter: Implications for Crowdsourced Evaluation Labels in Task-Oriented Dialogue Systems
Figure 4 for Context Does Matter: Implications for Crowdsourced Evaluation Labels in Task-Oriented Dialogue Systems
Viaarxiv icon

Asking Multimodal Clarifying Questions in Mixed-Initiative Conversational Search

Add code
Feb 12, 2024
Viaarxiv icon

AfriMTE and AfriCOMET: Empowering COMET to Embrace Under-resourced African Languages

Add code
Nov 16, 2023
Figure 1 for AfriMTE and AfriCOMET: Empowering COMET to Embrace Under-resourced African Languages
Figure 2 for AfriMTE and AfriCOMET: Empowering COMET to Embrace Under-resourced African Languages
Figure 3 for AfriMTE and AfriCOMET: Empowering COMET to Embrace Under-resourced African Languages
Figure 4 for AfriMTE and AfriCOMET: Empowering COMET to Embrace Under-resourced African Languages
Viaarxiv icon

AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages

Add code
May 11, 2023
Figure 1 for AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages
Figure 2 for AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages
Figure 3 for AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages
Figure 4 for AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages
Viaarxiv icon