Picture for Hossein A. Rahmani

Hossein A. Rahmani

AgentCoMa: A Compositional Benchmark Mixing Commonsense and Mathematical Reasoning in Real-World Scenarios

Add code
Aug 27, 2025
Viaarxiv icon

Towards Understanding Bias in Synthetic Data for Evaluation

Add code
Jun 12, 2025
Viaarxiv icon

Judging the Judges: A Collection of LLM-Generated Relevance Judgements

Add code
Feb 19, 2025
Figure 1 for Judging the Judges: A Collection of LLM-Generated Relevance Judgements
Figure 2 for Judging the Judges: A Collection of LLM-Generated Relevance Judgements
Figure 3 for Judging the Judges: A Collection of LLM-Generated Relevance Judgements
Figure 4 for Judging the Judges: A Collection of LLM-Generated Relevance Judgements
Viaarxiv icon

JudgeBlender: Ensembling Judgments for Automatic Relevance Assessment

Add code
Dec 17, 2024
Figure 1 for JudgeBlender: Ensembling Judgments for Automatic Relevance Assessment
Figure 2 for JudgeBlender: Ensembling Judgments for Automatic Relevance Assessment
Figure 3 for JudgeBlender: Ensembling Judgments for Automatic Relevance Assessment
Figure 4 for JudgeBlender: Ensembling Judgments for Automatic Relevance Assessment
Viaarxiv icon

SynDL: A Large-Scale Synthetic Test Collection for Passage Retrieval

Add code
Aug 30, 2024
Figure 1 for SynDL: A Large-Scale Synthetic Test Collection for Passage Retrieval
Figure 2 for SynDL: A Large-Scale Synthetic Test Collection for Passage Retrieval
Figure 3 for SynDL: A Large-Scale Synthetic Test Collection for Passage Retrieval
Figure 4 for SynDL: A Large-Scale Synthetic Test Collection for Passage Retrieval
Viaarxiv icon

LLMJudge: LLMs for Relevance Judgments

Add code
Aug 09, 2024
Figure 1 for LLMJudge: LLMs for Relevance Judgments
Figure 2 for LLMJudge: LLMs for Relevance Judgments
Viaarxiv icon

Report on the 1st Workshop on Large Language Model for Evaluation in Information Retrieval (LLM4Eval 2024) at SIGIR 2024

Add code
Aug 09, 2024
Figure 1 for Report on the 1st Workshop on Large Language Model for Evaluation in Information Retrieval (LLM4Eval 2024) at SIGIR 2024
Figure 2 for Report on the 1st Workshop on Large Language Model for Evaluation in Information Retrieval (LLM4Eval 2024) at SIGIR 2024
Viaarxiv icon

Understanding the Role of User Profile in the Personalization of Large Language Models

Add code
Jun 22, 2024
Figure 1 for Understanding the Role of User Profile in the Personalization of Large Language Models
Figure 2 for Understanding the Role of User Profile in the Personalization of Large Language Models
Figure 3 for Understanding the Role of User Profile in the Personalization of Large Language Models
Figure 4 for Understanding the Role of User Profile in the Personalization of Large Language Models
Viaarxiv icon

Synthetic Test Collections for Retrieval Evaluation

Add code
May 13, 2024
Figure 1 for Synthetic Test Collections for Retrieval Evaluation
Figure 2 for Synthetic Test Collections for Retrieval Evaluation
Figure 3 for Synthetic Test Collections for Retrieval Evaluation
Figure 4 for Synthetic Test Collections for Retrieval Evaluation
Viaarxiv icon

Clarifying the Path to User Satisfaction: An Investigation into Clarification Usefulness

Add code
Feb 02, 2024
Figure 1 for Clarifying the Path to User Satisfaction: An Investigation into Clarification Usefulness
Figure 2 for Clarifying the Path to User Satisfaction: An Investigation into Clarification Usefulness
Figure 3 for Clarifying the Path to User Satisfaction: An Investigation into Clarification Usefulness
Figure 4 for Clarifying the Path to User Satisfaction: An Investigation into Clarification Usefulness
Viaarxiv icon