Picture for Negar Arabzadeh

Negar Arabzadeh

exHarmony: Authorship and Citations for Benchmarking the Reviewer Assignment Problem

Add code
Feb 11, 2025
Viaarxiv icon

Benchmarking Prompt Sensitivity in Large Language Models

Add code
Feb 09, 2025
Viaarxiv icon

EMPRA: Embedding Perturbation Rank Attack against Neural Ranking Models

Add code
Dec 20, 2024
Figure 1 for EMPRA: Embedding Perturbation Rank Attack against Neural Ranking Models
Figure 2 for EMPRA: Embedding Perturbation Rank Attack against Neural Ranking Models
Figure 3 for EMPRA: Embedding Perturbation Rank Attack against Neural Ranking Models
Figure 4 for EMPRA: Embedding Perturbation Rank Attack against Neural Ranking Models
Viaarxiv icon

Offline Evaluation of Set-Based Text-to-Image Generation

Add code
Oct 22, 2024
Viaarxiv icon

IDAT: A Multi-Modal Dataset and Toolkit for Building and Evaluating Interactive Task-Solving Agents

Add code
Jul 12, 2024
Figure 1 for IDAT: A Multi-Modal Dataset and Toolkit for Building and Evaluating Interactive Task-Solving Agents
Figure 2 for IDAT: A Multi-Modal Dataset and Toolkit for Building and Evaluating Interactive Task-Solving Agents
Figure 3 for IDAT: A Multi-Modal Dataset and Toolkit for Building and Evaluating Interactive Task-Solving Agents
Figure 4 for IDAT: A Multi-Modal Dataset and Toolkit for Building and Evaluating Interactive Task-Solving Agents
Viaarxiv icon

Assessing and Verifying Task Utility in LLM-Powered Applications

Add code
May 03, 2024
Figure 1 for Assessing and Verifying Task Utility in LLM-Powered Applications
Figure 2 for Assessing and Verifying Task Utility in LLM-Powered Applications
Figure 3 for Assessing and Verifying Task Utility in LLM-Powered Applications
Figure 4 for Assessing and Verifying Task Utility in LLM-Powered Applications
Viaarxiv icon

Ranked List Truncation for Large Language Model-based Re-Ranking

Add code
Apr 28, 2024
Viaarxiv icon

Generative Information Retrieval Evaluation

Add code
Apr 11, 2024
Viaarxiv icon

A Comparison of Methods for Evaluating Generative IR

Add code
Apr 09, 2024
Viaarxiv icon

Query Performance Prediction using Relevance Judgments Generated by Large Language Models

Add code
Apr 01, 2024
Viaarxiv icon