Picture for Mingqi Gao

Mingqi Gao

MMCR: Benchmarking Cross-Source Reasoning in Scientific Papers

Add code
Mar 21, 2025
Viaarxiv icon

Exploring the Multilingual NLG Evaluation Abilities of LLM-Based Evaluators

Add code
Mar 06, 2025
Viaarxiv icon

Aspect-Guided Multi-Level Perturbation Analysis of Large Language Models in Automated Peer Review

Add code
Feb 18, 2025
Viaarxiv icon

A Dual-Perspective NLG Meta-Evaluation Framework with Automatic Benchmark and Better Interpretability

Add code
Feb 17, 2025
Viaarxiv icon

Re-evaluating Automatic LLM System Ranking for Alignment with Human Preference

Add code
Dec 31, 2024
Figure 1 for Re-evaluating Automatic LLM System Ranking for Alignment with Human Preference
Figure 2 for Re-evaluating Automatic LLM System Ranking for Alignment with Human Preference
Figure 3 for Re-evaluating Automatic LLM System Ranking for Alignment with Human Preference
Figure 4 for Re-evaluating Automatic LLM System Ranking for Alignment with Human Preference
Viaarxiv icon

Analyzing and Evaluating Correlation Measures in NLG Meta-Evaluation

Add code
Oct 22, 2024
Figure 1 for Analyzing and Evaluating Correlation Measures in NLG Meta-Evaluation
Figure 2 for Analyzing and Evaluating Correlation Measures in NLG Meta-Evaluation
Figure 3 for Analyzing and Evaluating Correlation Measures in NLG Meta-Evaluation
Figure 4 for Analyzing and Evaluating Correlation Measures in NLG Meta-Evaluation
Viaarxiv icon

Themis: Towards Flexible and Interpretable NLG Evaluation

Add code
Jun 26, 2024
Figure 1 for Themis: Towards Flexible and Interpretable NLG Evaluation
Figure 2 for Themis: Towards Flexible and Interpretable NLG Evaluation
Figure 3 for Themis: Towards Flexible and Interpretable NLG Evaluation
Figure 4 for Themis: Towards Flexible and Interpretable NLG Evaluation
Viaarxiv icon

PVUW 2024 Challenge on Complex Video Understanding: Methods and Results

Add code
Jun 24, 2024
Figure 1 for PVUW 2024 Challenge on Complex Video Understanding: Methods and Results
Figure 2 for PVUW 2024 Challenge on Complex Video Understanding: Methods and Results
Figure 3 for PVUW 2024 Challenge on Complex Video Understanding: Methods and Results
Figure 4 for PVUW 2024 Challenge on Complex Video Understanding: Methods and Results
Viaarxiv icon

Better than Random: Reliable NLG Human Evaluation with Constrained Active Sampling

Add code
Jun 12, 2024
Viaarxiv icon

1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation

Add code
Jun 11, 2024
Figure 1 for 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation
Figure 2 for 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation
Figure 3 for 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation
Figure 4 for 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation
Viaarxiv icon