Mingqi Gao

Aspect-Guided Multi-Level Perturbation Analysis of Large Language Models in Automated Peer Review

Feb 18, 2025
Via arXiv

A Dual-Perspective NLG Meta-Evaluation Framework with Automatic Benchmark and Better Interpretability

Feb 17, 2025
Via arXiv

Re-evaluating Automatic LLM System Ranking for Alignment with Human Preference

Dec 31, 2024
Via arXiv

Analyzing and Evaluating Correlation Measures in NLG Meta-Evaluation

Oct 22, 2024
Via arXiv

Themis: Towards Flexible and Interpretable NLG Evaluation

Jun 26, 2024
Via arXiv

PVUW 2024 Challenge on Complex Video Understanding: Methods and Results

Jun 24, 2024
Via arXiv

Better than Random: Reliable NLG Human Evaluation with Constrained Active Sampling

Jun 12, 2024
Via arXiv

1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation

Jun 11, 2024
Via arXiv

Place Anything into Any Video

Feb 22, 2024
Via arXiv

Are LLM-based Evaluators Confusing NLG Quality Criteria?

Feb 19, 2024
Via arXiv