Picture for Qinyao Ai

Qinyao Ai

Benchmarking LLM-as-a-Judge for Long-Form Output Evaluation

Add code
Jun 01, 2026
Viaarxiv icon

An Automatic and Cost-Efficient Peer-Review Framework for Language Generation Evaluation

Add code
Oct 16, 2024
Figure 1 for An Automatic and Cost-Efficient Peer-Review Framework for Language Generation Evaluation
Figure 2 for An Automatic and Cost-Efficient Peer-Review Framework for Language Generation Evaluation
Figure 3 for An Automatic and Cost-Efficient Peer-Review Framework for Language Generation Evaluation
Figure 4 for An Automatic and Cost-Efficient Peer-Review Framework for Language Generation Evaluation
Viaarxiv icon