Picture for Qingyao Ai

Qingyao Ai

Improving GenIR Systems Based on User Feedback

Add code
Jan 06, 2025
Figure 1 for Improving GenIR Systems Based on User Feedback
Figure 2 for Improving GenIR Systems Based on User Feedback
Figure 3 for Improving GenIR Systems Based on User Feedback
Viaarxiv icon

Foundations of GenIR

Add code
Jan 06, 2025
Figure 1 for Foundations of GenIR
Figure 2 for Foundations of GenIR
Viaarxiv icon

Unsupervised dense retrieval with conterfactual contrastive learning

Add code
Dec 30, 2024
Viaarxiv icon

LegalAgentBench: Evaluating LLM Agents in Legal Domain

Add code
Dec 23, 2024
Viaarxiv icon

Knowledge Editing through Chain-of-Thought

Add code
Dec 23, 2024
Viaarxiv icon

LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods

Add code
Dec 10, 2024
Figure 1 for LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods
Figure 2 for LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods
Figure 3 for LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods
Figure 4 for LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods
Viaarxiv icon

CalibraEval: Calibrating Prediction Distribution to Mitigate Selection Bias in LLMs-as-Judges

Add code
Oct 20, 2024
Viaarxiv icon

LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models

Add code
Sep 30, 2024
Figure 1 for LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models
Figure 2 for LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models
Figure 3 for LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models
Figure 4 for LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models
Viaarxiv icon

LeKUBE: A Legal Knowledge Update BEnchmark

Add code
Jul 19, 2024
Figure 1 for LeKUBE: A Legal Knowledge Update BEnchmark
Figure 2 for LeKUBE: A Legal Knowledge Update BEnchmark
Figure 3 for LeKUBE: A Legal Knowledge Update BEnchmark
Figure 4 for LeKUBE: A Legal Knowledge Update BEnchmark
Viaarxiv icon

Mitigating Entity-Level Hallucination in Large Language Models

Add code
Jul 12, 2024
Figure 1 for Mitigating Entity-Level Hallucination in Large Language Models
Figure 2 for Mitigating Entity-Level Hallucination in Large Language Models
Figure 3 for Mitigating Entity-Level Hallucination in Large Language Models
Figure 4 for Mitigating Entity-Level Hallucination in Large Language Models
Viaarxiv icon