Picture for Qingyao Ai

Qingyao Ai

RbFT: Robust Fine-tuning for Retrieval-Augmented Generation against Retrieval Defects

Add code
Jan 30, 2025
Figure 1 for RbFT: Robust Fine-tuning for Retrieval-Augmented Generation against Retrieval Defects
Figure 2 for RbFT: Robust Fine-tuning for Retrieval-Augmented Generation against Retrieval Defects
Figure 3 for RbFT: Robust Fine-tuning for Retrieval-Augmented Generation against Retrieval Defects
Figure 4 for RbFT: Robust Fine-tuning for Retrieval-Augmented Generation against Retrieval Defects
Viaarxiv icon

Parametric Retrieval Augmented Generation

Add code
Jan 27, 2025
Viaarxiv icon

Foundations of GenIR

Add code
Jan 06, 2025
Figure 1 for Foundations of GenIR
Figure 2 for Foundations of GenIR
Viaarxiv icon

Improving GenIR Systems Based on User Feedback

Add code
Jan 06, 2025
Figure 1 for Improving GenIR Systems Based on User Feedback
Figure 2 for Improving GenIR Systems Based on User Feedback
Figure 3 for Improving GenIR Systems Based on User Feedback
Viaarxiv icon

Unsupervised dense retrieval with conterfactual contrastive learning

Add code
Dec 30, 2024
Viaarxiv icon

LegalAgentBench: Evaluating LLM Agents in Legal Domain

Add code
Dec 23, 2024
Viaarxiv icon

Knowledge Editing through Chain-of-Thought

Add code
Dec 23, 2024
Viaarxiv icon

LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods

Add code
Dec 10, 2024
Figure 1 for LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods
Figure 2 for LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods
Figure 3 for LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods
Figure 4 for LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods
Viaarxiv icon

CalibraEval: Calibrating Prediction Distribution to Mitigate Selection Bias in LLMs-as-Judges

Add code
Oct 20, 2024
Viaarxiv icon

LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models

Add code
Sep 30, 2024
Figure 1 for LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models
Figure 2 for LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models
Figure 3 for LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models
Figure 4 for LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models
Viaarxiv icon