Picture for Qian Dong

Qian Dong

LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods

Add code
Dec 10, 2024
Viaarxiv icon

CalibraEval: Calibrating Prediction Distribution to Mitigate Selection Bias in LLMs-as-Judges

Add code
Oct 20, 2024
Viaarxiv icon

BLADE: Enhancing Black-box Large Language Models with Small Domain-Specific Models

Add code
Mar 27, 2024
Figure 1 for BLADE: Enhancing Black-box Large Language Models with Small Domain-Specific Models
Figure 2 for BLADE: Enhancing Black-box Large Language Models with Small Domain-Specific Models
Figure 3 for BLADE: Enhancing Black-box Large Language Models with Small Domain-Specific Models
Figure 4 for BLADE: Enhancing Black-box Large Language Models with Small Domain-Specific Models
Viaarxiv icon

DELTA: Pre-train a Discriminative Encoder for Legal Case Retrieval via Structural Word Alignment

Add code
Mar 27, 2024
Viaarxiv icon

Aligning the Capabilities of Large Language Models with the Context of Information Retrieval via Contrastive Feedback

Add code
Sep 29, 2023
Figure 1 for Aligning the Capabilities of Large Language Models with the Context of Information Retrieval via Contrastive Feedback
Figure 2 for Aligning the Capabilities of Large Language Models with the Context of Information Retrieval via Contrastive Feedback
Figure 3 for Aligning the Capabilities of Large Language Models with the Context of Information Retrieval via Contrastive Feedback
Figure 4 for Aligning the Capabilities of Large Language Models with the Context of Information Retrieval via Contrastive Feedback
Viaarxiv icon

I^3 Retriever: Incorporating Implicit Interaction in Pre-trained Language Models for Passage Retrieval

Add code
Jun 04, 2023
Viaarxiv icon

SAILER: Structure-aware Pre-trained Language Model for Legal Case Retrieval

Add code
Apr 22, 2023
Viaarxiv icon

T2Ranking: A large-scale Chinese Benchmark for Passage Ranking

Add code
Apr 07, 2023
Figure 1 for T2Ranking: A large-scale Chinese Benchmark for Passage Ranking
Figure 2 for T2Ranking: A large-scale Chinese Benchmark for Passage Ranking
Figure 3 for T2Ranking: A large-scale Chinese Benchmark for Passage Ranking
Figure 4 for T2Ranking: A large-scale Chinese Benchmark for Passage Ranking
Viaarxiv icon

Social4Rec: Distilling User Preference from Social Graph for Video Recommendation in Tencent

Add code
Feb 23, 2023
Viaarxiv icon

Layout-aware Webpage Quality Assessment

Add code
Feb 05, 2023
Viaarxiv icon