
Haitao Li

LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods

Dec 10, 2024
Via arXiv

De-biased Multimodal Electrocardiogram Analysis

Nov 22, 2024
Via arXiv

Large-scale cross-modality pretrained model enhances cardiovascular state estimation and cardiomyopathy detection from electrocardiograms: An AI system development and multi-center validation study

Nov 19, 2024
Via arXiv

CalibraEval: Calibrating Prediction Distribution to Mitigate Selection Bias in LLMs-as-Judges

Oct 20, 2024
Via arXiv

An Automatic and Cost-Efficient Peer-Review Framework for Language Generation Evaluation

Oct 16, 2024
Via arXiv

LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models

Sep 30, 2024
Via arXiv

Towards an In-Depth Comprehension of Case Relevance for Better Legal Retrieval

Apr 01, 2024
Via arXiv

DELTA: Pre-train a Discriminative Encoder for Legal Case Retrieval via Structural Word Alignment

Mar 27, 2024
Via arXiv

BLADE: Enhancing Black-box Large Language Models with Small Domain-Specific Models

Mar 27, 2024
Via arXiv

Evaluation Ethics of LLMs in Legal Domain

Mar 17, 2024
Via arXiv