Picture for Jiali Pang

Jiali Pang

LLM-Mini-CEX: Automatic Evaluation of Large Language Model for Diagnostic Conversation

Add code
Aug 15, 2023
Viaarxiv icon

MedGPTEval: A Dataset and Benchmark to Evaluate Responses of Large Language Models in Medicine

Add code
May 12, 2023
Figure 1 for MedGPTEval: A Dataset and Benchmark to Evaluate Responses of Large Language Models in Medicine
Figure 2 for MedGPTEval: A Dataset and Benchmark to Evaluate Responses of Large Language Models in Medicine
Figure 3 for MedGPTEval: A Dataset and Benchmark to Evaluate Responses of Large Language Models in Medicine
Figure 4 for MedGPTEval: A Dataset and Benchmark to Evaluate Responses of Large Language Models in Medicine
Viaarxiv icon