Picture for Seunghyun Won

Seunghyun Won

EHRNoteQA: A Patient-Specific Question Answering Benchmark for Evaluating Large Language Models in Clinical Settings

Add code
Feb 27, 2024
Figure 1 for EHRNoteQA: A Patient-Specific Question Answering Benchmark for Evaluating Large Language Models in Clinical Settings
Figure 2 for EHRNoteQA: A Patient-Specific Question Answering Benchmark for Evaluating Large Language Models in Clinical Settings
Figure 3 for EHRNoteQA: A Patient-Specific Question Answering Benchmark for Evaluating Large Language Models in Clinical Settings
Figure 4 for EHRNoteQA: A Patient-Specific Question Answering Benchmark for Evaluating Large Language Models in Clinical Settings
Viaarxiv icon

KorNAT: LLM Alignment Benchmark for Korean Social Values and Common Knowledge

Add code
Feb 22, 2024
Viaarxiv icon

VisAlign: Dataset for Measuring the Degree of Alignment between AI and Humans in Visual Perception

Add code
Aug 03, 2023
Viaarxiv icon