Picture for Danielle Bitterman

Danielle Bitterman

Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs

Add code
Dec 18, 2024
Figure 1 for Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs
Figure 2 for Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs
Figure 3 for Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs
Figure 4 for Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs
Viaarxiv icon

ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?

Add code
Nov 10, 2024
Viaarxiv icon

Position Paper On Diagnostic Uncertainty Estimation from Large Language Models: Next-Word Probability Is Not Pre-test Probability

Add code
Nov 07, 2024
Viaarxiv icon

Mapping Bias in Vision Language Models: Signposts, Pitfalls, and the Road Ahead

Add code
Oct 17, 2024
Viaarxiv icon

Wait, but Tylenol is Acetaminophen... Investigating and Improving Language Models' Ability to Resist Requests for Misinformation

Add code
Sep 30, 2024
Viaarxiv icon

When Raw Data Prevails: Are Large Language Model Embeddings Effective in Numerical Data Representation for Medical Machine Learning Applications?

Add code
Aug 15, 2024
Figure 1 for When Raw Data Prevails: Are Large Language Model Embeddings Effective in Numerical Data Representation for Medical Machine Learning Applications?
Figure 2 for When Raw Data Prevails: Are Large Language Model Embeddings Effective in Numerical Data Representation for Medical Machine Learning Applications?
Figure 3 for When Raw Data Prevails: Are Large Language Model Embeddings Effective in Numerical Data Representation for Medical Machine Learning Applications?
Figure 4 for When Raw Data Prevails: Are Large Language Model Embeddings Effective in Numerical Data Representation for Medical Machine Learning Applications?
Viaarxiv icon

Language Models are Surprisingly Fragile to Drug Names in Biomedical Benchmarks

Add code
Jun 17, 2024
Viaarxiv icon

Measuring Pointwise $\mathcal{V}$-Usable Information In-Context-ly

Add code
Oct 18, 2023
Figure 1 for Measuring Pointwise $\mathcal{V}$-Usable Information In-Context-ly
Figure 2 for Measuring Pointwise $\mathcal{V}$-Usable Information In-Context-ly
Figure 3 for Measuring Pointwise $\mathcal{V}$-Usable Information In-Context-ly
Figure 4 for Measuring Pointwise $\mathcal{V}$-Usable Information In-Context-ly
Viaarxiv icon