Aryan Shrivastava

Moving Beyond Medical Exam Questions: A Clinician-Annotated Dataset of Real-World Tasks and Ambiguity in Mental Healthcare

Feb 22, 2025
Via arXiv

Measuring Free-Form Decision-Making Inconsistency of Language Models in Military Crisis Simulations

Oct 17, 2024
Via arXiv