Picture for Max Lamparth

Max Lamparth

Michael Pokorny

Moving Beyond Medical Exam Questions: A Clinician-Annotated Dataset of Real-World Tasks and Ambiguity in Mental Healthcare

Add code
Feb 22, 2025
Viaarxiv icon

Multi-Agent Risks from Advanced AI

Add code
Feb 19, 2025
Viaarxiv icon

Humanity's Last Exam

Add code
Jan 24, 2025
Viaarxiv icon

BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices

Add code
Nov 20, 2024
Figure 1 for BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices
Figure 2 for BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices
Figure 3 for BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices
Figure 4 for BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices
Viaarxiv icon

Measuring Free-Form Decision-Making Inconsistency of Language Models in Military Crisis Simulations

Add code
Oct 17, 2024
Figure 1 for Measuring Free-Form Decision-Making Inconsistency of Language Models in Military Crisis Simulations
Figure 2 for Measuring Free-Form Decision-Making Inconsistency of Language Models in Military Crisis Simulations
Figure 3 for Measuring Free-Form Decision-Making Inconsistency of Language Models in Military Crisis Simulations
Figure 4 for Measuring Free-Form Decision-Making Inconsistency of Language Models in Military Crisis Simulations
Viaarxiv icon

Markovian Agents for Truthful Language Modeling

Add code
Apr 29, 2024
Viaarxiv icon

Human vs. Machine: Language Models and Wargames

Add code
Mar 06, 2024
Figure 1 for Human vs. Machine: Language Models and Wargames
Figure 2 for Human vs. Machine: Language Models and Wargames
Figure 3 for Human vs. Machine: Language Models and Wargames
Figure 4 for Human vs. Machine: Language Models and Wargames
Viaarxiv icon

Escalation Risks from Language Models in Military and Diplomatic Decision-Making

Add code
Jan 07, 2024
Viaarxiv icon

TomOpt: Differential optimisation for task- and constraint-aware design of particle detectors in the context of muon tomography

Add code
Sep 25, 2023
Viaarxiv icon

Analyzing And Editing Inner Mechanisms Of Backdoored Language Models

Add code
Feb 24, 2023
Figure 1 for Analyzing And Editing Inner Mechanisms Of Backdoored Language Models
Figure 2 for Analyzing And Editing Inner Mechanisms Of Backdoored Language Models
Figure 3 for Analyzing And Editing Inner Mechanisms Of Backdoored Language Models
Figure 4 for Analyzing And Editing Inner Mechanisms Of Backdoored Language Models
Viaarxiv icon