Max Lamparth

BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices

Nov 20, 2024
Via arXiv

Measuring Free-Form Decision-Making Inconsistency of Language Models in Military Crisis Simulations

Oct 17, 2024
Via arXiv

Markovian Agents for Truthful Language Modeling

Apr 29, 2024
Via arXiv

Human vs. Machine: Language Models and Wargames

Mar 06, 2024
Via arXiv

Escalation Risks from Language Models in Military and Diplomatic Decision-Making

Jan 07, 2024
Via arXiv

TomOpt: Differential optimisation for task- and constraint-aware design of particle detectors in the context of muon tomography

Sep 25, 2023
Via arXiv

Analyzing And Editing Inner Mechanisms Of Backdoored Language Models

Feb 24, 2023
Via arXiv

Virgo: Scalable Unsupervised Classification of Cosmological Shock Waves

Aug 17, 2022
Via arXiv