Max Kaufmann

Self-Regulation and Requesting Interventions

Feb 07, 2025
Via arXiv

Visibility into AI Agents

Feb 04, 2024
Via arXiv

The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A"

Sep 22, 2023
Via arXiv

Taken out of context: On measuring situational awareness in LLMs

Sep 01, 2023
Via arXiv