Picture for Alan Cooney

Alan Cooney

Practical challenges of control monitoring in frontier AI deployments

Add code
Dec 15, 2025
Viaarxiv icon

Async Control: Stress-testing Asynchronous Control Measures for LLM Agents

Add code
Dec 15, 2025
Viaarxiv icon

Chain of Thought Monitorability: A New and Fragile Opportunity for AI Safety

Add code
Jul 15, 2025
Figure 1 for Chain of Thought Monitorability: A New and Fragile Opportunity for AI Safety
Viaarxiv icon

RepliBench: Evaluating the autonomous replication capabilities of language model agents

Add code
Apr 21, 2025
Viaarxiv icon

Summing Up the Facts: Additive Mechanisms Behind Factual Recall in LLMs

Add code
Feb 11, 2024
Figure 1 for Summing Up the Facts: Additive Mechanisms Behind Factual Recall in LLMs
Figure 2 for Summing Up the Facts: Additive Mechanisms Behind Factual Recall in LLMs
Figure 3 for Summing Up the Facts: Additive Mechanisms Behind Factual Recall in LLMs
Figure 4 for Summing Up the Facts: Additive Mechanisms Behind Factual Recall in LLMs
Viaarxiv icon