Picture for Miles Wang

Miles Wang

Tony

FrontierScience: Evaluating AI's Ability to Perform Expert-Level Scientific Tasks

Add code
Jan 29, 2026
Viaarxiv icon

Monitoring Monitorability

Add code
Dec 20, 2025
Figure 1 for Monitoring Monitorability
Figure 2 for Monitoring Monitorability
Figure 3 for Monitoring Monitorability
Figure 4 for Monitoring Monitorability
Viaarxiv icon

OpenAI GPT-5 System Card

Add code
Dec 19, 2025
Viaarxiv icon

Persona Features Control Emergent Misalignment

Add code
Jun 24, 2025
Figure 1 for Persona Features Control Emergent Misalignment
Figure 2 for Persona Features Control Emergent Misalignment
Figure 3 for Persona Features Control Emergent Misalignment
Figure 4 for Persona Features Control Emergent Misalignment
Viaarxiv icon

OpenAI o1 System Card

Add code
Dec 21, 2024
Figure 1 for OpenAI o1 System Card
Figure 2 for OpenAI o1 System Card
Figure 3 for OpenAI o1 System Card
Figure 4 for OpenAI o1 System Card
Viaarxiv icon

GPT-4o System Card

Add code
Oct 25, 2024
Viaarxiv icon

Forbidden Facts: An Investigation of Competing Objectives in Llama-2

Add code
Dec 31, 2023
Viaarxiv icon