Manas Joglekar

OpenAI o1 System Card

Dec 21, 2024
Via arXiv

Deliberative Alignment: Reasoning Enables Safer Language Models

Dec 20, 2024
Via arXiv

Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision

Dec 14, 2023
Via arXiv