Picture for Alexander Lyzhov

Alexander Lyzhov

Steering Without Side Effects: Improving Post-Deployment Control of Language Models

Add code
Jun 21, 2024
Viaarxiv icon

Inverse Scaling: When Bigger Isn't Better

Add code
Jun 15, 2023
Viaarxiv icon

Normative Disagreement as a Challenge for Cooperative AI

Add code
Nov 27, 2021
Figure 1 for Normative Disagreement as a Challenge for Cooperative AI
Figure 2 for Normative Disagreement as a Challenge for Cooperative AI
Figure 3 for Normative Disagreement as a Challenge for Cooperative AI
Figure 4 for Normative Disagreement as a Challenge for Cooperative AI
Viaarxiv icon

Greedy Policy Search: A Simple Baseline for Learnable Test-Time Augmentation

Add code
Feb 21, 2020
Figure 1 for Greedy Policy Search: A Simple Baseline for Learnable Test-Time Augmentation
Figure 2 for Greedy Policy Search: A Simple Baseline for Learnable Test-Time Augmentation
Figure 3 for Greedy Policy Search: A Simple Baseline for Learnable Test-Time Augmentation
Figure 4 for Greedy Policy Search: A Simple Baseline for Learnable Test-Time Augmentation
Viaarxiv icon

Pitfalls of In-Domain Uncertainty Estimation and Ensembling in Deep Learning

Add code
Feb 15, 2020
Figure 1 for Pitfalls of In-Domain Uncertainty Estimation and Ensembling in Deep Learning
Figure 2 for Pitfalls of In-Domain Uncertainty Estimation and Ensembling in Deep Learning
Figure 3 for Pitfalls of In-Domain Uncertainty Estimation and Ensembling in Deep Learning
Figure 4 for Pitfalls of In-Domain Uncertainty Estimation and Ensembling in Deep Learning
Viaarxiv icon