Ann-Kathrin Dombrowski

The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

Mar 06, 2024
Via arXiv

Representation Engineering: A Top-Down Approach to AI Transparency

Oct 10, 2023
Via arXiv

Diffeomorphic Counterfactuals with Generative Models

Jun 16, 2022
Via arXiv

Automated Dissipation Control for Turbulence Simulation with Shell Models

Jan 07, 2022
Via arXiv

Towards Robust Explanations for Deep Neural Networks

Dec 18, 2020
Via arXiv

Fairwashing Explanations with Off-Manifold Detergent

Jul 20, 2020
Via arXiv

Explanations can be manipulated and geometry is to blame

Jun 19, 2019
Via arXiv

CNN Cascades for Segmenting Whole Slide Images of the Kidney

Aug 01, 2017
Via arXiv