Tom Dupré la Tour

Persona Features Control Emergent Misalignment

Jun 24, 2025

Scaling and evaluating sparse autoencoders

Jun 06, 2024

Benchopt: Reproducible, efficient and collaborative optimization benchmarks

Jun 28, 2022