Picture for Alex Mallen

Alex Mallen

Subversion Strategy Eval: Evaluating AI's stateless strategic capabilities against control protocols

Add code
Dec 17, 2024
Viaarxiv icon

Balancing Label Quantity and Quality for Scalable Elicitation

Add code
Oct 17, 2024
Viaarxiv icon

Automatically Interpreting Millions of Features in Large Language Models

Add code
Oct 17, 2024
Viaarxiv icon

Neural Networks Learn Statistics of Increasing Complexity

Add code
Feb 13, 2024
Viaarxiv icon

Eliciting Latent Knowledge from Quirky Language Models

Add code
Dec 02, 2023
Viaarxiv icon

Representation Engineering: A Top-Down Approach to AI Transparency

Add code
Oct 10, 2023
Figure 1 for Representation Engineering: A Top-Down Approach to AI Transparency
Figure 2 for Representation Engineering: A Top-Down Approach to AI Transparency
Figure 3 for Representation Engineering: A Top-Down Approach to AI Transparency
Figure 4 for Representation Engineering: A Top-Down Approach to AI Transparency
Viaarxiv icon

When Not to Trust Language Models: Investigating Effectiveness and Limitations of Parametric and Non-Parametric Memories

Add code
Dec 20, 2022
Viaarxiv icon

Koopman-theoretic Approach for Identification of Exogenous Anomalies in Nonstationary Time-series Data

Add code
Sep 18, 2022
Figure 1 for Koopman-theoretic Approach for Identification of Exogenous Anomalies in Nonstationary Time-series Data
Figure 2 for Koopman-theoretic Approach for Identification of Exogenous Anomalies in Nonstationary Time-series Data
Figure 3 for Koopman-theoretic Approach for Identification of Exogenous Anomalies in Nonstationary Time-series Data
Figure 4 for Koopman-theoretic Approach for Identification of Exogenous Anomalies in Nonstationary Time-series Data
Viaarxiv icon

Deep Probabilistic Koopman: Long-term time-series forecasting under periodic uncertainties

Add code
Jun 10, 2021
Figure 1 for Deep Probabilistic Koopman: Long-term time-series forecasting under periodic uncertainties
Figure 2 for Deep Probabilistic Koopman: Long-term time-series forecasting under periodic uncertainties
Figure 3 for Deep Probabilistic Koopman: Long-term time-series forecasting under periodic uncertainties
Figure 4 for Deep Probabilistic Koopman: Long-term time-series forecasting under periodic uncertainties
Viaarxiv icon