Picture for Alexandre Variengien

Alexandre Variengien

BELLS: A Framework Towards Future Proof Benchmarks for the Evaluation of LLM Safeguards

Add code
Jun 03, 2024
Viaarxiv icon

Look Before You Leap: A Universal Emergent Decomposition of Retrieval Tasks in Language Models

Add code
Dec 13, 2023
Viaarxiv icon

How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model

Add code
Apr 30, 2023
Viaarxiv icon

Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small

Add code
Nov 01, 2022
Viaarxiv icon

Towards self-organized control: Using neural cellular automata to robustly control a cart-pole agent

Add code
Jul 12, 2021
Figure 1 for Towards self-organized control: Using neural cellular automata to robustly control a cart-pole agent
Figure 2 for Towards self-organized control: Using neural cellular automata to robustly control a cart-pole agent
Figure 3 for Towards self-organized control: Using neural cellular automata to robustly control a cart-pole agent
Figure 4 for Towards self-organized control: Using neural cellular automata to robustly control a cart-pole agent
Viaarxiv icon

A journey in ESN and LSTM visualisations on a language task

Add code
Dec 13, 2020
Figure 1 for A journey in ESN and LSTM visualisations on a language task
Figure 2 for A journey in ESN and LSTM visualisations on a language task
Figure 3 for A journey in ESN and LSTM visualisations on a language task
Figure 4 for A journey in ESN and LSTM visualisations on a language task
Viaarxiv icon