Picture for Björn Deiseroth

Björn Deiseroth

SCAR: Sparse Conditioned Autoencoders for Concept Detection and Steering in LLMs

Add code
Nov 11, 2024
Viaarxiv icon

u-$μ$P: The Unit-Scaled Maximal Update Parametrization

Add code
Jul 24, 2024
Viaarxiv icon

T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings

Add code
Jun 27, 2024
Figure 1 for T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings
Figure 2 for T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings
Figure 3 for T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings
Figure 4 for T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings
Viaarxiv icon

Mechanistic Design and Scaling of Hybrid Architectures

Add code
Mar 26, 2024
Figure 1 for Mechanistic Design and Scaling of Hybrid Architectures
Figure 2 for Mechanistic Design and Scaling of Hybrid Architectures
Figure 3 for Mechanistic Design and Scaling of Hybrid Architectures
Figure 4 for Mechanistic Design and Scaling of Hybrid Architectures
Viaarxiv icon

Divergent Token Metrics: Measuring degradation to prune away LLM components -- and optimize quantization

Add code
Nov 13, 2023
Viaarxiv icon

MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation

Add code
May 24, 2023
Viaarxiv icon

AtMan: Understanding Transformer Predictions Through Memory Efficient Attention Manipulation

Add code
Jan 23, 2023
Viaarxiv icon

M-VADER: A Model for Diffusion with Multimodal Context

Add code
Dec 07, 2022
Viaarxiv icon

Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion Models

Add code
Nov 19, 2022
Viaarxiv icon

Speaking Multiple Languages Affects the Moral Bias of Language Models

Add code
Nov 14, 2022
Viaarxiv icon