Picture for Vardan Papyan

Vardan Papyan

Attention Sinks and Outlier Features: A 'Catch, Tag, and Release' Mechanism for Embeddings

Add code
Feb 02, 2025
Viaarxiv icon

Transformer Alignment in Large Language Models

Add code
Jul 10, 2024
Figure 1 for Transformer Alignment in Large Language Models
Figure 2 for Transformer Alignment in Large Language Models
Figure 3 for Transformer Alignment in Large Language Models
Figure 4 for Transformer Alignment in Large Language Models
Viaarxiv icon

Sparsest Models Elude Pruning: An Exposé of Pruning's Current Capabilities

Add code
Jul 04, 2024
Viaarxiv icon

A False Sense of Safety: Unsafe Information Leakage in 'Safe' AI Responses

Add code
Jul 02, 2024
Viaarxiv icon

Linguistic Collapse: Neural Collapse in (Large) Language Models

Add code
May 28, 2024
Viaarxiv icon

Pushing Boundaries: Mixup's Influence on Neural Collapse

Add code
Feb 09, 2024
Figure 1 for Pushing Boundaries: Mixup's Influence on Neural Collapse
Figure 2 for Pushing Boundaries: Mixup's Influence on Neural Collapse
Figure 3 for Pushing Boundaries: Mixup's Influence on Neural Collapse
Figure 4 for Pushing Boundaries: Mixup's Influence on Neural Collapse
Viaarxiv icon

Residual Alignment: Uncovering the Mechanisms of Residual Networks

Add code
Jan 17, 2024
Viaarxiv icon

Out of the Ordinary: Spectrally Adapting Regression for Covariate Shift

Add code
Dec 29, 2023
Viaarxiv icon

LLM Censorship: A Machine Learning Challenge or a Computer Security Problem?

Add code
Jul 20, 2023
Viaarxiv icon

Neural Collapse Under MSE Loss: Proximity to and Dynamics on the Central Path

Add code
Jun 03, 2021
Figure 1 for Neural Collapse Under MSE Loss: Proximity to and Dynamics on the Central Path
Figure 2 for Neural Collapse Under MSE Loss: Proximity to and Dynamics on the Central Path
Figure 3 for Neural Collapse Under MSE Loss: Proximity to and Dynamics on the Central Path
Figure 4 for Neural Collapse Under MSE Loss: Proximity to and Dynamics on the Central Path
Viaarxiv icon