Picture for Druv Pai

Druv Pai

Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs

Add code
Oct 17, 2024
Figure 1 for Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs
Figure 2 for Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs
Figure 3 for Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs
Figure 4 for Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs
Viaarxiv icon

A Global Geometric Analysis of Maximal Coding Rate Reduction

Add code
Jun 04, 2024
Figure 1 for A Global Geometric Analysis of Maximal Coding Rate Reduction
Figure 2 for A Global Geometric Analysis of Maximal Coding Rate Reduction
Figure 3 for A Global Geometric Analysis of Maximal Coding Rate Reduction
Figure 4 for A Global Geometric Analysis of Maximal Coding Rate Reduction
Viaarxiv icon

Scaling White-Box Transformers for Vision

Add code
Jun 03, 2024
Viaarxiv icon

Masked Completion via Structured Diffusion with White-Box Transformers

Add code
Apr 03, 2024
Viaarxiv icon

White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is?

Add code
Nov 24, 2023
Viaarxiv icon

Emergence of Segmentation with Minimalistic White-Box Transformers

Add code
Aug 30, 2023
Figure 1 for Emergence of Segmentation with Minimalistic White-Box Transformers
Figure 2 for Emergence of Segmentation with Minimalistic White-Box Transformers
Figure 3 for Emergence of Segmentation with Minimalistic White-Box Transformers
Figure 4 for Emergence of Segmentation with Minimalistic White-Box Transformers
Viaarxiv icon

White-Box Transformers via Sparse Rate Reduction

Add code
Jun 01, 2023
Figure 1 for White-Box Transformers via Sparse Rate Reduction
Figure 2 for White-Box Transformers via Sparse Rate Reduction
Figure 3 for White-Box Transformers via Sparse Rate Reduction
Figure 4 for White-Box Transformers via Sparse Rate Reduction
Viaarxiv icon

Representation Learning via Manifold Flattening and Reconstruction

Add code
May 12, 2023
Viaarxiv icon

Closed-Loop Transcription via Convolutional Sparse Coding

Add code
Feb 18, 2023
Figure 1 for Closed-Loop Transcription via Convolutional Sparse Coding
Figure 2 for Closed-Loop Transcription via Convolutional Sparse Coding
Figure 3 for Closed-Loop Transcription via Convolutional Sparse Coding
Figure 4 for Closed-Loop Transcription via Convolutional Sparse Coding
Viaarxiv icon

Pursuit of a Discriminative Representation for Multiple Subspaces via Sequential Games

Add code
Jun 18, 2022
Figure 1 for Pursuit of a Discriminative Representation for Multiple Subspaces via Sequential Games
Figure 2 for Pursuit of a Discriminative Representation for Multiple Subspaces via Sequential Games
Figure 3 for Pursuit of a Discriminative Representation for Multiple Subspaces via Sequential Games
Figure 4 for Pursuit of a Discriminative Representation for Multiple Subspaces via Sequential Games
Viaarxiv icon