Picture for Kazuki Irie

Kazuki Irie

Key-value memory in the brain

Add code
Jan 06, 2025
Figure 1 for Key-value memory in the brain
Figure 2 for Key-value memory in the brain
Figure 3 for Key-value memory in the brain
Viaarxiv icon

Why Are Positional Encodings Nonessential for Deep Autoregressive Transformers? Revisiting a Petroglyph

Add code
Dec 31, 2024
Figure 1 for Why Are Positional Encodings Nonessential for Deep Autoregressive Transformers? Revisiting a Petroglyph
Viaarxiv icon

Neural networks that overcome classic challenges through practice

Add code
Oct 14, 2024
Figure 1 for Neural networks that overcome classic challenges through practice
Figure 2 for Neural networks that overcome classic challenges through practice
Figure 3 for Neural networks that overcome classic challenges through practice
Viaarxiv icon

MoEUT: Mixture-of-Experts Universal Transformers

Add code
May 25, 2024
Viaarxiv icon

Dissecting the Interplay of Attention Paths in a Statistical Mechanics Theory of Transformers

Add code
May 24, 2024
Viaarxiv icon

SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention

Add code
Dec 14, 2023
Figure 1 for SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention
Figure 2 for SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention
Figure 3 for SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention
Figure 4 for SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention
Viaarxiv icon

Automating Continual Learning

Add code
Dec 01, 2023
Viaarxiv icon

Practical Computational Power of Linear Transformers and Their Recurrent and Self-Referential Extensions

Add code
Oct 24, 2023
Viaarxiv icon

Approximating Two-Layer Feedforward Networks for Efficient Transformers

Add code
Oct 23, 2023
Viaarxiv icon

Exploring the Promise and Limits of Real-Time Recurrent Learning

Add code
May 30, 2023
Figure 1 for Exploring the Promise and Limits of Real-Time Recurrent Learning
Figure 2 for Exploring the Promise and Limits of Real-Time Recurrent Learning
Figure 3 for Exploring the Promise and Limits of Real-Time Recurrent Learning
Figure 4 for Exploring the Promise and Limits of Real-Time Recurrent Learning
Viaarxiv icon