Picture for Xu Owen He

Xu Owen He

Forgetting Transformer: Softmax Attention with a Forget Gate

Add code
Mar 03, 2025
Viaarxiv icon

TRecViT: A Recurrent Video Transformer

Add code
Dec 18, 2024
Viaarxiv icon

Mixture of A Million Experts

Add code
Jul 04, 2024
Figure 1 for Mixture of A Million Experts
Figure 2 for Mixture of A Million Experts
Figure 3 for Mixture of A Million Experts
Figure 4 for Mixture of A Million Experts
Viaarxiv icon

NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research

Add code
Nov 15, 2022
Viaarxiv icon