Jinghan Yao

Training Ultra Long Context Language Model with Fully Pipelined Distributed Transformer

Aug 30, 2024
Via arXiv

Exploiting Inter-Layer Expert Affinity for Accelerating Mixture-of-Experts Model Inference

Jan 17, 2024
Via arXiv

Flover: A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference

May 24, 2023
Via arXiv

SOFT: Softmax-free Transformer with Linear Complexity

Oct 29, 2021
Via arXiv

Single Pixel Reconstruction for One-stage Instance Segmentation

May 17, 2019
Via arXiv