Picture for Shenggan Cheng

Shenggan Cheng

DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers

Add code
Mar 15, 2024
Viaarxiv icon

AutoChunk: Automated Activation Chunk for Memory-Efficient Long Sequence Inference

Add code
Jan 19, 2024
Figure 1 for AutoChunk: Automated Activation Chunk for Memory-Efficient Long Sequence Inference
Figure 2 for AutoChunk: Automated Activation Chunk for Memory-Efficient Long Sequence Inference
Figure 3 for AutoChunk: Automated Activation Chunk for Memory-Efficient Long Sequence Inference
Figure 4 for AutoChunk: Automated Activation Chunk for Memory-Efficient Long Sequence Inference
Viaarxiv icon

FastFold: Reducing AlphaFold Training Time from 11 Days to 67 Hours

Add code
Mar 04, 2022
Figure 1 for FastFold: Reducing AlphaFold Training Time from 11 Days to 67 Hours
Figure 2 for FastFold: Reducing AlphaFold Training Time from 11 Days to 67 Hours
Figure 3 for FastFold: Reducing AlphaFold Training Time from 11 Days to 67 Hours
Figure 4 for FastFold: Reducing AlphaFold Training Time from 11 Days to 67 Hours
Viaarxiv icon