Picture for Shuai Zheng

Shuai Zheng

FlexCare: Leveraging Cross-Task Synergy for Flexible Multimodal Healthcare Prediction

Add code
Jun 17, 2024
Viaarxiv icon

Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapping

Add code
Apr 30, 2024
Figure 1 for Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapping
Figure 2 for Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapping
Figure 3 for Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapping
Figure 4 for Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapping
Viaarxiv icon

DualFluidNet: an Attention-based Dual-pipeline Network for Accurate and Generalizable Fluid-solid Coupled Simulation

Add code
Dec 28, 2023
Viaarxiv icon

Contractive error feedback for gradient compression

Add code
Dec 13, 2023
Viaarxiv icon

DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines

Add code
Nov 17, 2023
Viaarxiv icon

Unleashing the potential of GNNs via Bi-directional Knowledge Transfer

Add code
Oct 26, 2023
Figure 1 for Unleashing the potential of GNNs via Bi-directional Knowledge Transfer
Figure 2 for Unleashing the potential of GNNs via Bi-directional Knowledge Transfer
Figure 3 for Unleashing the potential of GNNs via Bi-directional Knowledge Transfer
Figure 4 for Unleashing the potential of GNNs via Bi-directional Knowledge Transfer
Viaarxiv icon

Vcc: Scaling Transformers to 128K Tokens or More by Prioritizing Important Tokens

Add code
May 07, 2023
Figure 1 for Vcc: Scaling Transformers to 128K Tokens or More by Prioritizing Important Tokens
Figure 2 for Vcc: Scaling Transformers to 128K Tokens or More by Prioritizing Important Tokens
Figure 3 for Vcc: Scaling Transformers to 128K Tokens or More by Prioritizing Important Tokens
Figure 4 for Vcc: Scaling Transformers to 128K Tokens or More by Prioritizing Important Tokens
Viaarxiv icon

Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition

Add code
Apr 10, 2023
Figure 1 for Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition
Figure 2 for Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition
Figure 3 for Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition
Figure 4 for Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition
Viaarxiv icon

Disentangling Orthogonal Planes for Indoor Panoramic Room Layout Estimation with Cross-Scale Distortion Awareness

Add code
Mar 04, 2023
Viaarxiv icon

Decoupled Model Schedule for Deep Learning Training

Add code
Feb 16, 2023
Viaarxiv icon