Picture for Ziheng Jiang

Ziheng Jiang

FLUX: Fast Software-based Communication Overlap On GPUs Through Kernel Fusion

Add code
Jun 12, 2024
Figure 1 for FLUX: Fast Software-based Communication Overlap On GPUs Through Kernel Fusion
Figure 2 for FLUX: Fast Software-based Communication Overlap On GPUs Through Kernel Fusion
Figure 3 for FLUX: Fast Software-based Communication Overlap On GPUs Through Kernel Fusion
Figure 4 for FLUX: Fast Software-based Communication Overlap On GPUs Through Kernel Fusion
Viaarxiv icon

MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs

Add code
Feb 23, 2024
Figure 1 for MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs
Figure 2 for MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs
Figure 3 for MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs
Figure 4 for MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs
Viaarxiv icon

Relax: Composable Abstractions for End-to-End Dynamic Machine Learning

Add code
Nov 01, 2023
Viaarxiv icon

Federated Remote Physiological Measurement with Imperfect Data

Add code
Mar 11, 2022
Figure 1 for Federated Remote Physiological Measurement with Imperfect Data
Figure 2 for Federated Remote Physiological Measurement with Imperfect Data
Figure 3 for Federated Remote Physiological Measurement with Imperfect Data
Figure 4 for Federated Remote Physiological Measurement with Imperfect Data
Viaarxiv icon

EfficientPhys: Enabling Simple, Fast and Accurate Camera-Based Vitals Measurement

Add code
Oct 09, 2021
Figure 1 for EfficientPhys: Enabling Simple, Fast and Accurate Camera-Based Vitals Measurement
Figure 2 for EfficientPhys: Enabling Simple, Fast and Accurate Camera-Based Vitals Measurement
Figure 3 for EfficientPhys: Enabling Simple, Fast and Accurate Camera-Based Vitals Measurement
Figure 4 for EfficientPhys: Enabling Simple, Fast and Accurate Camera-Based Vitals Measurement
Viaarxiv icon

Automated Backend-Aware Post-Training Quantization

Add code
Mar 27, 2021
Figure 1 for Automated Backend-Aware Post-Training Quantization
Figure 2 for Automated Backend-Aware Post-Training Quantization
Figure 3 for Automated Backend-Aware Post-Training Quantization
Figure 4 for Automated Backend-Aware Post-Training Quantization
Viaarxiv icon

SplitSR: An End-to-End Approach to Super-Resolution on Mobile Devices

Add code
Jan 20, 2021
Figure 1 for SplitSR: An End-to-End Approach to Super-Resolution on Mobile Devices
Figure 2 for SplitSR: An End-to-End Approach to Super-Resolution on Mobile Devices
Figure 3 for SplitSR: An End-to-End Approach to Super-Resolution on Mobile Devices
Figure 4 for SplitSR: An End-to-End Approach to Super-Resolution on Mobile Devices
Viaarxiv icon

MetaPhys: Unsupervised Few-Shot Adaptation for Non-Contact Physiological Measurement

Add code
Oct 05, 2020
Figure 1 for MetaPhys: Unsupervised Few-Shot Adaptation for Non-Contact Physiological Measurement
Figure 2 for MetaPhys: Unsupervised Few-Shot Adaptation for Non-Contact Physiological Measurement
Figure 3 for MetaPhys: Unsupervised Few-Shot Adaptation for Non-Contact Physiological Measurement
Figure 4 for MetaPhys: Unsupervised Few-Shot Adaptation for Non-Contact Physiological Measurement
Viaarxiv icon

Exploring the Memorization-Generalization Continuum in Deep Learning

Add code
Feb 08, 2020
Figure 1 for Exploring the Memorization-Generalization Continuum in Deep Learning
Figure 2 for Exploring the Memorization-Generalization Continuum in Deep Learning
Figure 3 for Exploring the Memorization-Generalization Continuum in Deep Learning
Figure 4 for Exploring the Memorization-Generalization Continuum in Deep Learning
Viaarxiv icon

Relay: A High-Level IR for Deep Learning

Add code
Apr 17, 2019
Figure 1 for Relay: A High-Level IR for Deep Learning
Figure 2 for Relay: A High-Level IR for Deep Learning
Figure 3 for Relay: A High-Level IR for Deep Learning
Figure 4 for Relay: A High-Level IR for Deep Learning
Viaarxiv icon