Jianyu Huang

Jack

Context Parallelism for Scalable Million-Token Inference

Nov 04, 2024

The Llama 3 Herd of Models

Jul 31, 2024

PVF (Parameter Vulnerability Factor): A Quantitative Metric Measuring AI Vulnerability and Resilience Against Parameter Corruptions

May 02, 2024

Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale

May 26, 2021

High-performance, Distributed Training of Large-scale Deep Learning Recommendation Models

Apr 15, 2021

FBGEMM: Enabling High-Performance Low-Precision Deep Learning Inference

Jan 13, 2021

Mixed-Precision Embedding Using a Cache

Oct 23, 2020

A Study of BFLOAT16 for Deep Learning Training

Jun 13, 2019

Deep Learning Recommendation Model for Personalization and Recommendation Systems

May 31, 2019