Picture for Jongsoo Park

Jongsoo Park

Jack

Context Parallelism for Scalable Million-Token Inference

Add code
Nov 04, 2024
Viaarxiv icon

The Llama 3 Herd of Models

Add code
Jul 31, 2024
Viaarxiv icon

Wukong: Towards a Scaling Law for Large-Scale Recommendation

Add code
Mar 08, 2024
Viaarxiv icon

Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large-Scale Recommendation

Add code
Mar 07, 2024
Figure 1 for Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large-Scale Recommendation
Figure 2 for Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large-Scale Recommendation
Figure 3 for Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large-Scale Recommendation
Figure 4 for Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large-Scale Recommendation
Viaarxiv icon

MTrainS: Improving DLRM training efficiency using heterogeneous memories

Add code
Apr 19, 2023
Figure 1 for MTrainS: Improving DLRM training efficiency using heterogeneous memories
Figure 2 for MTrainS: Improving DLRM training efficiency using heterogeneous memories
Figure 3 for MTrainS: Improving DLRM training efficiency using heterogeneous memories
Figure 4 for MTrainS: Improving DLRM training efficiency using heterogeneous memories
Viaarxiv icon

Shared Microexponents: A Little Shifting Goes a Long Way

Add code
Feb 16, 2023
Figure 1 for Shared Microexponents: A Little Shifting Goes a Long Way
Figure 2 for Shared Microexponents: A Little Shifting Goes a Long Way
Figure 3 for Shared Microexponents: A Little Shifting Goes a Long Way
Figure 4 for Shared Microexponents: A Little Shifting Goes a Long Way
Viaarxiv icon

RecD: Deduplication for End-to-End Deep Learning Recommendation Model Training Infrastructure

Add code
Nov 14, 2022
Viaarxiv icon

DHEN: A Deep and Hierarchical Ensemble Network for Large-Scale Click-Through Rate Prediction

Add code
Mar 11, 2022
Figure 1 for DHEN: A Deep and Hierarchical Ensemble Network for Large-Scale Click-Through Rate Prediction
Figure 2 for DHEN: A Deep and Hierarchical Ensemble Network for Large-Scale Click-Through Rate Prediction
Figure 3 for DHEN: A Deep and Hierarchical Ensemble Network for Large-Scale Click-Through Rate Prediction
Figure 4 for DHEN: A Deep and Hierarchical Ensemble Network for Large-Scale Click-Through Rate Prediction
Viaarxiv icon

Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale

Add code
May 26, 2021
Figure 1 for Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale
Figure 2 for Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale
Figure 3 for Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale
Figure 4 for Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale
Viaarxiv icon

Alternate Model Growth and Pruning for Efficient Training of Recommendation Systems

Add code
May 04, 2021
Figure 1 for Alternate Model Growth and Pruning for Efficient Training of Recommendation Systems
Figure 2 for Alternate Model Growth and Pruning for Efficient Training of Recommendation Systems
Figure 3 for Alternate Model Growth and Pruning for Efficient Training of Recommendation Systems
Figure 4 for Alternate Model Growth and Pruning for Efficient Training of Recommendation Systems
Viaarxiv icon