Picture for Xianyan Jia

Xianyan Jia

TAP: Accelerating Large-Scale DNN Training Through Tensor Automatic Parallelisation

Add code
Feb 01, 2023
Viaarxiv icon

M6-10T: A Sharing-Delinking Paradigm for Efficient Multi-Trillion Parameter Pretraining

Add code
Oct 25, 2021
Figure 1 for M6-10T: A Sharing-Delinking Paradigm for Efficient Multi-Trillion Parameter Pretraining
Figure 2 for M6-10T: A Sharing-Delinking Paradigm for Efficient Multi-Trillion Parameter Pretraining
Figure 3 for M6-10T: A Sharing-Delinking Paradigm for Efficient Multi-Trillion Parameter Pretraining
Figure 4 for M6-10T: A Sharing-Delinking Paradigm for Efficient Multi-Trillion Parameter Pretraining
Viaarxiv icon

Exploring Sparse Expert Models and Beyond

Add code
Jun 14, 2021
Figure 1 for Exploring Sparse Expert Models and Beyond
Figure 2 for Exploring Sparse Expert Models and Beyond
Figure 3 for Exploring Sparse Expert Models and Beyond
Figure 4 for Exploring Sparse Expert Models and Beyond
Viaarxiv icon

M6: A Chinese Multimodal Pretrainer

Add code
Mar 02, 2021
Figure 1 for M6: A Chinese Multimodal Pretrainer
Figure 2 for M6: A Chinese Multimodal Pretrainer
Figure 3 for M6: A Chinese Multimodal Pretrainer
Figure 4 for M6: A Chinese Multimodal Pretrainer
Viaarxiv icon

Highly Scalable Deep Learning Training System with Mixed-Precision: Training ImageNet in Four Minutes

Add code
Jul 30, 2018
Figure 1 for Highly Scalable Deep Learning Training System with Mixed-Precision: Training ImageNet in Four Minutes
Figure 2 for Highly Scalable Deep Learning Training System with Mixed-Precision: Training ImageNet in Four Minutes
Figure 3 for Highly Scalable Deep Learning Training System with Mixed-Precision: Training ImageNet in Four Minutes
Viaarxiv icon

BigDL: A Distributed Deep Learning Framework for Big Data

Add code
Jun 25, 2018
Figure 1 for BigDL: A Distributed Deep Learning Framework for Big Data
Figure 2 for BigDL: A Distributed Deep Learning Framework for Big Data
Figure 3 for BigDL: A Distributed Deep Learning Framework for Big Data
Figure 4 for BigDL: A Distributed Deep Learning Framework for Big Data
Viaarxiv icon

Sentiment Analysis for Twitter : Going Beyond Tweet Text

Add code
Nov 29, 2016
Figure 1 for Sentiment Analysis for Twitter : Going Beyond Tweet Text
Figure 2 for Sentiment Analysis for Twitter : Going Beyond Tweet Text
Figure 3 for Sentiment Analysis for Twitter : Going Beyond Tweet Text
Figure 4 for Sentiment Analysis for Twitter : Going Beyond Tweet Text
Viaarxiv icon