Picture for Shigang Li

Shigang Li

Estimating Depth of Monocular Panoramic Image with Teacher-Student Model Fusing Equirectangular and Spherical Representations

Add code
May 27, 2024
Viaarxiv icon

TRANSOM: An Efficient Fault-Tolerant System for Training LLMs

Add code
Oct 18, 2023
Figure 1 for TRANSOM: An Efficient Fault-Tolerant System for Training LLMs
Figure 2 for TRANSOM: An Efficient Fault-Tolerant System for Training LLMs
Figure 3 for TRANSOM: An Efficient Fault-Tolerant System for Training LLMs
Figure 4 for TRANSOM: An Efficient Fault-Tolerant System for Training LLMs
Viaarxiv icon

Co-design Hardware and Algorithm for Vector Search

Add code
Jul 06, 2023
Viaarxiv icon

ASDL: A Unified Interface for Gradient Preconditioning in PyTorch

Add code
May 08, 2023
Viaarxiv icon

An End-to-End Network for Upright Adjustment of Panoramic Images

Add code
Apr 12, 2023
Viaarxiv icon

PipeFisher: Efficient Training of Large Language Models Using Pipelining and Fisher Information Matrices

Add code
Nov 25, 2022
Figure 1 for PipeFisher: Efficient Training of Large Language Models Using Pipelining and Fisher Information Matrices
Figure 2 for PipeFisher: Efficient Training of Large Language Models Using Pipelining and Fisher Information Matrices
Figure 3 for PipeFisher: Efficient Training of Large Language Models Using Pipelining and Fisher Information Matrices
Figure 4 for PipeFisher: Efficient Training of Large Language Models Using Pipelining and Fisher Information Matrices
Viaarxiv icon

Efficient Quantized Sparse Matrix Operations on Tensor Cores

Add code
Sep 14, 2022
Figure 1 for Efficient Quantized Sparse Matrix Operations on Tensor Cores
Figure 2 for Efficient Quantized Sparse Matrix Operations on Tensor Cores
Figure 3 for Efficient Quantized Sparse Matrix Operations on Tensor Cores
Figure 4 for Efficient Quantized Sparse Matrix Operations on Tensor Cores
Viaarxiv icon

HammingMesh: A Network Topology for Large-Scale Deep Learning

Add code
Sep 03, 2022
Figure 1 for HammingMesh: A Network Topology for Large-Scale Deep Learning
Figure 2 for HammingMesh: A Network Topology for Large-Scale Deep Learning
Figure 3 for HammingMesh: A Network Topology for Large-Scale Deep Learning
Figure 4 for HammingMesh: A Network Topology for Large-Scale Deep Learning
Viaarxiv icon

Near-Optimal Sparse Allreduce for Distributed Deep Learning

Add code
Jan 19, 2022
Figure 1 for Near-Optimal Sparse Allreduce for Distributed Deep Learning
Figure 2 for Near-Optimal Sparse Allreduce for Distributed Deep Learning
Figure 3 for Near-Optimal Sparse Allreduce for Distributed Deep Learning
Figure 4 for Near-Optimal Sparse Allreduce for Distributed Deep Learning
Viaarxiv icon

A Data-Centric Optimization Framework for Machine Learning

Add code
Oct 20, 2021
Figure 1 for A Data-Centric Optimization Framework for Machine Learning
Figure 2 for A Data-Centric Optimization Framework for Machine Learning
Figure 3 for A Data-Centric Optimization Framework for Machine Learning
Figure 4 for A Data-Centric Optimization Framework for Machine Learning
Viaarxiv icon