The inference of neural networks on edge devices is usually restricted by limited resources (e.g., computing power, memory, bandwidth). In addition to improving hardware design and deploying efficient models, it is possible to aggregate the computing power of many devices to run large machine learning models. In this paper, we propose a novel method that exploits model parallelism to partition a neural network for distributed inference. To achieve a better balance among communication latency, computation latency, and performance, we adopt neural architecture search (NAS) to find the best transmission policy and reduce the amount of communication. The best model we found reduces the amount of transmitted data by 86.6% compared to the baseline with little impact on performance. Under proper device specifications and model configurations, our experiments show that the inference of large neural networks on edge clusters can be distributed and accelerated, providing a new solution for deploying intelligent applications in the Internet of Things (IoT).
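As a rough illustration of the underlying idea (not the paper's implementation), the sketch below partitions a small convolutional network into two sequential stages, as a model-parallel split between two edge devices would, and measures the intermediate activation that would have to be transmitted between them. The split point, layer sizes, and input shape are all hypothetical choices for this example.

```python
# Minimal sketch: split a small CNN into two stages and measure the tensor
# that would cross the network between two edge devices. All layers and the
# split point are illustrative, not the architecture used in the paper.
import torch
import torch.nn as nn

# Stage 1, assumed to run on the first edge device.
stage1 = nn.Sequential(
    nn.Conv2d(3, 16, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.MaxPool2d(2),  # 32x32 -> 16x16
)

# Stage 2, assumed to run on the second edge device.
stage2 = nn.Sequential(
    nn.Conv2d(16, 32, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.AdaptiveAvgPool2d(1),
    nn.Flatten(),
    nn.Linear(32, 10),
)

x = torch.randn(1, 3, 32, 32)   # dummy input image
activation = stage1(x)          # computed on device 1

# The intermediate activation is what must be transmitted; its size is the
# quantity a transmission policy (e.g., found via NAS) would try to shrink,
# for instance by moving the split point or compressing the tensor.
payload_bytes = activation.numel() * activation.element_size()
print(f"intermediate tensor {tuple(activation.shape)}: {payload_bytes} bytes")

logits = stage2(activation)     # computed on device 2
print(f"output logits shape: {tuple(logits.shape)}")
```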