This work introduces a new training and compression pipeline to build Nested Sparse ConvNets, a class of dynamic Convolutional Neural Networks (ConvNets) suited for inference tasks deployed on resource-constrained devices at the edge of the Internet of Things. A Nested Sparse ConvNet consists of a single ConvNet architecture containing N sparse sub-networks with nested weight subsets, like a Matryoshka doll, and can trade accuracy for latency at run time, using the model sparsity as a dynamic knob. To attain high accuracy at training time, we propose a gradient masking technique that optimally routes the learning signals across the nested weight subsets. To minimize the storage footprint and efficiently process the obtained models at inference time, we introduce a new sparse matrix compression format with dedicated compute kernels that exploit the characteristics of the nested weight subsets. Tested on image classification and object detection tasks on an off-the-shelf ARM Cortex-M7 Microcontroller Unit (MCU), Nested Sparse ConvNets outperform variable-latency solutions naively built by assembling single sparse models trained as stand-alone instances, achieving (i) comparable accuracy, (ii) remarkable storage savings, and (iii) high performance. Moreover, when compared to state-of-the-art dynamic strategies, such as dynamic pruning and layer width scaling, Nested Sparse ConvNets prove Pareto-optimal in the accuracy vs. latency space.
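To make the nested-subset idea concrete, the sketch below is a minimal, hypothetical illustration (not the paper's actual pipeline): it builds N magnitude-based binary masks that are nested by construction, uses the mask index as the run-time accuracy/latency knob, and shows how mask multiplication routes each sub-network's gradients only to the weights it uses. All names here (`NestedSparseLinear`, the sparsity levels, the loss) are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class NestedSparseLinear(nn.Module):
    """Toy layer with N nested binary masks over a shared weight tensor:
    the non-zeros of the sparsest mask are a subset of the next denser
    mask, and so on (the Matryoshka structure described above)."""

    def __init__(self, in_features, out_features, sparsities=(0.9, 0.7, 0.5)):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(out_features, in_features) * 0.02)
        # Rank weights by magnitude once; keeping the top-(1 - s) fraction
        # for each sparsity s yields masks that are nested by construction.
        scores = self.weight.detach().abs()
        masks = torch.stack(
            [(scores >= torch.quantile(scores, s)).float() for s in sparsities]
        )
        self.register_buffer("masks", masks)  # buffers receive no gradients

    def forward(self, x, level):
        # 'level' is the run-time knob: lower levels select sparser (faster)
        # subsets, higher levels denser (more accurate) ones.
        return F.linear(x, self.weight * self.masks[level])


def train_step(layer, x, y, optimizer):
    # Multiplying by the mask zeroes the gradient of every pruned weight,
    # so each backward pass routes its learning signal only to the subset
    # used by that level; looping over levels trains all sub-networks jointly.
    optimizer.zero_grad()
    for level in range(layer.masks.shape[0]):
        F.mse_loss(layer(x, level), y).backward()
    optimizer.step()
```

For instance, `train_step(layer, torch.randn(8, 64), torch.randn(8, 10), torch.optim.SGD(layer.parameters(), lr=0.1))` with `layer = NestedSparseLinear(64, 10)` runs one joint update over all three sub-networks; the paper's gradient masking refines how these per-level signals are weighted and combined.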