Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tej Pandit

TENT: Efficient Quantization of Neural Networks on the tiny Edge with Tapered FixEd PoiNT

Apr 06, 2021

Hamed F. Langroudi, Vedant Karia, Tej Pandit, Dhireesha Kudithipudi

Figure 1 for TENT: Efficient Quantization of Neural Networks on the tiny Edge with Tapered FixEd PoiNT

Figure 2 for TENT: Efficient Quantization of Neural Networks on the tiny Edge with Tapered FixEd PoiNT

Figure 3 for TENT: Efficient Quantization of Neural Networks on the tiny Edge with Tapered FixEd PoiNT

Figure 4 for TENT: Efficient Quantization of Neural Networks on the tiny Edge with Tapered FixEd PoiNT

Abstract:In this research, we propose a new low-precision framework, TENT, to leverage the benefits of a tapered fixed-point numerical format in TinyML models. We introduce a tapered fixed-point quantization algorithm that matches the numerical format's dynamic range and distribution to that of the deep neural network model's parameter distribution at each layer. An accelerator architecture for the tapered fixed-point with TENT framework is proposed. Results show that the accuracy on classification tasks improves up to ~31 % with an energy overhead of ~17-30 % as compared to fixed-point, for ConvNet and ResNet-18 models.

* poster presented at the first tinyML Research Symposium, March 26, 2021

Via

Access Paper or Ask Questions

Deep Learning Inference on Embedded Devices: Fixed-Point vs Posit

May 22, 2018

Seyed H. F. Langroudi, Tej Pandit, Dhireesha Kudithipudi

Figure 1 for Deep Learning Inference on Embedded Devices: Fixed-Point vs Posit

Figure 2 for Deep Learning Inference on Embedded Devices: Fixed-Point vs Posit

Figure 3 for Deep Learning Inference on Embedded Devices: Fixed-Point vs Posit

Figure 4 for Deep Learning Inference on Embedded Devices: Fixed-Point vs Posit

Abstract:Performing the inference step of deep learning in resource constrained environments, such as embedded devices, is challenging. Success requires optimization at both software and hardware levels. Low precision arithmetic and specifically low precision fixed-point number systems have become the standard for performing deep learning inference. However, representing non-uniform data and distributed parameters (e.g. weights) by using uniformly distributed fixed-point values is still a major drawback when using this number system. Recently, the posit number system was proposed, which represents numbers in a non-uniform manner. Therefore, in this paper we are motivated to explore using the posit number system to represent the weights of Deep Convolutional Neural Networks. However, we do not apply any quantization techniques and hence the network weights do not require re-training. The results of this exploration show that using the posit number system outperformed the fixed point number system in terms of accuracy and memory utilization.

Via

Access Paper or Ask Questions