Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Poor Man's Training on MCUs: A Memory-Efficient Quantized Back-Propagation-Free Approach

Nov 07, 2024

Yequan Zhao, Hai Li, Ian Young, Zheng Zhang

Figure 1 for Poor Man's Training on MCUs: A Memory-Efficient Quantized Back-Propagation-Free Approach

Figure 2 for Poor Man's Training on MCUs: A Memory-Efficient Quantized Back-Propagation-Free Approach

Figure 3 for Poor Man's Training on MCUs: A Memory-Efficient Quantized Back-Propagation-Free Approach

Figure 4 for Poor Man's Training on MCUs: A Memory-Efficient Quantized Back-Propagation-Free Approach

Share this with someone who'll enjoy it:

Abstract:Back propagation (BP) is the default solution for gradient computation in neural network training. However, implementing BP-based training on various edge devices such as FPGA, microcontrollers (MCUs), and analog computing platforms face multiple major challenges, such as the lack of hardware resources, long time-to-market, and dramatic errors in a low-precision setting. This paper presents a simple BP-free training scheme on an MCU, which makes edge training hardware design as easy as inference hardware design. We adopt a quantized zeroth-order method to estimate the gradients of quantized model parameters, which can overcome the error of a straight-through estimator in a low-precision BP scheme. We further employ a few dimension reduction methods (e.g., node perturbation, sparse training) to improve the convergence of zeroth-order training. Experiment results show that our BP-free training achieves comparable performance as BP-based training on adapting a pre-trained image classifier to various corrupted data on resource-constrained edge devices (e.g., an MCU with 1024-KB SRAM for dense full-model training, or an MCU with 256-KB SRAM for sparse training). This method is most suitable for application scenarios where memory cost and time-to-market are the major concerns, but longer latency can be tolerated.

View paper on

Share this with someone who'll enjoy it:

Title:Poor Man's Training on MCUs: A Memory-Efficient Quantized Back-Propagation-Free Approach

Paper and Code