Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Aayan Kumar

MinUn: Accurate ML Inference on Microcontrollers

Oct 29, 2022

Shikhar Jaiswal, Rahul Kiran Kranti Goli, Aayan Kumar, Vivek Seshadri, Rahul Sharma

Figure 1 for MinUn: Accurate ML Inference on Microcontrollers

Figure 2 for MinUn: Accurate ML Inference on Microcontrollers

Figure 3 for MinUn: Accurate ML Inference on Microcontrollers

Figure 4 for MinUn: Accurate ML Inference on Microcontrollers

Abstract:Running machine learning inference on tiny devices, known as TinyML, is an emerging research area. This task requires generating inference code that uses memory frugally, a task that standard ML frameworks are ill-suited for. A deployment framework for TinyML must be a) parametric in the number representation to take advantage of the emerging representations like posits, b) carefully assign high-precision to a few tensors so that most tensors can be kept in low-precision while still maintaining model accuracy, and c) avoid memory fragmentation. We describe MinUn, the first TinyML framework that holistically addresses these issues to generate efficient code for ARM microcontrollers (e.g., Arduino Uno, Due and STM32H747) that outperforms the prior TinyML frameworks.

Via

Access Paper or Ask Questions

MAFIA: Machine Learning Acceleration on FPGAs for IoT Applications

Jul 08, 2021

Nikhil Pratap Ghanathe, Vivek Seshadri, Rahul Sharma, Steve Wilton, Aayan Kumar

Figure 1 for MAFIA: Machine Learning Acceleration on FPGAs for IoT Applications

Figure 2 for MAFIA: Machine Learning Acceleration on FPGAs for IoT Applications

Figure 3 for MAFIA: Machine Learning Acceleration on FPGAs for IoT Applications

Figure 4 for MAFIA: Machine Learning Acceleration on FPGAs for IoT Applications

Abstract:Recent breakthroughs in ML have produced new classes of models that allow ML inference to run directly on milliwatt-powered IoT devices. On one hand, existing ML-to-FPGA compilers are designed for deep neural-networks on large FPGAs. On the other hand, general-purpose HLS tools fail to exploit properties specific to ML inference, thereby resulting in suboptimal performance. We propose MAFIA, a tool to compile ML inference on small form-factor FPGAs for IoT applications. MAFIA provides native support for linear algebra operations and can express a variety of ML algorithms, including state-of-the-art models. We show that MAFIA-generated programs outperform best-performing variant of a commercial HLS compiler by 2.5x on average.

* Accepted at The International Conference on Field-Programmable Logic and Applications (FPL), 2021

Via

Access Paper or Ask Questions