Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Chae Young Lee

Luke

HyperCam: Low-Power Onboard Computer Vision for IoT Cameras

Jan 17, 2025

Chae Young Lee, Pu, Yi, Maxwell Fite, Tejus Rao, Sara Achour, Zerina Kapetanovic

Abstract:We present HyperCam, an energy-efficient image classification pipeline that enables computer vision tasks onboard low-power IoT camera systems. HyperCam leverages hyperdimensional computing to perform training and inference efficiently on low-power microcontrollers. We implement a low-power wireless camera platform using off-the-shelf hardware and demonstrate that HyperCam can achieve an accuracy of 93.60%, 84.06%, 92.98%, and 72.79% for MNIST, Fashion-MNIST, Face Detection, and Face Identification tasks, respectively, while significantly outperforming other classifiers in resource efficiency. Specifically, it delivers inference latency of 0.08-0.27s while using 42.91-63.00KB flash memory and 22.25KB RAM at peak. Among other machine learning classifiers such as SVM, xgBoost, MicroNets, MobileNetV3, and MCUNetV3, HyperCam is the only classifier that achieves competitive accuracy while maintaining competitive memory footprint and inference latency that meets the resource requirements of low-power camera systems.

Via

Access Paper or Ask Questions

CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks

Jun 11, 2020

Youngmin Baek, Daehyun Nam, Sungrae Park, Junyeop Lee, Seung Shin, Jeonghun Baek, Chae Young Lee, Hwalsuk Lee

Figure 1 for CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks

Figure 2 for CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks

Figure 3 for CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks

Figure 4 for CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks

Abstract:Despite the recent success of text detection and recognition methods, existing evaluation metrics fail to provide a fair and reliable comparison among those methods. In addition, there exists no end-to-end evaluation metric that takes characteristics of OCR tasks into account. Previous end-to-end metric contains cascaded errors from the binary scoring process applied in both detection and recognition tasks. Ignoring partially correct results raises a gap between quantitative and qualitative analysis, and prevents fine-grained assessment. Based on the fact that character is a key element of text, we hereby propose a Character-Level Evaluation metric (CLEval). In CLEval, the \textit{instance matching} process handles split and merge detection cases, and the \textit{scoring process} conducts character-level evaluation. By aggregating character-level scores, the CLEval metric provides a fine-grained evaluation of end-to-end results composed of the detection and recognition as well as individual evaluations for each module from the end-performance perspective. We believe that our metrics can play a key role in developing and analyzing state-of-the-art text detection and recognition methods. The evaluation code is publicly available at https://github.com/clovaai/CLEval.

* 12 pages, 8 figures

Via

Access Paper or Ask Questions

TedEval: A Fair Evaluation Metric for Scene Text Detectors

Jul 02, 2019

Chae Young Lee, Youngmin Baek, Hwalsuk Lee

Figure 1 for TedEval: A Fair Evaluation Metric for Scene Text Detectors

Figure 2 for TedEval: A Fair Evaluation Metric for Scene Text Detectors

Figure 3 for TedEval: A Fair Evaluation Metric for Scene Text Detectors

Figure 4 for TedEval: A Fair Evaluation Metric for Scene Text Detectors

Abstract:Despite the recent success of scene text detection methods, common evaluation metrics fail to provide a fair and reliable comparison among detectors. They have obvious drawbacks in reflecting the inherent characteristic of text detection tasks, unable to address issues such as granularity, multiline, and character incompleteness. In this paper, we propose a novel evaluation protocol called TedEval (Text detector Evaluation), which evaluates text detections by an instance-level matching and a character-level scoring. Based on a firm standard rewarding behaviors that result in successful recognition, TedEval can act as a reliable standard for comparing and quantizing the detection quality throughout all difficulty levels. In this regard, we believe that TedEval can play a key role in developing state-of-the-art scene text detectors. The code is publicly available at https://github.com/clovaai/TedEval.

* 7 pages, 10 figures, Accepted by Workshop on Industrial Applications of Document Analysis and Recognition 2019

Via

Access Paper or Ask Questions

Conditional WaveGAN

Sep 27, 2018

Chae Young Lee, Anoop Toffy, Gue Jun Jung, Woo-Jin Han

Abstract:Generative models are successfully used for image synthesis in the recent years. But when it comes to other modalities like audio, text etc little progress has been made. Recent works focus on generating audio from a generative model in an unsupervised setting. We explore the possibility of using generative models conditioned on class labels. Concatenation based conditioning and conditional scaling were explored in this work with various hyper-parameter tuning methods. In this paper we introduce Conditional WaveGANs (cWaveGAN). Find our implementation at https://github.com/acheketa/cwavegan

* Preprint

Via

Access Paper or Ask Questions