Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Patrick Hansen

A LiDAR-Guided Framework for Video Enhancement

Mar 15, 2021

Yu Feng, Patrick Hansen, Paul N. Whatmough, Guoyu Lu, Yuhao Zhu

Figure 1 for A LiDAR-Guided Framework for Video Enhancement

Figure 2 for A LiDAR-Guided Framework for Video Enhancement

Figure 3 for A LiDAR-Guided Framework for Video Enhancement

Figure 4 for A LiDAR-Guided Framework for Video Enhancement

Abstract:This paper presents a general framework that simultaneously improves the quality and the execution speed of a range of video enhancement tasks, such as super-sampling, deblurring, and denoising. The key to our framework is a pixel motion estimation algorithm that generates accurate motion from low-quality videos while being computationally very lightweight. Our motion estimation algorithm leverages point cloud information, which is readily available in today's autonomous devices and will only become more common in the future. We demonstrate a generic framework that leverages the motion information to guide high-quality image reconstruction. Experiments show that our framework consistently outperforms the state-of-the-art video enhancement algorithms while improving the execution speed by an order of magnitude.

Via

Access Paper or Ask Questions

ISP4ML: Understanding the Role of Image Signal Processing in Efficient Deep Learning Vision Systems

Nov 25, 2019

Patrick Hansen, Alexey Vilkin, Yury Khrustalev, James Imber, David Hanwell, Matthew Mattina, Paul N. Whatmough

Figure 1 for ISP4ML: Understanding the Role of Image Signal Processing in Efficient Deep Learning Vision Systems

Figure 2 for ISP4ML: Understanding the Role of Image Signal Processing in Efficient Deep Learning Vision Systems

Figure 3 for ISP4ML: Understanding the Role of Image Signal Processing in Efficient Deep Learning Vision Systems

Figure 4 for ISP4ML: Understanding the Role of Image Signal Processing in Efficient Deep Learning Vision Systems

Abstract:Convolutional neural networks (CNNs) are now predominant components in a variety of computer vision (CV) systems. These systems typically include an image signal processor (ISP), even though the ISP is traditionally designed to produce images that look appealing to humans. In CV systems, it is not clear what the role of the ISP is, or if it is even required at all for accurate prediction. In this work, we investigate the efficacy of the ISP in CNN classification tasks, and outline the system-level trade-offs between prediction accuracy and computational cost. To do so, we build software models of a configurable ISP and an imaging sensor in order to train CNNs on ImageNet with a range of different ISP settings and functionality. Results on ImageNet show that an ISP improves accuracy by 4.6%-12.2% on MobileNet architectures of different widths. Results using ResNets demonstrate that these trends also generalize to deeper networks. An ablation study of the various processing stages in a typical ISP reveals that the tone mapper is the most significant stage when operating on high dynamic range (HDR) images, by providing 5.8% average accuracy improvement alone. Overall, the ISP benefits system efficiency because the memory and computational costs of the ISP is minimal compared to the cost of using a larger CNN to achieve the same accuracy.

* 13 pages, 11 figures

Via

Access Paper or Ask Questions

FixyNN: Efficient Hardware for Mobile Computer Vision via Transfer Learning

Feb 27, 2019

Paul N. Whatmough, Chuteng Zhou, Patrick Hansen, Shreyas Kolala Venkataramanaiah, Jae-sun Seo, Matthew Mattina

Figure 1 for FixyNN: Efficient Hardware for Mobile Computer Vision via Transfer Learning

Figure 2 for FixyNN: Efficient Hardware for Mobile Computer Vision via Transfer Learning

Figure 3 for FixyNN: Efficient Hardware for Mobile Computer Vision via Transfer Learning

Figure 4 for FixyNN: Efficient Hardware for Mobile Computer Vision via Transfer Learning

Abstract:The computational demands of computer vision tasks based on state-of-the-art Convolutional Neural Network (CNN) image classification far exceed the energy budgets of mobile devices. This paper proposes FixyNN, which consists of a fixed-weight feature extractor that generates ubiquitous CNN features, and a conventional programmable CNN accelerator which processes a dataset-specific CNN. Image classification models for FixyNN are trained end-to-end via transfer learning, with the common feature extractor representing the transfered part, and the programmable part being learnt on the target dataset. Experimental results demonstrate FixyNN hardware can achieve very high energy efficiencies up to 26.6 TOPS/W ($4.81 \times$ better than iso-area programmable accelerator). Over a suite of six datasets we trained models via transfer learning with an accuracy loss of $<1\%$ resulting in up to 11.2 TOPS/W - nearly $2 \times$ more efficient than a conventional programmable CNN accelerator of the same area.

* 10 pages, 8 figures, paper accepted at SysML2019 conference

Via

Access Paper or Ask Questions

Energy Efficient Hardware for On-Device CNN Inference via Transfer Learning

Dec 04, 2018

Paul Whatmough, Chuteng Zhou, Patrick Hansen, Matthew Mattina

Figure 1 for Energy Efficient Hardware for On-Device CNN Inference via Transfer Learning

Figure 2 for Energy Efficient Hardware for On-Device CNN Inference via Transfer Learning

Figure 3 for Energy Efficient Hardware for On-Device CNN Inference via Transfer Learning

Abstract:On-device CNN inference for real-time computer vision applications can result in computational demands that far exceed the energy budgets of mobile devices. This paper proposes FixyNN, a co-designed hardware accelerator platform which splits a CNN model into two parts: a set of layers that are fixed in the hardware platform as a front-end fixed-weight feature extractor, and the remaining layers which become a back-end classifier running on a conventional programmable CNN accelerator. The common front-end provides ubiquitous CNN features for all FixyNN models, while the back-end is programmable and specific to a given dataset. Image classification models for FixyNN are trained end-to-end via transfer learning, with front-end layers fixed for the shared feature extractor, and back-end layers fine-tuned for a specific task. Over a suite of six datasets, we trained models via transfer learning with an accuracy loss of <1%, resulting in a FixyNN hardware platform with nearly 2 times better energy efficiency than a conventional programmable CNN accelerator of the same silicon area (i.e. hardware cost).

* 4 pages, 2 figures, NeurIPS 2018 on-device ML workshop

Via

Access Paper or Ask Questions