Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Varun Sundar

Generalized Event Cameras

Jul 02, 2024

Varun Sundar, Matthew Dutson, Andrei Ardelean, Claudio Bruschini, Edoardo Charbon, Mohit Gupta

Abstract:Event cameras capture the world at high time resolution and with minimal bandwidth requirements. However, event streams, which only encode changes in brightness, do not contain sufficient scene information to support a wide variety of downstream tasks. In this work, we design generalized event cameras that inherently preserve scene intensity in a bandwidth-efficient manner. We generalize event cameras in terms of when an event is generated and what information is transmitted. To implement our designs, we turn to single-photon sensors that provide digital access to individual photon detections; this modality gives us the flexibility to realize a rich space of generalized event cameras. Our single-photon event cameras are capable of high-speed, high-fidelity imaging at low readout rates. Consequently, these event cameras can support plug-and-play downstream inference, without capturing new event datasets or designing specialized event-vision models. As a practical implication, our designs, which involve lightweight and near-sensor-compatible computations, provide a way to use single-photon sensors without exorbitant bandwidth costs.

* CVPR 2024

Via

Access Paper or Ask Questions

SoDaCam: Software-defined Cameras via Single-Photon Imaging

Sep 08, 2023

Varun Sundar, Andrei Ardelean, Tristan Swedish, Claudio Bruschini, Edoardo Charbon, Mohit Gupta

Figure 1 for SoDaCam: Software-defined Cameras via Single-Photon Imaging

Figure 2 for SoDaCam: Software-defined Cameras via Single-Photon Imaging

Figure 3 for SoDaCam: Software-defined Cameras via Single-Photon Imaging

Figure 4 for SoDaCam: Software-defined Cameras via Single-Photon Imaging

Abstract:Reinterpretable cameras are defined by their post-processing capabilities that exceed traditional imaging. We present "SoDaCam" that provides reinterpretable cameras at the granularity of photons, from photon-cubes acquired by single-photon devices. Photon-cubes represent the spatio-temporal detections of photons as a sequence of binary frames, at frame-rates as high as 100 kHz. We show that simple transformations of the photon-cube, or photon-cube projections, provide the functionality of numerous imaging systems including: exposure bracketing, flutter shutter cameras, video compressive systems, event cameras, and even cameras that move during exposure. Our photon-cube projections offer the flexibility of being software-defined constructs that are only limited by what is computable, and shot-noise. We exploit this flexibility to provide new capabilities for the emulated cameras. As an added benefit, our projections provide camera-dependent compression of photon-cubes, which we demonstrate using an implementation of our projections on a novel compute architecture that is designed for single-photon imaging.

* Accepted at ICCV 2023 (oral). Project webpage can be found at https://wisionlab.com/project/sodacam/

Via

Access Paper or Ask Questions

Single-Photon Structured Light

Apr 11, 2022

Varun Sundar, Sizhuo Ma, Aswin C. Sankaranarayanan, Mohit Gupta

Figure 1 for Single-Photon Structured Light

Figure 2 for Single-Photon Structured Light

Figure 3 for Single-Photon Structured Light

Figure 4 for Single-Photon Structured Light

Abstract:We present a novel structured light technique that uses Single Photon Avalanche Diode (SPAD) arrays to enable 3D scanning at high-frame rates and low-light levels. This technique, called "Single-Photon Structured Light", works by sensing binary images that indicates the presence or absence of photon arrivals during each exposure; the SPAD array is used in conjunction with a high-speed binary projector, with both devices operated at speeds as high as 20~kHz. The binary images that we acquire are heavily influenced by photon noise and are easily corrupted by ambient sources of light. To address this, we develop novel temporal sequences using error correction codes that are designed to be robust to short-range effects like projector and camera defocus as well as resolution mismatch between the two devices. Our lab prototype is capable of 3D imaging in challenging scenarios involving objects with extremely low albedo or undergoing fast motion, as well as scenes under strong ambient illumination.

* Accepted at CVPR 2022 (poster). 26 pages, 23 figures

Via

Access Paper or Ask Questions

[Reproducibility Report] Rigging the Lottery: Making All Tickets Winners

Mar 30, 2021

Varun Sundar, Rajat Vadiraj Dwaraknath

Figure 1 for [Reproducibility Report] Rigging the Lottery: Making All Tickets Winners

Figure 2 for [Reproducibility Report] Rigging the Lottery: Making All Tickets Winners

Figure 3 for [Reproducibility Report] Rigging the Lottery: Making All Tickets Winners

Figure 4 for [Reproducibility Report] Rigging the Lottery: Making All Tickets Winners

Abstract:$\textit{RigL}$, a sparse training algorithm, claims to directly train sparse networks that match or exceed the performance of existing dense-to-sparse training techniques (such as pruning) for a fixed parameter count and compute budget. We implement $\textit{RigL}$ from scratch in Pytorch and reproduce its performance on CIFAR-10 within 0.1% of the reported value. On both CIFAR-10/100, the central claim holds -- given a fixed training budget, $\textit{RigL}$ surpasses existing dynamic-sparse training methods over a range of target sparsities. By training longer, the performance can match or exceed iterative pruning, while consuming constant FLOPs throughout training. We also show that there is little benefit in tuning $\textit{RigL}$'s hyper-parameters for every sparsity, initialization pair -- the reference choice of hyperparameters is often close to optimal performance. Going beyond the original paper, we find that the optimal initialization scheme depends on the training constraint. While the Erdos-Renyi-Kernel distribution outperforms the Uniform distribution for a fixed parameter count, for a fixed FLOP count, the latter performs better. Finally, redistributing layer-wise sparsity while training can bridge the performance gap between the two initialization schemes, but increases computational cost.

* Under review at ML Reproducibility Challenge 2020. Code available at https://github.com/varun19299/rigl-reproducibility. Training plots and other logs available at https://wandb.ai/ml-reprod-2020

Via

Access Paper or Ask Questions

FlatNet: Towards Photorealistic Scene Reconstruction from Lensless Measurements

Oct 29, 2020

Salman S. Khan, Varun Sundar, Vivek Boominathan, Ashok Veeraraghavan, Kaushik Mitra

Figure 1 for FlatNet: Towards Photorealistic Scene Reconstruction from Lensless Measurements

Figure 2 for FlatNet: Towards Photorealistic Scene Reconstruction from Lensless Measurements

Figure 3 for FlatNet: Towards Photorealistic Scene Reconstruction from Lensless Measurements

Figure 4 for FlatNet: Towards Photorealistic Scene Reconstruction from Lensless Measurements

Abstract:Lensless imaging has emerged as a potential solution towards realizing ultra-miniature cameras by eschewing the bulky lens in a traditional camera. Without a focusing lens, the lensless cameras rely on computational algorithms to recover the scenes from multiplexed measurements. However, the current iterative-optimization-based reconstruction algorithms produce noisier and perceptually poorer images. In this work, we propose a non-iterative deep learning based reconstruction approach that results in orders of magnitude improvement in image quality for lensless reconstructions. Our approach, called $\textit{FlatNet}$, lays down a framework for reconstructing high-quality photorealistic images from mask-based lensless cameras, where the camera's forward model formulation is known. FlatNet consists of two stages: (1) an inversion stage that maps the measurement into a space of intermediate reconstruction by learning parameters within the forward model formulation, and (2) a perceptual enhancement stage that improves the perceptual quality of this intermediate reconstruction. These stages are trained together in an end-to-end manner. We show high-quality reconstructions by performing extensive experiments on real and challenging scenes using two different types of lensless prototypes: one which uses a separable forward model and another, which uses a more general non-separable cropped-convolution model. Our end-to-end approach is fast, produces photorealistic reconstructions, and is easy to adopt for other mask-based lensless cameras.

* Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2020. Supplementary material attached. For project website, see https://siddiquesalman.github.io/flatnet/

Via

Access Paper or Ask Questions

Deep Atrous Guided Filter for Image Restoration in Under Display Cameras

Sep 01, 2020

Varun Sundar, Sumanth Hegde, Divya Kothandaraman, Kaushik Mitra

Figure 1 for Deep Atrous Guided Filter for Image Restoration in Under Display Cameras

Figure 2 for Deep Atrous Guided Filter for Image Restoration in Under Display Cameras

Figure 3 for Deep Atrous Guided Filter for Image Restoration in Under Display Cameras

Figure 4 for Deep Atrous Guided Filter for Image Restoration in Under Display Cameras

Abstract:Under Display Cameras present a promising opportunity for phone manufacturers to achieve bezel-free displays by positioning the camera behind semi-transparent OLED screens. Unfortunately, such imaging systems suffer from severe image degradation due to light attenuation and diffraction effects. In this work, we present Deep Atrous Guided Filter (DAGF), a two-stage, end-to-end approach for image restoration in UDC systems. A Low-Resolution Network first restores image quality at low-resolution, which is subsequently used by the Guided Filter Network as a filtering input to produce a high-resolution output. Besides the initial downsampling, our low-resolution network uses multiple, parallel atrous convolutions to preserve spatial resolution and emulates multi-scale processing. Our approach's ability to directly train on megapixel images results in significant performance improvement. We additionally propose a simple simulation scheme to pre-train our model and boost performance. Our overall framework ranks 2nd and 5th in the RLQ-TOD'20 UDC Challenge for POLED and TOLED displays, respectively.

* To appear in ECCV 2020 RLQ Workshop. Supplementary material attached. For project website, see https://varun19299.github.io/deep-atrous-guided-filter/

Via

Access Paper or Ask Questions

UDC 2020 Challenge on Image Restoration of Under-Display Camera: Methods and Results

Aug 18, 2020

Yuqian Zhou, Michael Kwan, Kyle Tolentino, Neil Emerton, Sehoon Lim, Tim Large, Lijiang Fu, Zhihong Pan, Baopu Li, Qirui Yang(+35 more)

Figure 1 for UDC 2020 Challenge on Image Restoration of Under-Display Camera: Methods and Results

Figure 2 for UDC 2020 Challenge on Image Restoration of Under-Display Camera: Methods and Results

Figure 3 for UDC 2020 Challenge on Image Restoration of Under-Display Camera: Methods and Results

Figure 4 for UDC 2020 Challenge on Image Restoration of Under-Display Camera: Methods and Results

Abstract:This paper is the report of the first Under-Display Camera (UDC) image restoration challenge in conjunction with the RLQ workshop at ECCV 2020. The challenge is based on a newly-collected database of Under-Display Camera. The challenge tracks correspond to two types of display: a 4k Transparent OLED (T-OLED) and a phone Pentile OLED (P-OLED). Along with about 150 teams registered the challenge, eight and nine teams submitted the results during the testing phase for each track. The results in the paper are state-of-the-art restoration performance of Under-Display Camera Restoration. Datasets and paper are available at https://yzhouas.github.io/projects/UDC/udc.html.

* 15 pages

Via

Access Paper or Ask Questions