Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Cheng Gu

The Spatio-Temporal Poisson Point Process: A Simple Model for the Alignment of Event Camera Data

Jun 13, 2021

Cheng Gu, Erik Learned-Miller, Daniel Sheldon, Guillermo Gallego, Pia Bideau

Figure 1 for The Spatio-Temporal Poisson Point Process: A Simple Model for the Alignment of Event Camera Data

Figure 2 for The Spatio-Temporal Poisson Point Process: A Simple Model for the Alignment of Event Camera Data

Figure 3 for The Spatio-Temporal Poisson Point Process: A Simple Model for the Alignment of Event Camera Data

Figure 4 for The Spatio-Temporal Poisson Point Process: A Simple Model for the Alignment of Event Camera Data

Abstract:Event cameras, inspired by biological vision systems, provide a natural and data efficient representation of visual information. Visual information is acquired in the form of events that are triggered by local brightness changes. Each pixel location of the camera's sensor records events asynchronously and independently with very high temporal resolution. However, because most brightness changes are triggered by relative motion of the camera and the scene, the events recorded at a single sensor location seldom correspond to the same world point. To extract meaningful information from event cameras, it is helpful to register events that were triggered by the same underlying world point. In this work we propose a new model of event data that captures its natural spatio-temporal structure. We start by developing a model for aligned event data. That is, we develop a model for the data as though it has been perfectly registered already. In particular, we model the aligned data as a spatio-temporal Poisson point process. Based on this model, we develop a maximum likelihood approach to registering events that are not yet aligned. That is, we find transformations of the observed events that make them as likely as possible under our model. In particular we extract the camera rotation that leads to the best event alignment. We show new state of the art accuracy for rotational velocity estimation on the DAVIS 240C dataset. In addition, our method is also faster and has lower computational complexity than several competing methods.

Via

Access Paper or Ask Questions

Complementing Representation Deficiency in Few-shot Image Classification: A Meta-Learning Approach

Jul 21, 2020

Xian Zhong, Cheng Gu, Wenxin Huang, Lin Li, Shuqin Chen, Chia-Wen Lin

Figure 1 for Complementing Representation Deficiency in Few-shot Image Classification: A Meta-Learning Approach

Figure 2 for Complementing Representation Deficiency in Few-shot Image Classification: A Meta-Learning Approach

Figure 3 for Complementing Representation Deficiency in Few-shot Image Classification: A Meta-Learning Approach

Figure 4 for Complementing Representation Deficiency in Few-shot Image Classification: A Meta-Learning Approach

Abstract:Few-shot learning is a challenging problem that has attracted more and more attention recently since abundant training samples are difficult to obtain in practical applications. Meta-learning has been proposed to address this issue, which focuses on quickly adapting a predictor as a base-learner to new tasks, given limited labeled samples. However, a critical challenge for meta-learning is the representation deficiency since it is hard to discover common information from a small number of training samples or even one, as is the representation of key features from such little information. As a result, a meta-learner cannot be trained well in a high-dimensional parameter space to generalize to new tasks. Existing methods mostly resort to extracting less expressive features so as to avoid the representation deficiency. Aiming at learning better representations, we propose a meta-learning approach with complemented representations network (MCRNet) for few-shot image classification. In particular, we embed a latent space, where latent codes are reconstructed with extra representation information to complement the representation deficiency. Furthermore, the latent space is established with variational inference, collaborating well with different base-learners, and can be extended to other models. Finally, our end-to-end framework achieves the state-of-the-art performance in image classification on three standard few-shot learning datasets.

* 25th International Conference on Pattern Recognition (ICPR2020)

Via

Access Paper or Ask Questions