Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Luca Rigazio

Contrastive Neural Processes for Self-Supervised Learning

Oct 31, 2021

Konstantinos Kallidromitis, Denis Gudovskiy, Kazuki Kozuka, Ohama Iku, Luca Rigazio

Figure 1 for Contrastive Neural Processes for Self-Supervised Learning

Figure 2 for Contrastive Neural Processes for Self-Supervised Learning

Figure 3 for Contrastive Neural Processes for Self-Supervised Learning

Figure 4 for Contrastive Neural Processes for Self-Supervised Learning

Abstract:Recent contrastive methods show significant improvement in self-supervised learning in several domains. In particular, contrastive methods are most effective where data augmentation can be easily constructed e.g. in computer vision. However, they are less successful in domains without established data transformations such as time series data. In this paper, we propose a novel self-supervised learning framework that combines contrastive learning with neural processes. It relies on recent advances in neural processes to perform time series forecasting. This allows to generate augmented versions of data by employing a set of various sampling functions and, hence, avoid manually designed augmentations. We extend conventional neural processes and propose a new contrastive loss to learn times series representations in a self-supervised setup. Therefore, unlike previous self-supervised methods, our augmentation pipeline is task-agnostic, enabling our method to perform well across various applications. In particular, a ResNet with a linear classifier trained using our approach is able to outperform state-of-the-art techniques across industrial, medical and audio datasets improving accuracy over 10% in ECG periodic data. We further demonstrate that our self-supervised representations are more efficient in the latent space, improving multiple clustering indexes and that fine-tuning our method on 10% of labels achieves results competitive to fully-supervised learning.

* 16 pages, 6 figures, ACML 2021

Via

Access Paper or Ask Questions

AutoDO: Robust AutoAugment for Biased Data with Label Noise via Scalable Probabilistic Implicit Differentiation

Mar 11, 2021

Denis Gudovskiy, Luca Rigazio, Shun Ishizaka, Kazuki Kozuka, Sotaro Tsukizawa

Figure 1 for AutoDO: Robust AutoAugment for Biased Data with Label Noise via Scalable Probabilistic Implicit Differentiation

Figure 2 for AutoDO: Robust AutoAugment for Biased Data with Label Noise via Scalable Probabilistic Implicit Differentiation

Figure 3 for AutoDO: Robust AutoAugment for Biased Data with Label Noise via Scalable Probabilistic Implicit Differentiation

Figure 4 for AutoDO: Robust AutoAugment for Biased Data with Label Noise via Scalable Probabilistic Implicit Differentiation

Abstract:AutoAugment has sparked an interest in automated augmentation methods for deep learning models. These methods estimate image transformation policies for train data that improve generalization to test data. While recent papers evolved in the direction of decreasing policy search complexity, we show that those methods are not robust when applied to biased and noisy data. To overcome these limitations, we reformulate AutoAugment as a generalized automated dataset optimization (AutoDO) task that minimizes the distribution shift between test data and distorted train dataset. In our AutoDO model, we explicitly estimate a set of per-point hyperparameters to flexibly change distribution of train data. In particular, we include hyperparameters for augmentation, loss weights, and soft-labels that are jointly estimated using implicit differentiation. We develop a theoretical probabilistic interpretation of this framework using Fisher information and show that its complexity scales linearly with the dataset size. Our experiments on SVHN, CIFAR-10/100, and ImageNet classification show up to 9.3% improvement for biased datasets with label noise compared to prior methods and, importantly, up to 36.6% gain for underrepresented SVHN classes.

* Accepted to CVPR 2021. Preprint

Via

Access Paper or Ask Questions

DoorGym: A Scalable Door Opening Environment And Baseline Agent

Aug 07, 2019

Yusuke Urakami, Alec Hodgkinson, Casey Carlin, Randall Leu, Luca Rigazio, Pieter Abbeel

Figure 1 for DoorGym: A Scalable Door Opening Environment And Baseline Agent

Figure 2 for DoorGym: A Scalable Door Opening Environment And Baseline Agent

Figure 3 for DoorGym: A Scalable Door Opening Environment And Baseline Agent

Figure 4 for DoorGym: A Scalable Door Opening Environment And Baseline Agent

Abstract:Reinforcement Learning (RL) has brought forth ideas of autonomous robots that can navigate real-world environments with ease, aiding humans in a variety of tasks. RL agents have just begun to make their way out of simulation into the real world. Once in the real world, benchmark tasks often fail to transfer into useful skills. We introduce DoorGym, a simulation environment intended to be the first step to move RL from toy environments towards useful atomic skills that can be composed and extended towards a broader goal. DoorGym is an open-source door simulation framework designed to be highly configurable. We also provide a baseline PPO (Proximal Policy Optimization) and SAC (Soft Actor-Critic)implementation, which achieves a success rate of up to 70% for common tasks in this environment. Environment kit available here:https://github.com/PSVL/DoorGym/

* Fixed authors' affiliation. Deleted the redundant expression on page 7

Via

Access Paper or Ask Questions

DNN Feature Map Compression using Learned Representation over GF(2)

Aug 15, 2018

Denis A. Gudovskiy, Alec Hodgkinson, Luca Rigazio

Figure 1 for DNN Feature Map Compression using Learned Representation over GF(2)

Figure 2 for DNN Feature Map Compression using Learned Representation over GF(2)

Figure 3 for DNN Feature Map Compression using Learned Representation over GF(2)

Figure 4 for DNN Feature Map Compression using Learned Representation over GF(2)

Abstract:In this paper, we introduce a method to compress intermediate feature maps of deep neural networks (DNNs) to decrease memory storage and bandwidth requirements during inference. Unlike previous works, the proposed method is based on converting fixed-point activations into vectors over the smallest GF(2) finite field followed by nonlinear dimensionality reduction (NDR) layers embedded into a DNN. Such an end-to-end learned representation finds more compact feature maps by exploiting quantization redundancies within the fixed-point activations along the channel or spatial dimensions. We apply the proposed network architectures derived from modified SqueezeNet and MobileNetV2 to the tasks of ImageNet classification and PASCAL VOC object detection. Compared to prior approaches, the conducted experiments show a factor of 2 decrease in memory requirements with minor degradation in accuracy while adding only bitwise computations.

* CEFRL2018

Via

Access Paper or Ask Questions

TransFlow: Unsupervised Motion Flow by Joint Geometric and Pixel-level Estimation

Oct 30, 2017

Stefano Alletto, Davide Abati, Simone Calderara, Rita Cucchiara, Luca Rigazio

Figure 1 for TransFlow: Unsupervised Motion Flow by Joint Geometric and Pixel-level Estimation

Figure 2 for TransFlow: Unsupervised Motion Flow by Joint Geometric and Pixel-level Estimation

Figure 3 for TransFlow: Unsupervised Motion Flow by Joint Geometric and Pixel-level Estimation

Figure 4 for TransFlow: Unsupervised Motion Flow by Joint Geometric and Pixel-level Estimation

Abstract:We address unsupervised optical flow estimation for ego-centric motion. We argue that optical flow can be cast as a geometrical warping between two successive video frames and devise a deep architecture to estimate such transformation in two stages. First, a dense pixel-level flow is computed with a geometric prior imposing strong spatial constraints. Such prior is typical of driving scenes, where the point of view is coherent with the vehicle motion. We show how such global transformation can be approximated with an homography and how spatial transformer layers can be employed to compute the flow field implied by such transformation. The second stage then refines the prediction feeding a second deeper network. A final reconstruction loss compares the warping of frame X(t) with the subsequent frame X(t+1) and guides both estimates. The model, which we named TransFlow, performs favorably compared to other unsupervised algorithms, and shows better generalization compared to supervised methods with a 3x reduction in error on unseen data.

* We have found a bug in the flow evaluation code compromising the experimental evaluation and the results provided in the paper are no longer correct. We are currently working on a new experimental campaign but we estimate that results will be available in a few weeks and will drastically change the paper, hence the withdraw request

Via

Access Paper or Ask Questions

Path Integral Networks: End-to-End Differentiable Optimal Control

Jun 29, 2017

Masashi Okada, Luca Rigazio, Takenobu Aoshima

Figure 1 for Path Integral Networks: End-to-End Differentiable Optimal Control

Figure 2 for Path Integral Networks: End-to-End Differentiable Optimal Control

Figure 3 for Path Integral Networks: End-to-End Differentiable Optimal Control

Figure 4 for Path Integral Networks: End-to-End Differentiable Optimal Control

Abstract:In this paper, we introduce Path Integral Networks (PI-Net), a recurrent network representation of the Path Integral optimal control algorithm. The network includes both system dynamics and cost models, used for optimal control based planning. PI-Net is fully differentiable, learning both dynamics and cost models end-to-end by back-propagation and stochastic gradient descent. Because of this, PI-Net can learn to plan. PI-Net has several advantages: it can generalize to unseen states thanks to planning, it can be applied to continuous control tasks, and it allows for a wide variety learning schemes, including imitation and reinforcement learning. Preliminary experiment results show that PI-Net, trained by imitation learning, can mimic control demonstrations for two simulated problems; a linear system and a pendulum swing-up problem. We also show that PI-Net is able to learn dynamics and cost models latent in the demonstrations.

Via

Access Paper or Ask Questions

ShiftCNN: Generalized Low-Precision Architecture for Inference of Convolutional Neural Networks

Jun 07, 2017

Denis A. Gudovskiy, Luca Rigazio

Figure 1 for ShiftCNN: Generalized Low-Precision Architecture for Inference of Convolutional Neural Networks

Figure 2 for ShiftCNN: Generalized Low-Precision Architecture for Inference of Convolutional Neural Networks

Figure 3 for ShiftCNN: Generalized Low-Precision Architecture for Inference of Convolutional Neural Networks

Figure 4 for ShiftCNN: Generalized Low-Precision Architecture for Inference of Convolutional Neural Networks

Abstract:In this paper we introduce ShiftCNN, a generalized low-precision architecture for inference of multiplierless convolutional neural networks (CNNs). ShiftCNN is based on a power-of-two weight representation and, as a result, performs only shift and addition operations. Furthermore, ShiftCNN substantially reduces computational cost of convolutional layers by precomputing convolution terms. Such an optimization can be applied to any CNN architecture with a relatively small codebook of weights and allows to decrease the number of product operations by at least two orders of magnitude. The proposed architecture targets custom inference accelerators and can be realized on FPGAs or ASICs. Extensive evaluation on ImageNet shows that the state-of-the-art CNNs can be converted without retraining into ShiftCNN with less than 1% drop in accuracy when the proposed quantization algorithm is employed. RTL simulations, targeting modern FPGAs, show that power consumption of convolutional layers is reduced by a factor of 4 compared to conventional 8-bit fixed-point architectures.

Via

Access Paper or Ask Questions

Similarity Mapping with Enhanced Siamese Network for Multi-Object Tracking

Jan 24, 2017

Minyoung Kim, Stefano Alletto, Luca Rigazio

Figure 1 for Similarity Mapping with Enhanced Siamese Network for Multi-Object Tracking

Figure 2 for Similarity Mapping with Enhanced Siamese Network for Multi-Object Tracking

Figure 3 for Similarity Mapping with Enhanced Siamese Network for Multi-Object Tracking

Figure 4 for Similarity Mapping with Enhanced Siamese Network for Multi-Object Tracking

Abstract:Multi-object tracking has recently become an important area of computer vision, especially for Advanced Driver Assistance Systems (ADAS). Despite growing attention, achieving high performance tracking is still challenging, with state-of-the- art systems resulting in high complexity with a large number of hyper parameters. In this paper, we focus on reducing overall system complexity and the number hyper parameters that need to be tuned to a specific environment. We introduce a novel tracking system based on similarity mapping by Enhanced Siamese Neural Network (ESNN), which accounts for both appearance and geometric information, and is trainable end-to-end. Our system achieves competitive performance in both speed and accuracy on MOT16 challenge, compared to known state-of-the-art methods.

* 1) accepted as a poster presentation at WiML (Women in Machine Learning) workshop 2016, colocated with NIPS 2016 in Barcelona, Spain, 2) accepted as a poster presentation at MLITS (Machine Learning for Intelligent Transportation Systems) Workshop held in conjunction with the NIPS 2016 in Barcelona, Spain

Via

Access Paper or Ask Questions

Towards Deep Neural Network Architectures Robust to Adversarial Examples

Apr 09, 2015

Shixiang Gu, Luca Rigazio

Figure 1 for Towards Deep Neural Network Architectures Robust to Adversarial Examples

Figure 2 for Towards Deep Neural Network Architectures Robust to Adversarial Examples

Figure 3 for Towards Deep Neural Network Architectures Robust to Adversarial Examples

Figure 4 for Towards Deep Neural Network Architectures Robust to Adversarial Examples

Abstract:Recent work has shown deep neural networks (DNNs) to be highly susceptible to well-designed, small perturbations at the input layer, or so-called adversarial examples. Taking images as an example, such distortions are often imperceptible, but can result in 100% mis-classification for a state of the art DNN. We study the structure of adversarial examples and explore network topology, pre-processing and training strategies to improve the robustness of DNNs. We perform various experiments to assess the removability of adversarial examples by corrupting with additional noise and pre-processing with denoising autoencoders (DAEs). We find that DAEs can remove substantial amounts of the adversarial noise. How- ever, when stacking the DAE with the original DNN, the resulting network can again be attacked by new adversarial examples with even smaller distortion. As a solution, we propose Deep Contractive Network, a model with a new end-to-end training procedure that includes a smoothness penalty inspired by the contractive autoencoder (CAE). This increases the network robustness to adversarial examples, without a significant performance penalty.

Via

Access Paper or Ask Questions

Deep Clustered Convolutional Kernels

Mar 06, 2015

Minyoung Kim, Luca Rigazio

Figure 1 for Deep Clustered Convolutional Kernels

Figure 2 for Deep Clustered Convolutional Kernels

Figure 3 for Deep Clustered Convolutional Kernels

Figure 4 for Deep Clustered Convolutional Kernels

Abstract:Deep neural networks have recently achieved state of the art performance thanks to new training algorithms for rapid parameter estimation and new regularization methods to reduce overfitting. However, in practice the network architecture has to be manually set by domain experts, generally by a costly trial and error procedure, which often accounts for a large portion of the final system performance. We view this as a limitation and propose a novel training algorithm that automatically optimizes network architecture, by progressively increasing model complexity and then eliminating model redundancy by selectively removing parameters at training time. For convolutional neural networks, our method relies on iterative split/merge clustering of convolutional kernels interleaved by stochastic gradient descent. We present a training algorithm and experimental results on three different vision tasks, showing improved performance compared to similarly sized hand-crafted architectures.

* JMLR: Workshop and Conference Proceedings 44 (2015) 160-172
* draft

Via

Access Paper or Ask Questions