Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Lapo Faggi

PARTIME: Scalable and Parallel Processing Over Time with Deep Neural Networks

Oct 17, 2022

Enrico Meloni, Lapo Faggi, Simone Marullo, Alessandro Betti, Matteo Tiezzi, Marco Gori, Stefano Melacci

Figure 1 for PARTIME: Scalable and Parallel Processing Over Time with Deep Neural Networks

Figure 2 for PARTIME: Scalable and Parallel Processing Over Time with Deep Neural Networks

Figure 3 for PARTIME: Scalable and Parallel Processing Over Time with Deep Neural Networks

Figure 4 for PARTIME: Scalable and Parallel Processing Over Time with Deep Neural Networks

Abstract:In this paper, we present PARTIME, a software library written in Python and based on PyTorch, designed specifically to speed up neural networks whenever data is continuously streamed over time, for both learning and inference. Existing libraries are designed to exploit data-level parallelism, assuming that samples are batched, a condition that is not naturally met in applications that are based on streamed data. Differently, PARTIME starts processing each data sample at the time in which it becomes available from the stream. PARTIME wraps the code that implements a feed-forward multi-layer network and it distributes the layer-wise processing among multiple devices, such as Graphics Processing Units (GPUs). Thanks to its pipeline-based computational scheme, PARTIME allows the devices to perform computations in parallel. At inference time this results in scaling capabilities that are theoretically linear with respect to the number of devices. During the learning stage, PARTIME can leverage the non-i.i.d. nature of the streamed data with samples that are smoothly evolving over time for efficient gradient computations. Experiments are performed in order to empirically compare PARTIME with classic non-parallel neural computations in online learning, distributing operations on up to 8 NVIDIA GPUs, showing significant speedups that are almost linear in the number of devices, mitigating the impact of the data transfer overhead.

* 9 pages, accepted at International Conference on Machine Learning and Applications

Via

Access Paper or Ask Questions

Stochastic Coherence Over Attention Trajectory For Continuous Learning In Video Streams

Apr 26, 2022

Matteo Tiezzi, Simone Marullo, Lapo Faggi, Enrico Meloni, Alessandro Betti, Stefano Melacci

Figure 1 for Stochastic Coherence Over Attention Trajectory For Continuous Learning In Video Streams

Figure 2 for Stochastic Coherence Over Attention Trajectory For Continuous Learning In Video Streams

Figure 3 for Stochastic Coherence Over Attention Trajectory For Continuous Learning In Video Streams

Figure 4 for Stochastic Coherence Over Attention Trajectory For Continuous Learning In Video Streams

Abstract:Devising intelligent agents able to live in an environment and learn by observing the surroundings is a longstanding goal of Artificial Intelligence. From a bare Machine Learning perspective, challenges arise when the agent is prevented from leveraging large fully-annotated dataset, but rather the interactions with supervisory signals are sparsely distributed over space and time. This paper proposes a novel neural-network-based approach to progressively and autonomously develop pixel-wise representations in a video stream. The proposed method is based on a human-like attention mechanism that allows the agent to learn by observing what is moving in the attended locations. Spatio-temporal stochastic coherence along the attention trajectory, paired with a contrastive term, leads to an unsupervised learning criterion that naturally copes with the considered setting. Differently from most existing works, the learned representations are used in open-set class-incremental classification of each frame pixel, relying on few supervisions. Our experiments leverage 3D virtual environments and they show that the proposed agents can learn to distinguish objects just by observing the video stream. Inheriting features from state-of-the art models is not as powerful as one might expect.

* Accepted for publication in the 31st International Joint Conference on Artificial Intelligence (IJCAI-ECAI 2022)

Via

Access Paper or Ask Questions

Evaluating Continual Learning Algorithms by Generating 3D Virtual Environments

Sep 16, 2021

Enrico Meloni, Alessandro Betti, Lapo Faggi, Simone Marullo, Matteo Tiezzi, Stefano Melacci

Figure 1 for Evaluating Continual Learning Algorithms by Generating 3D Virtual Environments

Figure 2 for Evaluating Continual Learning Algorithms by Generating 3D Virtual Environments

Figure 3 for Evaluating Continual Learning Algorithms by Generating 3D Virtual Environments

Figure 4 for Evaluating Continual Learning Algorithms by Generating 3D Virtual Environments

Abstract:Continual learning refers to the ability of humans and animals to incrementally learn over time in a given environment. Trying to simulate this learning process in machines is a challenging task, also due to the inherent difficulty in creating conditions for designing continuously evolving dynamics that are typical of the real-world. Many existing research works usually involve training and testing of virtual agents on datasets of static images or short videos, considering sequences of distinct learning tasks. However, in order to devise continual learning algorithms that operate in more realistic conditions, it is fundamental to gain access to rich, fully customizable and controlled experimental playgrounds. Focussing on the specific case of vision, we thus propose to leverage recent advances in 3D virtual environments in order to approach the automatic generation of potentially life-long dynamic scenes with photo-realistic appearance. Scenes are composed of objects that move along variable routes with different and fully customizable timings, and randomness can also be included in their evolution. A novel element of this paper is that scenes are described in a parametric way, thus allowing the user to fully control the visual complexity of the input stream the agent perceives. These general principles are concretely implemented exploiting a recently published 3D virtual environment. The user can generate scenes without the need of having strong skills in computer graphics, since all the generation facilities are exposed through a simple high-level Python interface. We publicly share the proposed generator.

* 8 pages, 7 figures, accepted at the 1st International Workshop on Continual Semi-Supervised Learning (CSSL) @ IJCAI 2021

Via

Access Paper or Ask Questions

Wave Propagation of Visual Stimuli in Focus of Attention

Jun 19, 2020

Lapo Faggi, Alessandro Betti, Dario Zanca, Stefano Melacci, Marco Gori

Figure 1 for Wave Propagation of Visual Stimuli in Focus of Attention

Figure 2 for Wave Propagation of Visual Stimuli in Focus of Attention

Figure 3 for Wave Propagation of Visual Stimuli in Focus of Attention

Figure 4 for Wave Propagation of Visual Stimuli in Focus of Attention

Abstract:Fast reactions to changes in the surrounding visual environment require efficient attention mechanisms to reallocate computational resources to most relevant locations in the visual field. While current computational models keep improving their predictive ability thanks to the increasing availability of data, they still struggle approximating the effectiveness and efficiency exhibited by foveated animals. In this paper, we present a biologically-plausible computational model of focus of attention that exhibits spatiotemporal locality and that is very well-suited for parallel and distributed implementations. Attention emerges as a wave propagation process originated by visual stimuli corresponding to details and motion information. The resulting field obeys the principle of "inhibition of return" so as not to get stuck in potential holes. An accurate experimentation of the model shows that it achieves top level performance in scanpath prediction tasks. This can easily be understood at the light of a theoretical result that we establish in the paper, where we prove that as the velocity of wave propagation goes to infinity, the proposed model reduces to recently proposed state of the art gravitational models of focus of attention.

Via

Access Paper or Ask Questions