Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jan Köhler

The streaming rollout of deep networks - towards fully model-parallel execution

Nov 02, 2018

Volker Fischer, Jan Köhler, Thomas Pfeil

Figure 1 for The streaming rollout of deep networks - towards fully model-parallel execution

Figure 2 for The streaming rollout of deep networks - towards fully model-parallel execution

Figure 3 for The streaming rollout of deep networks - towards fully model-parallel execution

Abstract:Deep neural networks, and in particular recurrent networks, are promising candidates to control autonomous agents that interact in real-time with the physical world. However, this requires a seamless integration of temporal features into the network's architecture. For the training of and inference with recurrent neural networks, they are usually rolled out over time, and different rollouts exist. Conventionally during inference, the layers of a network are computed in a sequential manner resulting in sparse temporal integration of information and long response times. In this study, we present a theoretical framework to describe rollouts, the level of model-parallelization they induce, and demonstrate differences in solving specific tasks. We prove that certain rollouts, also for networks with only skip and no recurrent connections, enable earlier and more frequent responses, and show empirically that these early responses have better performance. The streaming rollout maximizes these properties and enables a fully parallel execution of the network reducing runtime on massively parallel devices. Finally, we provide an open-source toolbox to design, train, evaluate, and interact with streaming rollouts.

* To appear at NIPS 2018

Via

Access Paper or Ask Questions