Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Sparse approximation in learning via neural ODEs

Feb 26, 2021

Carlos Esteve Yagüe, Borjan Geshkovski

Figure 1 for Sparse approximation in learning via neural ODEs

Figure 2 for Sparse approximation in learning via neural ODEs

Figure 3 for Sparse approximation in learning via neural ODEs

Figure 4 for Sparse approximation in learning via neural ODEs

Share this with someone who'll enjoy it:

Abstract:We consider the continuous-time, neural ordinary differential equation (neural ODE) perspective of deep supervised learning, and study the impact of the final time horizon $T$ in training. We focus on a cost consisting of an integral of the empirical risk over the time interval, and $L^1$--parameter regularization. Under homogeneity assumptions on the dynamics (typical for ReLU activations), we prove that any global minimizer is sparse, in the sense that there exists a positive stopping time $T^*$ beyond which the optimal parameters vanish. Moreover, under appropriate interpolation assumptions on the neural ODE, we provide quantitative estimates of the stopping time $T^\ast$, and of the training error of the trajectories at the stopping time. The latter stipulates a quantitative approximation property of neural ODE flows with sparse parameters. In practical terms, a shorter time-horizon in the training problem can be interpreted as considering a shallower residual neural network (ResNet), and since the optimal parameters are concentrated over a shorter time horizon, such a consideration may lower the computational cost of training without discarding relevant information.

* 24 pages, 4 figures

View paper on

Share this with someone who'll enjoy it:

Title:Sparse approximation in learning via neural ODEs

Paper and Code