Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Onno Eberhard

A Pontryagin Perspective on Reinforcement Learning

May 28, 2024

Onno Eberhard, Claire Vernade, Michael Muehlebach

Figure 1 for A Pontryagin Perspective on Reinforcement Learning

Figure 2 for A Pontryagin Perspective on Reinforcement Learning

Figure 3 for A Pontryagin Perspective on Reinforcement Learning

Figure 4 for A Pontryagin Perspective on Reinforcement Learning

Abstract:Reinforcement learning has traditionally focused on learning state-dependent policies to solve optimal control problems in a closed-loop fashion. In this work, we introduce the paradigm of open-loop reinforcement learning where a fixed action sequence is learned instead. We present three new algorithms: one robust model-based method and two sample-efficient model-free methods. Rather than basing our algorithms on Bellman's equation from dynamic programming, our work builds on Pontryagin's principle from the theory of open-loop optimal control. We provide convergence guarantees and evaluate all methods empirically on a pendulum swing-up task, as well as on two high-dimensional MuJoCo tasks, demonstrating remarkable performance compared to existing baselines.

Via

Access Paper or Ask Questions

Effects of Layer Freezing when Transferring DeepSpeech to New Languages

Feb 08, 2021

Onno Eberhard, Torsten Zesch

Figure 1 for Effects of Layer Freezing when Transferring DeepSpeech to New Languages

Figure 2 for Effects of Layer Freezing when Transferring DeepSpeech to New Languages

Figure 3 for Effects of Layer Freezing when Transferring DeepSpeech to New Languages

Figure 4 for Effects of Layer Freezing when Transferring DeepSpeech to New Languages

Abstract:In this paper, we train Mozilla's DeepSpeech architecture on German and Swiss German speech datasets and compare the results of different training methods. We first train the models from scratch on both languages and then improve upon the results by using an English pretrained version of DeepSpeech for weight initialization and experiment with the effects of freezing different layers during training. We see that even freezing only one layer already improves the results dramatically.

Via

Access Paper or Ask Questions