Showing 1–2 of 2 results for author: Eberhard, O

Search v0.5.6 released 2020-02-24

arXiv:2405.18100 [pdf, other]

cs.LG math.OC

A Pontryagin Perspective on Reinforcement Learning

Authors: Onno Eberhard, Claire Vernade, Michael Muehlebach

Abstract: Reinforcement learning has traditionally focused on learning state-dependent policies to solve optimal control problems in a closed-loop fashion. In this work, we introduce the paradigm of open-loop reinforcement learning where a fixed action sequence is learned instead. We present three new algorithms: one robust model-based method and two sample-efficient model-free methods. Rather than basing our algorithms on Bellman's equation from dynamic programming, our work builds on Pontryagin's principle from the theory of open-loop optimal control. We provide convergence guarantees and evaluate all methods empirically on a pendulum swing-up task, as well as on two high-dimensional MuJoCo tasks, demonstrating remarkable performance compared to existing baselines.

Submitted 28 May, 2024; originally announced May 2024.

arXiv:2102.04097 [pdf, other]

cs.CL

Effects of Layer Freezing on Transferring a Speech Recognition System to Under-resourced Languages

Authors: Onno Eberhard, Torsten Zesch

Abstract: In this paper, we investigate the effect of layer freezing on the effectiveness of model transfer in the area of automatic speech recognition. We experiment with Mozilla's DeepSpeech architecture on German and Swiss German speech datasets and compare the results of either training from scratch vs. transferring a pre-trained model. We compare different layer freezing schemes and find that even freezing only one layer already significantly improves results.

Submitted 4 October, 2022; v1 submitted 8 February, 2021; originally announced February 2021.

Comments: Published at KONVENS 2021

ACM Class: I.2.7
