Skip to main content

Showing 1–2 of 2 results for author: Eberhard, O

.
  1. arXiv:2405.18100  [pdf, other

    cs.LG math.OC

    A Pontryagin Perspective on Reinforcement Learning

    Authors: Onno Eberhard, Claire Vernade, Michael Muehlebach

    Abstract: Reinforcement learning has traditionally focused on learning state-dependent policies to solve optimal control problems in a closed-loop fashion. In this work, we introduce the paradigm of open-loop reinforcement learning where a fixed action sequence is learned instead. We present three new algorithms: one robust model-based method and two sample-efficient model-free methods. Rather than basing o… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  2. arXiv:2102.04097  [pdf, other

    cs.CL

    Effects of Layer Freezing on Transferring a Speech Recognition System to Under-resourced Languages

    Authors: Onno Eberhard, Torsten Zesch

    Abstract: In this paper, we investigate the effect of layer freezing on the effectiveness of model transfer in the area of automatic speech recognition. We experiment with Mozilla's DeepSpeech architecture on German and Swiss German speech datasets and compare the results of either training from scratch vs. transferring a pre-trained model. We compare different layer freezing schemes and find that even free… ▽ More

    Submitted 4 October, 2022; v1 submitted 8 February, 2021; originally announced February 2021.

    Comments: Published at KONVENS 2021

    ACM Class: I.2.7