Skip to main content

Showing 1–6 of 6 results for author: Paischer, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2307.05591  [pdf, other

    cs.CV cs.CL cs.LG

    Linear Alignment of Vision-language Models for Image Captioning

    Authors: Fabian Paischer, Markus Hofmarcher, Sepp Hochreiter, Thomas Adler

    Abstract: Recently, vision-language models like CLIP have advanced the state of the art in a variety of multi-modal tasks including image captioning and caption evaluation. Many approaches adapt CLIP-style models to a downstream task by training a map** network between CLIP and a language model. This is costly as it usually involves calculating gradients for large models. We propose a more efficient train… ▽ More

    Submitted 6 February, 2024; v1 submitted 10 July, 2023; originally announced July 2023.

    Comments: 8 pages (+ references and appendix)

  2. arXiv:2306.14884  [pdf, other

    cs.LG cs.AI

    Learning to Modulate pre-trained Models in RL

    Authors: Thomas Schmied, Markus Hofmarcher, Fabian Paischer, Razvan Pascanu, Sepp Hochreiter

    Abstract: Reinforcement Learning (RL) has been successful in various domains like robotics, game playing, and simulation. While RL agents have shown impressive capabilities in their specific tasks, they insufficiently adapt to new tasks. In supervised learning, this adaptation problem is addressed by large-scale pre-training followed by fine-tuning to new down-stream tasks. Recently, pre-training on multipl… ▽ More

    Submitted 27 October, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: 10 pages (+ references and appendix), Code: https://github.com/ml-jku/L2M

  3. arXiv:2306.09312  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Semantic HELM: A Human-Readable Memory for Reinforcement Learning

    Authors: Fabian Paischer, Thomas Adler, Markus Hofmarcher, Sepp Hochreiter

    Abstract: Reinforcement learning agents deployed in the real world often have to cope with partially observable environments. Therefore, most agents employ memory mechanisms to approximate the state of the environment. Recently, there have been impressive success stories in mastering partially observable environments, mostly in the realm of computer games like Dota 2, StarCraft II, or MineCraft. However, ex… ▽ More

    Submitted 27 October, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: To appear at NeurIPS 2023, 10 pages (+ references and appendix), Code: https://github.com/ml-jku/helm

  4. arXiv:2207.05742  [pdf, other

    cs.LG cs.AI

    Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning

    Authors: Christian Steinparz, Thomas Schmied, Fabian Paischer, Marius-Constantin Dinu, Vihang Patil, Angela Bitto-Nemling, Hamid Eghbal-zadeh, Sepp Hochreiter

    Abstract: In lifelong learning, an agent learns throughout its entire life without resets, in a constantly changing environment, as we humans do. Consequently, lifelong learning comes with a plethora of research problems such as continual domain shifts, which result in non-stationary rewards and environment dynamics. These non-stationarities are difficult to detect and cope with due to their continuous natu… ▽ More

    Submitted 22 September, 2022; v1 submitted 12 July, 2022; originally announced July 2022.

    Comments: CoLLAs 2022

  5. arXiv:2205.12258  [pdf, other

    cs.LG cs.CL stat.ML

    History Compression via Language Models in Reinforcement Learning

    Authors: Fabian Paischer, Thomas Adler, Vihang Patil, Angela Bitto-Nemling, Markus Holzleitner, Sebastian Lehner, Hamid Eghbal-zadeh, Sepp Hochreiter

    Abstract: In a partially observable Markov decision process (POMDP), an agent typically uses a representation of the past to approximate the underlying MDP. We propose to utilize a frozen Pretrained Language Transformer (PLT) for history representation and compression to improve sample efficiency. To avoid training of the Transformer, we introduce FrozenHopfield, which automatically associates observations… ▽ More

    Submitted 21 February, 2023; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: ICML 2022

  6. WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models

    Authors: Benjamin Minixhofer, Fabian Paischer, Navid Rekabsaz

    Abstract: Large pretrained language models (LMs) have become the central building block of many NLP applications. Training these models requires ever more computational resources and most of the existing models are trained on English text only. It is exceedingly expensive to train these models in other languages. To alleviate this problem, we introduce a novel method -- called WECHSEL -- to efficiently and… ▽ More

    Submitted 4 May, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: NAACL 2022