Skip to main content

Showing 1–5 of 5 results for author: Muškardin, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2306.17204  [pdf, other

    cs.LG cs.FL

    Learning Environment Models with Continuous Stochastic Dynamics

    Authors: Martin Tappler, Edi Muškardin, Bernhard K. Aichernig, Bettina Könighofer

    Abstract: Solving control tasks in complex environments automatically through learning offers great potential. While contemporary techniques from deep reinforcement learning (DRL) provide effective solutions, their decision-making is not transparent. We aim to provide insights into the decisions faced by the agent by learning an automaton model of environmental behavior under the control of an agent. Howeve… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

  2. arXiv:2306.16854  [pdf, other

    cs.LG

    On the Relationship Between RNN Hidden State Vectors and Semantic Ground Truth

    Authors: Edi Muškardin, Martin Tappler, Ingo Pill, Bernhard K. Aichernig, Thomas Pock

    Abstract: We examine the assumption that the hidden-state vectors of recurrent neural networks (RNNs) tend to form clusters of semantically similar vectors, which we dub the clustering hypothesis. While this hypothesis has been assumed in the analysis of RNNs in recent years, its validity has not been studied thoroughly on modern neural network architectures. We examine the clustering hypothesis in the cont… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

  3. arXiv:2212.01838  [pdf, other

    cs.LG cs.LO

    Automata Learning meets Shielding

    Authors: Martin Tappler, Stefan Pranger, Bettina Könighofer, Edi Muškardin, Roderick Bloem, Kim Larsen

    Abstract: Safety is still one of the major research challenges in reinforcement learning (RL). In this paper, we address the problem of how to avoid safety violations of RL agents during exploration in probabilistic and partially unknown environments. Our approach combines automata learning for Markov Decision Processes (MDPs) and shield synthesis in an iterative approach. Initially, the MDP representing th… ▽ More

    Submitted 4 December, 2022; originally announced December 2022.

  4. Active vs. Passive: A Comparison of Automata Learning Paradigms for Network Protocols

    Authors: Bernhard K. Aichernig, Edi Muškardin, Andrea Pferscher

    Abstract: Active automata learning became a popular tool for the behavioral analysis of communication protocols. The main advantage is that no manual modeling effort is required since a behavioral model is automatically inferred from a black-box system. However, several real-world applications of this technique show that the overhead for the establishment of an active interface might hamper the practical ap… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: In Proceedings FMAS2022 ASYDE2022, arXiv:2209.13181

    Journal ref: EPTCS 371, 2022, pp. 1-19

  5. arXiv:2206.11708  [pdf, other

    cs.LG cs.AI

    Reinforcement Learning under Partial Observability Guided by Learned Environment Models

    Authors: Edi Muskardin, Martin Tappler, Bernhard K. Aichernig, Ingo Pill

    Abstract: In practical applications, we can rarely assume full observability of a system's environment, despite such knowledge being important for determining a reactive control system's precise interaction with its environment. Therefore, we propose an approach for reinforcement learning (RL) in partially observable environments. While assuming that the environment behaves like a partially observable Marko… ▽ More

    Submitted 23 June, 2022; originally announced June 2022.