Showing 1–2 of 2 results for author: Koops, W

  1. arXiv:2406.00826  [pdf, other]

    cs.LG eess.SY

    Learning-Based Verification of Stochastic Dynamical Systems with Neural Network Policies

    Authors: Thom Badings, Wietze Koops, Sebastian Junges, Nils Jansen

    Abstract: We consider the verification of neural network policies for reach-avoid control tasks in stochastic dynamical systems. We use a verification procedure that trains another neural network, which acts as a certificate proving that the policy satisfies the task. For reach-avoid tasks, it suffices to show that this certificate network is a reach-avoid supermartingale (RASM). As our main contribution, w…

    Submitted 2 June, 2024; originally announced June 2024.

  2. arXiv:2405.05662  [pdf, other]

    cs.AI

    Approximate Dec-POMDP Solving Using Multi-Agent A*

    Authors: Wietze Koops, Sebastian Junges, Nils Jansen

    Abstract: We present an A*-based algorithm to compute policies for finite-horizon Dec-POMDPs. Our goal is to sacrifice optimality in favor of scalability for larger horizons. The main ingredients of our approach are (1) using clustered sliding window memory, (2) pruning the A* search tree, and (3) using novel A* heuristics. Our experiments show competitive performance to the state-of-the-art. Moreover, for…

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 19 pages, 3 figures. Extended version (with appendix) of the paper to appear in IJCAI 2024