Skip to main content

Showing 1–11 of 11 results for author: Haussmann, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.04088  [pdf, other

    cs.LG

    Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning

    Authors: Abdullah Akgül, Manuel Haußmann, Melih Kandemir

    Abstract: Current approaches to model-based offline Reinforcement Learning (RL) often incorporate uncertainty-based reward penalization to address the distributional shift problem. While these approaches have achieved some success, we argue that this penalization introduces excessive conservatism, potentially resulting in suboptimal policies through underestimation. We identify as an important cause of over… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  2. arXiv:2402.05758  [pdf, other

    cs.LG stat.ML

    Latent variable model for high-dimensional point process with structured missingness

    Authors: Maksim Sinelnikov, Manuel Haussmann, Harri Lähdesmäki

    Abstract: Longitudinal data are important in numerous fields, such as healthcare, sociology and seismology, but real-world datasets present notable challenges for practitioners because they can be high-dimensional, contain structured missingness patterns, and measurement time points can be governed by an unknown stochastic process. While various solutions have been suggested, the majority of them have been… ▽ More

    Submitted 28 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

  3. arXiv:2311.03002  [pdf, other

    cs.LG stat.ML

    Estimating treatment effects from single-arm trials via latent-variable modeling

    Authors: Manuel Haussmann, Tran Minh Son Le, Viivi Halla-aho, Samu Kurki, Jussi V. Leinonen, Miika Koskinen, Samuel Kaski, Harri Lähdesmäki

    Abstract: Randomized controlled trials (RCTs) are the accepted standard for treatment effect estimation but they can be infeasible due to ethical reasons and prohibitive costs. Single-arm trials, where all patients belong to the treatment group, can be a viable alternative but require access to an external control group. We propose an identifiable deep latent-variable model for this scenario that can also a… ▽ More

    Submitted 4 March, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: Published at the 27th International Conference on Artificial Intelligence and Statistics (AISTATS) 2024

  4. arXiv:2306.10915  [pdf, other

    stat.ML cs.LG

    Practical Equivariances via Relational Conditional Neural Processes

    Authors: Daolang Huang, Manuel Haussmann, Ulpu Remes, ST John, Grégoire Clarté, Kevin Sebastian Luck, Samuel Kaski, Luigi Acerbi

    Abstract: Conditional Neural Processes (CNPs) are a class of metalearning models popular for combining the runtime efficiency of amortized inference with reliable uncertainty quantification. Many relevant machine learning tasks, such as in spatio-temporal modeling, Bayesian Optimization and continuous control, inherently contain equivariances -- for example to translation -- which the model can exploit for… ▽ More

    Submitted 5 November, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: 38 pages, 8 figures. Accepted at the 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  5. arXiv:2301.12776  [pdf, other

    cs.LG stat.ML

    PAC-Bayesian Soft Actor-Critic Learning

    Authors: Bahareh Tasdighi, Abdullah Akgül, Manuel Haussmann, Kenny Kazimirzak Brink, Melih Kandemir

    Abstract: Actor-critic algorithms address the dual goals of reinforcement learning (RL), policy evaluation and improvement via two separate function approximators. The practicality of this approach comes at the expense of training instability, caused mainly by the destructive effect of the approximation errors of the critic on the actor. We tackle this bottleneck by employing an existing Probably Approximat… ▽ More

    Submitted 10 June, 2024; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: 19 pages, 2 figures

  6. arXiv:2106.01216  [pdf, other

    cs.LG

    Evidential Turing Processes

    Authors: Melih Kandemir, Abdullah Akgül, Manuel Haussmann, Gozde Unal

    Abstract: A probabilistic classifier with reliable predictive uncertainties i) fits successfully to the target domain data, ii) provides calibrated class probabilities in difficult regions of the target domain (e.g.\ class overlap), and iii) accurately identifies queries coming out of the target domain and rejects them. We introduce an original combination of Evidential Deep Learning, Neural Processes, and… ▽ More

    Submitted 8 March, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: accepted at ICLR2022; camera ready version

  7. Understanding Event-Generation Networks via Uncertainties

    Authors: Marco Bellagente, Manuel Haußmann, Michel Luchmann, Tilman Plehn

    Abstract: Following the growing success of generative neural networks in LHC simulations, the crucial question is how to control the networks and assign uncertainties to their event output. We show how Bayesian normalizing flow or invertible networks capture uncertainties from the training and turn them into an uncertainty on the event weight. Fundamentally, the interplay between density and uncertainty est… ▽ More

    Submitted 1 October, 2021; v1 submitted 9 April, 2021; originally announced April 2021.

    Comments: 24 pages

    Journal ref: SciPost Phys. 13, 003 (2022)

  8. arXiv:2006.09914  [pdf, other

    cs.LG stat.ML

    Learning Partially Known Stochastic Dynamics with Empirical PAC Bayes

    Authors: Manuel Haussmann, Sebastian Gerwinn, Andreas Look, Barbara Rakitsch, Melih Kandemir

    Abstract: Neural Stochastic Differential Equations model a dynamical environment with neural nets assigned to their drift and diffusion terms. The high expressive power of their nonlinearity comes at the expense of instability in the identification of the large set of free parameters. This paper presents a recipe to improve the prediction accuracy of such models in three steps: i) accounting for epistemic u… ▽ More

    Submitted 26 February, 2021; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: Accepted at AISTATS 2021

  9. arXiv:1906.11471  [pdf, other

    stat.ML cs.LG

    Deep Active Learning with Adaptive Acquisition

    Authors: Manuel Haussmann, Fred A. Hamprecht, Melih Kandemir

    Abstract: Model selection is treated as a standard performance boosting step in many machine learning applications. Once all other properties of a learning problem are fixed, the model is selected by grid search on a held-out validation set. This is strictly inapplicable to active learning. Within the standardized workflow, the acquisition function is chosen among available heuristics a priori, and its succ… ▽ More

    Submitted 27 June, 2019; originally announced June 2019.

    Comments: Accepted at IJCAI 2019

  10. arXiv:1906.00816  [pdf, ps, other

    stat.ML cs.LG

    Bayesian Evidential Deep Learning with PAC Regularization

    Authors: Manuel Haussmann, Sebastian Gerwinn, Melih Kandemir

    Abstract: We propose a novel method for closed-form predictive distribution modeling with neural nets. In quantifying prediction uncertainty, we build on Evidential Deep Learning, which has been impactful as being both simple to implement and giving closed-form access to predictive uncertainty. We employ it to model aleatoric uncertainty and extend it to account also for epistemic uncertainty by converting… ▽ More

    Submitted 21 January, 2021; v1 submitted 3 June, 2019; originally announced June 2019.

    Comments: Presented at AABI 2020

  11. arXiv:1805.07654  [pdf, other

    stat.ML cs.LG

    Sampling-Free Variational Inference of Bayesian Neural Networks by Variance Backpropagation

    Authors: Manuel Haussmann, Fred A. Hamprecht, Melih Kandemir

    Abstract: We propose a new Bayesian Neural Net formulation that affords variational inference for which the evidence lower bound is analytically tractable subject to a tight approximation. We achieve this tractability by (i) decomposing ReLU nonlinearities into the product of an identity and a Heaviside step function, (ii) introducing a separate path that decomposes the neural net expectation from its varia… ▽ More

    Submitted 12 June, 2019; v1 submitted 19 May, 2018; originally announced May 2018.

    Comments: Accepted at UAI 2019