Skip to main content

Showing 1–20 of 20 results for author: Kandemir, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.03890  [pdf, other

    cs.LG stat.ML

    Exploring Pessimism and Optimism Dynamics in Deep Reinforcement Learning

    Authors: Bahareh Tasdighi, Nicklas Werge, Yi-Shan Wu, Melih Kandemir

    Abstract: Off-policy actor-critic algorithms have shown promise in deep reinforcement learning for continuous control tasks. Their success largely stems from leveraging pessimistic state-action value function updates, which effectively address function approximation errors and improve performance. However, such pessimism can lead to under-exploration, constraining the agent's ability to explore/refine its p… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  2. arXiv:2309.14298  [pdf, other

    stat.ML cs.LG

    Improved Algorithms for Stochastic Linear Bandits Using Tail Bounds for Martingale Mixtures

    Authors: Hamish Flynn, David Reeb, Melih Kandemir, Jan Peters

    Abstract: We present improved algorithms with worst-case regret guarantees for the stochastic linear bandit problem. The widely used "optimism in the face of uncertainty" principle reduces a stochastic bandit problem to the construction of a confidence sequence for the unknown reward function. The performance of the resulting bandit algorithm depends on the size of the confidence sequence, with smaller conf… ▽ More

    Submitted 27 September, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: Accepted at NeurIPS 2023. 35 pages, 6 figures

  3. arXiv:2309.08332  [pdf, other

    cs.LG stat.ME

    Estimation of Counterfactual Interventions under Uncertainties

    Authors: Juliane Weilbach, Sebastian Gerwinn, Melih Kandemir, Martin Fraenzle

    Abstract: Counterfactual analysis is intuitively performed by humans on a daily basis eg. "What should I have done differently to get the loan approved?". Such counterfactual questions also steer the formulation of scientific hypotheses. More formally it provides insights about potential improvements of a system by inferring the effects of hypothetical interventions into a past observation of the system's b… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

  4. arXiv:2309.08256  [pdf, other

    cs.LG stat.ML

    Sampling-Free Probabilistic Deep State-Space Models

    Authors: Andreas Look, Melih Kandemir, Barbara Rakitsch, Jan Peters

    Abstract: Many real-world dynamical systems can be described as State-Space Models (SSMs). In this formulation, each observation is emitted by a latent state, which follows first-order Markovian dynamics. A Probabilistic Deep SSM (ProDSSM) generalizes this framework to dynamical systems of unknown parametric form, where the transition and emission models are described by neural networks with uncertain weigh… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

  5. arXiv:2307.03587  [pdf, other

    cs.LG stat.ML

    BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits

    Authors: Nicklas Werge, Abdullah Akgül, Melih Kandemir

    Abstract: We propose a novel Bayesian-Optimistic Frequentist Upper Confidence Bound (BOF-UCB) algorithm for stochastic contextual linear bandits in non-stationary environments. This unique combination of Bayesian and frequentist principles enhances adaptability and performance in dynamic settings. The BOF-UCB algorithm utilizes sequential Bayesian updates to infer the posterior distribution of the unknown r… ▽ More

    Submitted 19 July, 2023; v1 submitted 7 July, 2023; originally announced July 2023.

  6. arXiv:2305.01773  [pdf, other

    cs.LG cs.AI stat.ML

    Cheap and Deterministic Inference for Deep State-Space Models of Interacting Dynamical Systems

    Authors: Andreas Look, Melih Kandemir, Barbara Rakitsch, Jan Peters

    Abstract: Graph neural networks are often used to model interacting dynamical systems since they gracefully scale to systems with a varying and high number of agents. While there has been much progress made for deterministic interacting systems, modeling is much more challenging for stochastic systems in which one is interested in obtaining a predictive distribution over future trajectories. Existing method… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  7. arXiv:2301.12776  [pdf, other

    cs.LG stat.ML

    PAC-Bayesian Soft Actor-Critic Learning

    Authors: Bahareh Tasdighi, Abdullah Akgül, Manuel Haussmann, Kenny Kazimirzak Brink, Melih Kandemir

    Abstract: Actor-critic algorithms address the dual goals of reinforcement learning (RL), policy evaluation and improvement via two separate function approximators. The practicality of this approach comes at the expense of training instability, caused mainly by the destructive effect of the approximation errors of the critic on the actor. We tackle this bottleneck by employing an existing Probably Approximat… ▽ More

    Submitted 10 June, 2024; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: 19 pages, 2 figures

  8. PAC-Bayes Bounds for Bandit Problems: A Survey and Experimental Comparison

    Authors: Hamish Flynn, David Reeb, Melih Kandemir, Jan Peters

    Abstract: PAC-Bayes has recently re-emerged as an effective theory with which one can derive principled learning algorithms with tight performance guarantees. However, applications of PAC-Bayes to bandit problems are relatively rare, which is a great misfortune. Many decision-making problems in healthcare, finance and natural sciences can be modelled as bandit problems. In many of these applications, princi… ▽ More

    Submitted 16 July, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: 32 pages, 8 figures

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

  9. arXiv:2205.11894  [pdf, other

    cs.LG stat.ML

    Learning Interacting Dynamical Systems with Latent Gaussian Process ODEs

    Authors: Çağatay Yıldız, Melih Kandemir, Barbara Rakitsch

    Abstract: We study time uncertainty-aware modeling of continuous-time dynamics of interacting objects. We introduce a new model that decomposes independent dynamics of single objects accurately from their interactions. By employing latent Gaussian process ordinary differential equations, our model infers both independent dynamics and their interactions with reliable uncertainty estimates. In our formulation… ▽ More

    Submitted 12 October, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

  10. PAC-Bayesian Lifelong Learning For Multi-Armed Bandits

    Authors: Hamish Flynn, David Reeb, Melih Kandemir, Jan Peters

    Abstract: We present a PAC-Bayesian analysis of lifelong learning. In the lifelong learning problem, a sequence of learning tasks is observed one-at-a-time, and the goal is to transfer information acquired from previous tasks to new learning tasks. We consider the case when each learning task is a multi-armed bandit problem. We derive lower bounds on the expected average reward that would be obtained if a g… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: 29 pages, 5 figures

    Journal ref: Data Mining and Knowledge Discovery, 2022, Special Issue of the Journal Track ECML PKDD 2022

  11. arXiv:2203.00936  [pdf, other

    cs.LG stat.ML

    Continual Learning of Multi-modal Dynamics with External Memory

    Authors: Abdullah Akgül, Gozde Unal, Melih Kandemir

    Abstract: We study the problem of fitting a model to a dynamical environment when new modes of behavior emerge sequentially. The learning model is aware when a new mode appears, but it cannot access the true modes of individual training sequences. The state-of-the-art continual learning approaches cannot handle this setup, because parameter transfer suffers from catastrophic interference and episodic memory… ▽ More

    Submitted 9 May, 2024; v1 submitted 2 March, 2022; originally announced March 2022.

  12. arXiv:2112.03230  [pdf, other

    cs.LG stat.ML

    Traversing Time with Multi-Resolution Gaussian Process State-Space Models

    Authors: Krista Longi, Jakob Lindinger, Olaf Duennbier, Melih Kandemir, Arto Klami, Barbara Rakitsch

    Abstract: Gaussian Process state-space models capture complex temporal dependencies in a principled manner by placing a Gaussian Process prior on the transition function. These models have a natural interpretation as discretized stochastic differential equations, but inference for long sequences with fast and slow transitions is difficult. Fast transitions need tight discretizations whereas slow transitions… ▽ More

    Submitted 23 February, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

    Comments: Added links to code and dataset. Added author contributions

  13. arXiv:2006.13866  [pdf, other

    cs.LG stat.ML

    Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

    Authors: Weilin Cong, Rana Forsati, Mahmut Kandemir, Mehrdad Mahdavi

    Abstract: Sampling methods (e.g., node-wise, layer-wise, or subgraph) has become an indispensable strategy to speed up training large-scale Graph Neural Networks (GNNs). However, existing sampling methods are mostly based on the graph structural information and ignore the dynamicity of optimization, which leads to high variance in estimating the stochastic gradients. The high variance issue can be very pron… ▽ More

    Submitted 5 September, 2021; v1 submitted 24 June, 2020; originally announced June 2020.

  14. arXiv:2006.09914  [pdf, other

    cs.LG stat.ML

    Learning Partially Known Stochastic Dynamics with Empirical PAC Bayes

    Authors: Manuel Haussmann, Sebastian Gerwinn, Andreas Look, Barbara Rakitsch, Melih Kandemir

    Abstract: Neural Stochastic Differential Equations model a dynamical environment with neural nets assigned to their drift and diffusion terms. The high expressive power of their nonlinearity comes at the expense of instability in the identification of the large set of free parameters. This paper presents a recipe to improve the prediction accuracy of such models in three steps: i) accounting for epistemic u… ▽ More

    Submitted 26 February, 2021; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: Accepted at AISTATS 2021

  15. arXiv:2006.08973  [pdf, other

    cs.LG stat.ML

    A Deterministic Approximation to Neural SDEs

    Authors: Andreas Look, Melih Kandemir, Barbara Rakitsch, Jan Peters

    Abstract: Neural Stochastic Differential Equations (NSDEs) model the drift and diffusion functions of a stochastic process as neural networks. While NSDEs are known to make accurate predictions, their uncertainty quantification properties have been remained unexplored so far. We report the empirical finding that obtaining well-calibrated uncertainty estimations from NSDEs is computationally prohibitive. As… ▽ More

    Submitted 12 September, 2022; v1 submitted 16 June, 2020; originally announced June 2020.

  16. arXiv:1912.00796  [pdf, other

    cs.LG stat.ML

    Differential Bayesian Neural Nets

    Authors: Andreas Look, Melih Kandemir

    Abstract: Neural Ordinary Differential Equations (N-ODEs) are a powerful building block for learning systems, which extend residual networks to a continuous-time dynamical system. We propose a Bayesian version of N-ODEs that enables well-calibrated quantification of prediction uncertainty, while maintaining the expressive power of their deterministic counterpart. We assign Bayesian Neural Nets (BNNs) to bot… ▽ More

    Submitted 18 February, 2020; v1 submitted 2 December, 2019; originally announced December 2019.

    Journal ref: 4th workshop on Bayesian Deep Learning (NeurIPS 2019), Vancouver, Canada

  17. arXiv:1906.11471  [pdf, other

    stat.ML cs.LG

    Deep Active Learning with Adaptive Acquisition

    Authors: Manuel Haussmann, Fred A. Hamprecht, Melih Kandemir

    Abstract: Model selection is treated as a standard performance boosting step in many machine learning applications. Once all other properties of a learning problem are fixed, the model is selected by grid search on a held-out validation set. This is strictly inapplicable to active learning. Within the standardized workflow, the acquisition function is chosen among available heuristics a priori, and its succ… ▽ More

    Submitted 27 June, 2019; originally announced June 2019.

    Comments: Accepted at IJCAI 2019

  18. arXiv:1906.00816  [pdf, ps, other

    stat.ML cs.LG

    Bayesian Evidential Deep Learning with PAC Regularization

    Authors: Manuel Haussmann, Sebastian Gerwinn, Melih Kandemir

    Abstract: We propose a novel method for closed-form predictive distribution modeling with neural nets. In quantifying prediction uncertainty, we build on Evidential Deep Learning, which has been impactful as being both simple to implement and giving closed-form access to predictive uncertainty. We employ it to model aleatoric uncertainty and extend it to account also for epistemic uncertainty by converting… ▽ More

    Submitted 21 January, 2021; v1 submitted 3 June, 2019; originally announced June 2019.

    Comments: Presented at AABI 2020

  19. arXiv:1806.01768  [pdf, other

    cs.LG stat.ML

    Evidential Deep Learning to Quantify Classification Uncertainty

    Authors: Murat Sensoy, Lance Kaplan, Melih Kandemir

    Abstract: Deterministic neural nets have been shown to learn effective predictors on a wide range of machine learning problems. However, as the standard approach is to train the network to minimize a prediction loss, the resultant model remains ignorant to its prediction confidence. Orthogonally to Bayesian neural nets that indirectly infer prediction uncertainty through weight uncertainties, we propose exp… ▽ More

    Submitted 31 October, 2018; v1 submitted 5 June, 2018; originally announced June 2018.

  20. arXiv:1805.07654  [pdf, other

    stat.ML cs.LG

    Sampling-Free Variational Inference of Bayesian Neural Networks by Variance Backpropagation

    Authors: Manuel Haussmann, Fred A. Hamprecht, Melih Kandemir

    Abstract: We propose a new Bayesian Neural Net formulation that affords variational inference for which the evidence lower bound is analytically tractable subject to a tight approximation. We achieve this tractability by (i) decomposing ReLU nonlinearities into the product of an identity and a Heaviside step function, (ii) introducing a separate path that decomposes the neural net expectation from its varia… ▽ More

    Submitted 12 June, 2019; v1 submitted 19 May, 2018; originally announced May 2018.

    Comments: Accepted at UAI 2019