Showing 1–22 of 22 results for author: Kirschner, J

Searching in archive cs.
  1. arXiv:2404.04004  [pdf, other]

    cs.RO

    Towards Safe Robot Use with Edged or Pointed Objects: A Surrogate Study Assembling a Human Hand Injury Protection Database

    Authors: Robin Jeanne Kirschner, Carina M. Micheler, Yangcan Zhou, Sebastian Siegner, Mazin Hamad, Claudio Glowalla, Jan Neumann, Nader Rajaei, Rainer Burgkart, Sami Haddadin

    Abstract: The use of pointed or edged tools or objects is one of the most challenging aspects of today's application of physical human-robot interaction (pHRI). One reason for this is that the severity of harm caused by such edged or pointed impactors is less well studied than for blunt impactors. Consequently, the standards specify well-reasoned force and pressure thresholds for blunt impactors and advise…

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: Accepted for presentation at IEEE ICRA 2024

  2. arXiv:2403.10379  [pdf, other]

    cs.LG

    Regret Minimization via Saddle Point Optimization

    Authors: Johannes Kirschner, Seyed Alireza Bakhtiari, Kushagra Chandak, Volodymyr Tkachuk, Csaba Szepesvári

    Abstract: A long line of works characterizes the sample complexity of regret minimization in sequential decision-making by min-max programs. In the corresponding saddle-point game, the min-player optimizes the sampling distribution against an adversarial max-player that chooses confusing models leading to large regret. The most recent instantiation of this idea is the decision-estimation coefficient (DEC),…

    Submitted 15 March, 2024; originally announced March 2024.

  3. arXiv:2312.00616  [pdf, other]

    cs.LG stat.ME stat.ML

    Investigating a domain adaptation approach for integrating different measurement instruments in a longitudinal clinical registry

    Authors: Maren Hackenberg, Michelle Pfaffenlehner, Max Behrens, Astrid Pechmann, Janbernd Kirschner, Harald Binder

    Abstract: In a longitudinal clinical registry, different measurement instruments might have been used for assessing individuals at different time points. To combine them, we investigate deep learning techniques for obtaining a joint latent representation, to which the items of different measurement instruments are mapped. This corresponds to domain adaptation, an established concept in computer science for…

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: 18 pages, 4 figures

  4. arXiv:2311.16286  [pdf, other]

    stat.ME cs.LG stat.AP stat.ML

    A statistical approach to latent dynamic modeling with differential equations

    Authors: Maren Hackenberg, Astrid Pechmann, Clemens Kreutz, Janbernd Kirschner, Harald Binder

    Abstract: Ordinary differential equations (ODEs) can provide mechanistic models of temporally local changes of processes, where parameters are often informed by external knowledge. While ODEs are popular in systems modeling, they are less established for statistical modeling of longitudinal cohort data, e.g., in a clinical setting. Yet, modeling of local changes could also be attractive for assessing the tr…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 29 pages, 6 figures

  5. arXiv:2309.09936  [pdf, other]

    cs.RO

    A Concise Overview of Safety Aspects in Human-Robot Interaction

    Authors: Mazin Hamad, Simone Nertinger, Robin J. Kirschner, Luis Figueredo, Abdeldjallil Naceri, Sami Haddadin

    Abstract: As of today, robots exhibit impressive agility but also pose potential hazards to humans using/collaborating with them. Consequently, safety is considered the most paramount factor in human-robot interaction (HRI). This paper presents a multi-layered safety architecture, integrating both physical and cognitive aspects for effective HRI. We outline critical requirements for physical safety layers a…

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: Accepted for Human-Friendly Robotics 2023: 16th International Workshop

  6. arXiv:2302.04376  [pdf, other]

    cs.LG cs.AI stat.ML

    Efficient Planning in Combinatorial Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning

    Authors: Volodymyr Tkachuk, Seyed Alireza Bakhtiari, Johannes Kirschner, Matej Jusup, Ilija Bogunovic, Csaba Szepesvári

    Abstract: A practical challenge in reinforcement learning are combinatorial action spaces that make planning computationally demanding. For example, in cooperative multi-agent reinforcement learning, a potentially large number of agents jointly optimize a global reward function, which leads to a combinatorial blow-up in the action space by the number of agents. As a minimal requirement, we assume access to…

    Submitted 8 February, 2023; originally announced February 2023.

  7. arXiv:2302.03683  [pdf, ps, other]

    cs.LG stat.ML

    Linear Partial Monitoring for Sequential Decision-Making: Algorithms, Regret Bounds and Applications

    Authors: Johannes Kirschner, Tor Lattimore, Andreas Krause

    Abstract: Partial monitoring is an expressive framework for sequential decision-making with an abundance of applications, including graph-structured and dueling bandits, dynamic pricing and transductive feedback models. We survey and extend recent results on the linear formulation of partial monitoring that naturally generalizes the standard linear bandit setting. The main result is that a single algorithm,…

    Submitted 13 November, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

  8. arXiv:2212.09510  [pdf, other]

    stat.ML cs.AI cs.LG

    Near-optimal Policy Identification in Active Reinforcement Learning

    Authors: Xiang Li, Viraj Mehta, Johannes Kirschner, Ian Char, Willie Neiswanger, Jeff Schneider, Andreas Krause, Ilija Bogunovic

    Abstract: Many real-world reinforcement learning tasks require control of complex dynamical systems that involve both costly data acquisition processes and large state spaces. In cases where the transition dynamics can be readily evaluated at specified states (e.g., via a simulator), agents can operate in what is often referred to as planning with a generative model. We propose the AE-LSVI algorithm…

    Submitted 19 December, 2022; originally announced December 2022.

  9. arXiv:2212.08949  [pdf, other]

    cs.LG eess.SY stat.ML

    Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off

    Authors: Zichen Zhang, Johannes Kirschner, Junxi Zhang, Francesco Zanini, Alex Ayoub, Masood Dehghan, Dale Schuurmans

    Abstract: A default assumption in reinforcement learning (RL) and optimal control is that observations arrive at discrete time points on a fixed clock cycle. Yet, many applications involve continuous-time systems where the time discretization, in principle, can be managed. The impact of time discretization on RL methods has not been fully characterized in existing theory, but a more detailed analysis of its…

    Submitted 16 January, 2024; v1 submitted 17 December, 2022; originally announced December 2022.

    Comments: NeurIPS 2023

  10. Tuning Particle Accelerators with Safety Constraints using Bayesian Optimization

    Authors: Johannes Kirschner, Mojmir Mutný, Andreas Krause, Jaime Coello de Portugal, Nicole Hiller, Jochem Snuverink

    Abstract: Tuning machine parameters of particle accelerators is a repetitive and time-consuming task that is challenging to automate. While many off-the-shelf optimization algorithms are available, in practice their use is limited because most methods do not account for safety-critical constraints in each iteration, such as loss signals or step-size limitations. One notable exception is safe Bayesian optimi…

    Submitted 30 June, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

  11. arXiv:2203.02706  [pdf, other]

    cs.RO

    ISO/TS 15066: How Different Interpretations Affect Risk Assessment

    Authors: Robin Jeanne Kirschner, Nico Mansfeld, Saeed Abdolshah, Sami Haddadin

    Abstract: The current technical specification ISO/TS15066:2016(E) for safe human-robot interaction contains logically conflicting definitions for the contact between human and robot. This may result in different interpretations for the contact classification and thus no unique outcome can be expected, which may even cause a risk to the human. In previous work, we showed a first set of implications. This pap…

    Submitted 5 March, 2022; originally announced March 2022.

  12. Expectable Motion Unit: Avoiding Hazards From Human Involuntary Motions in Human-Robot Interaction

    Authors: Robin Jeanne Kirschner, Henning Mayer, Lisa Burr, Nico Mansfeld, Saeed Abdolshah, Sami Haddadin

    Abstract: In robotics, many control and planning schemes have been developed to ensure human physical safety in human-robot interaction. The human psychological state and the expectation towards the robot, however, are typically neglected. Even if the robot behaviour is regarded as biomechanically safe, humans may still react with a rapid involuntary motion (IM) caused by a startle or surprise. Such sudden,…

    Submitted 4 April, 2024; v1 submitted 15 September, 2021; originally announced September 2021.

    Journal ref: IEEE Robotics and Automation Letters, vol. 7, no. 2, pp. 2993-3000, April 2022

  13. arXiv:2105.11802  [pdf, other]

    stat.ML cs.LG

    Bias-Robust Bayesian Optimization via Dueling Bandits

    Authors: Johannes Kirschner, Andreas Krause

    Abstract: We consider Bayesian optimization in settings where observations can be adversarially biased, for example by an uncontrolled hidden confounder. Our first contribution is a reduction of the confounded setting to the dueling bandit model. Then we propose a novel approach for dueling bandits based on information-directed sampling (IDS). Thereby, we obtain the first efficient kernelized algorithm for…

    Submitted 9 June, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

  14. arXiv:2101.08534  [pdf, other]

    stat.ML cs.LG

    Efficient Pure Exploration for Combinatorial Bandits with Semi-Bandit Feedback

    Authors: Marc Jourdan, Mojmír Mutný, Johannes Kirschner, Andreas Krause

    Abstract: Combinatorial bandits with semi-bandit feedback generalize multi-armed bandits, where the agent chooses sets of arms and observes a noisy reward for each arm contained in the chosen set. The action set satisfies a given structure such as forming a base of a matroid or a path in a graph. We focus on the pure-exploration problem of identifying the best arm with fixed confidence, as well as a more ge…

    Submitted 21 January, 2021; originally announced January 2021.

    Comments: 45 pages. 3 tables. Appendices: from A to I. Figures: 1(a), 1(b), 2(a), 2(b), 3(a), 3(b), 3(c), 4(a), 4(b), 5(a), 5(b), 5(c), 5(d), 6(a), 6(b). To be published in the 32nd International Conference on Algorithmic Learning Theory and the Proceedings of Machine Learning Research vol 132:1-45, 2021

  15. arXiv:2012.00634  [pdf, other]

    stat.ML cs.LG

    Deep dynamic modeling with just two time points: Can we still allow for individual trajectories?

    Authors: Maren Hackenberg, Philipp Harms, Michelle Pfaffenlehner, Astrid Pechmann, Janbernd Kirschner, Thorsten Schmidt, Harald Binder

    Abstract: Longitudinal biomedical data are often characterized by a sparse time grid and individual-specific development patterns. Specifically, in epidemiological cohort studies and clinical registries we are facing the question of what can be learned from the data in an early phase of the study, when only a baseline characterization and one follow-up measurement are available. Inspired by recent advances…

    Submitted 20 December, 2021; v1 submitted 1 December, 2020; originally announced December 2020.

    Comments: 23 pages, 7 figures

  16. arXiv:2011.05944  [pdf, other]

    stat.ML cs.LG

    Asymptotically Optimal Information-Directed Sampling

    Authors: Johannes Kirschner, Tor Lattimore, Claire Vernade, Csaba Szepesvári

    Abstract: We introduce a simple and efficient algorithm for stochastic linear bandits with finitely many actions that is asymptotically optimal and (nearly) worst-case optimal in finite time. The approach is based on the frequentist information-directed sampling (IDS) framework, with a surrogate for the information gain that is informed by the optimization problem that defines the asymptotic lower bound. Ou…

    Submitted 2 July, 2021; v1 submitted 11 November, 2020; originally announced November 2020.

    Comments: Accepted at COLT 2021

  17. arXiv:2007.14443  [pdf]

    cs.CY cond-mat.mtrl-sci

    A user-centered approach to designing an experimental laboratory data platform

    Authors: Ha-Kyung Kwon, Chirranjeevi Balaji Gopal, Jared Kirschner, Santiago Caicedo, Brian D. Storey

    Abstract: While automated experiments and high-throughput methods are becoming more mainstream in the age of data, empowering individual researchers to capture, collate, and contextualize their data faster and more reproducibly still remains a challenge in science. Despite the abundance of software products to help digitize and organize scientific information, their broader adoption in the scientific commun…

    Submitted 28 July, 2020; originally announced July 2020.

    Comments: 15 pages, 3 figures (38 pages in Supplementary Materials)

  18. arXiv:2002.11182  [pdf, other]

    stat.ML cs.LG

    Information Directed Sampling for Linear Partial Monitoring

    Authors: Johannes Kirschner, Tor Lattimore, Andreas Krause

    Abstract: Partial monitoring is a rich framework for sequential decision making under uncertainty that generalizes many well known bandit models, including linear, combinatorial and dueling bandits. We introduce information directed sampling (IDS) for stochastic partial monitoring with a linear reward and observation structure. IDS achieves adaptive worst-case regret rates that depend on precise observabili…

    Submitted 25 February, 2020; originally announced February 2020.

  19. arXiv:2002.09038  [pdf, other]

    stat.ML cs.LG

    Distributionally Robust Bayesian Optimization

    Authors: Johannes Kirschner, Ilija Bogunovic, Stefanie Jegelka, Andreas Krause

    Abstract: Robustness to distributional shift is one of the key challenges of contemporary machine learning. Attaining such robustness is the goal of distributionally robust optimization, which seeks a solution to an optimization problem that is worst-case robust under a specified distributional shift of an uncontrolled covariate. In this paper, we study such a problem when the distributional shift is measur…

    Submitted 22 March, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

    Comments: Accepted at AISTATS 2020

  20. arXiv:1906.02685  [pdf, other]

    stat.ML cs.LG

    Stochastic Bandits with Context Distributions

    Authors: Johannes Kirschner, Andreas Krause

    Abstract: We introduce a stochastic contextual bandit model where at each time step the environment chooses a distribution over a context set and samples the context from this distribution. The learner observes only the context distribution while the exact context realization remains hidden. This allows for a broad range of applications where the context is stochastic or when the learner needs to predict th…

    Submitted 14 November, 2019; v1 submitted 6 June, 2019; originally announced June 2019.

    Comments: Accepted at NeurIPS 2019

  21. arXiv:1902.03229  [pdf, other]

    cs.LG stat.ML

    Adaptive and Safe Bayesian Optimization in High Dimensions via One-Dimensional Subspaces

    Authors: Johannes Kirschner, Mojmír Mutný, Nicole Hiller, Rasmus Ischebeck, Andreas Krause

    Abstract: Bayesian optimization is known to be difficult to scale to high dimensions, because the acquisition step requires solving a non-convex optimization problem in the same search space. In order to scale the method and keep its benefits, we propose an algorithm (LineBO) that restricts the problem to a sequence of iteratively chosen one-dimensional sub-problems that can be solved efficiently. We show t…

    Submitted 28 May, 2019; v1 submitted 8 February, 2019; originally announced February 2019.

  22. arXiv:1812.07544  [pdf, other]

    cs.LG cs.AI stat.ML

    Information-Directed Exploration for Deep Reinforcement Learning

    Authors: Nikolay Nikolov, Johannes Kirschner, Felix Berkenkamp, Andreas Krause

    Abstract: Efficient exploration remains a major challenge for reinforcement learning. One reason is that the variability of the returns often depends on the current state and action, and is therefore heteroscedastic. Classical exploration strategies such as upper confidence bound algorithms and Thompson sampling fail to appropriately account for heteroscedasticity, even in the bandit setting. Motivated by r…

    Submitted 24 March, 2019; v1 submitted 18 December, 2018; originally announced December 2018.