Skip to main content

Showing 1–30 of 30 results for author: Beck, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.07981  [pdf, other

    cs.RO eess.SY

    Diagnosing and Predicting Autonomous Vehicle Operational Safety Using Multiple Simulation Modalities and a Virtual Environment

    Authors: Joe Beck, Shean Huff, Subhadeep Chakraborty

    Abstract: Even as technology and performance gains are made in the sphere of automated driving, safety concerns remain. Vehicle simulation has long been seen as a tool to overcome the cost associated with a massive amount of on-road testing for development and discovery of safety critical "edge-cases". However, purely software-based vehicle models may leave a large realism gap between their real-world count… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: Preprint. Under Review

  2. arXiv:2403.19062  [pdf, other

    eess.SY cs.RO

    GENESIS-RL: GEnerating Natural Edge-cases with Systematic Integration of Safety considerations and Reinforcement Learning

    Authors: Hsin-Jung Yang, Joe Beck, Md Zahid Hasan, Ekin Beyazit, Subhadeep Chakraborty, Tichakorn Wongpiromsarn, Soumik Sarkar

    Abstract: In the rapidly evolving field of autonomous systems, the safety and reliability of the system components are fundamental requirements. These components are often vulnerable to complex and unforeseen environments, making natural edge-case generation essential for enhancing system resilience. This paper presents GENESIS-RL, a novel framework that leverages system-level safety considerations and rein… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  3. arXiv:2403.03020  [pdf, other

    cs.LG cs.AI

    SplAgger: Split Aggregation for Meta-Reinforcement Learning

    Authors: Jacob Beck, Matthew Jackson, Risto Vuorio, Zheng Xiong, Shimon Whiteson

    Abstract: A core ambition of reinforcement learning (RL) is the creation of agents capable of rapid learning in novel tasks. Meta-RL aims to achieve this by directly learning such agents. Black box methods do so by training off-the-shelf sequence models end-to-end. By contrast, task inference methods explicitly infer a posterior distribution over the unknown task, typically using distinct objectives and seq… ▽ More

    Submitted 1 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: Published at Reinforcement Learning Conference (RLC) 2024. Code is provided at https://github.com/jacooba/hyper

  4. arXiv:2402.12231  [pdf, other

    cs.LG

    Diffusion Tempering Improves Parameter Estimation with Probabilistic Integrators for Ordinary Differential Equations

    Authors: Jonas Beck, Nathanael Bosch, Michael Deistler, Kyra L. Kadhim, Jakob H. Macke, Philipp Hennig, Philipp Berens

    Abstract: Ordinary differential equations (ODEs) are widely used to describe dynamical systems in science, but identifying parameters that explain experimental measurements is challenging. In particular, although ODEs are differentiable and would allow for gradient-based parameter optimization, the nonlinear dynamics of ODEs often lead to many local minima and extreme sensitivity to initial conditions. We t… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  5. arXiv:2402.06570  [pdf, other

    cs.LG cs.RO

    Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology Control

    Authors: Zheng Xiong, Risto Vuorio, Jacob Beck, Matthieu Zimmer, Kun Shao, Shimon Whiteson

    Abstract: Learning a universal policy across different robot morphologies can significantly improve learning efficiency and enable zero-shot generalization to unseen morphologies. However, learning a highly performant universal policy requires sophisticated architectures like transformers (TF) that have larger memory and computational cost than simpler multi-layer perceptrons (MLP). To achieve both good per… ▽ More

    Submitted 3 June, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: ICML 2024

  6. arXiv:2401.13883  [pdf, other

    cs.AI

    Domain-Independent Dynamic Programming

    Authors: Ryo Kuroiwa, J. Christopher Beck

    Abstract: For combinatorial optimization problems, model-based paradigms such as mixed-integer programming (MIP) and constraint programming (CP) aim to decouple modeling and solving a problem: the `holy grail' of declarative problem solving. We propose domain-independent dynamic programming (DIDP), a new model-based paradigm based on dynamic programming (DP). While DP is not new, it has typically been imple… ▽ More

    Submitted 31 May, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: Manuscript submitted to Artificial Intelligence

    ACM Class: F.2.2; I.2.8

  7. arXiv:2312.11675  [pdf, other

    cs.AI

    PRP Rebooted: Advancing the State of the Art in FOND Planning

    Authors: Christian Muise, Sheila A. McIlraith, J. Christopher Beck

    Abstract: Fully Observable Non-Deterministic (FOND) planning is a variant of classical symbolic planning in which actions are nondeterministic, with an action's outcome known only upon execution. It is a popular planning paradigm with applications ranging from robot planning to dialogue-agent design and reactive synthesis. Over the last 20 years, a number of approaches to FOND planning have emerged. In this… ▽ More

    Submitted 19 December, 2023; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 13 pages, 4 figures, AAAI conference paper Update: Fixed abstract and typos

    ACM Class: I.2.8

  8. arXiv:2311.14212  [pdf, other

    stat.ML cs.CL cs.LG stat.ME

    Annotation Sensitivity: Training Data Collection Methods Affect Model Performance

    Authors: Christoph Kern, Stephanie Eckman, Jacob Beck, Rob Chew, Bolei Ma, Frauke Kreuter

    Abstract: When training data are collected from human annotators, the design of the annotation instrument, the instructions given to annotators, the characteristics of the annotators, and their interactions can impact training data. This study demonstrates that design choices made when creating an annotation instrument also impact the models trained on the resulting annotations. We introduce the term annota… ▽ More

    Submitted 22 January, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023 Findings: https://aclanthology.org/2023.findings-emnlp.992/

  9. arXiv:2309.14970  [pdf, other

    cs.LG cs.AI cs.RO

    Recurrent Hypernetworks are Surprisingly Strong in Meta-RL

    Authors: Jacob Beck, Risto Vuorio, Zheng Xiong, Shimon Whiteson

    Abstract: Deep reinforcement learning (RL) is notoriously impractical to deploy due to sample inefficiency. Meta-RL directly addresses this sample inefficiency by learning to perform few-shot learning when a distribution of related tasks is available for meta-training. While many specialized meta-RL methods have been proposed, recent work suggests that end-to-end learning in conjunction with an off-the-shel… ▽ More

    Submitted 26 December, 2023; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: Published at NeurIPS 2023. We provide code at https://github.com/jacooba/hyper

  10. Fully Embedded Time-Series Generative Adversarial Networks

    Authors: Joe Beck, Subhadeep Chakraborty

    Abstract: Generative Adversarial Networks (GANs) should produce synthetic data that fits the underlying distribution of the data being modeled. For real valued time-series data, this implies the need to simultaneously capture the static distribution of the data, but also the full temporal distribution of the data for any potential time horizon. This temporal element produces a more complex problem that can… ▽ More

    Submitted 13 May, 2024; v1 submitted 29 August, 2023; originally announced August 2023.

    Comments: Final Manuscript. Accepted. Neural Computing and Applications May 2024

    Journal ref: Neural Comput & Applic (2024)

  11. arXiv:2302.11070  [pdf, other

    cs.AI cs.RO stat.ML

    Universal Morphology Control via Contextual Modulation

    Authors: Zheng Xiong, Jacob Beck, Shimon Whiteson

    Abstract: Learning a universal policy across different robot morphologies can significantly improve learning efficiency and generalization in continuous control. However, it poses a challenging multi-task reinforcement learning problem, as the optimal policy may be quite different across robots and critically depend on the morphology. Existing methods utilize graph neural networks or transformers to handle… ▽ More

    Submitted 3 August, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: Accepted by ICML 2023

  12. arXiv:2301.08028  [pdf, other

    cs.LG

    A Survey of Meta-Reinforcement Learning

    Authors: Jacob Beck, Risto Vuorio, Evan Zheran Liu, Zheng Xiong, Luisa Zintgraf, Chelsea Finn, Shimon Whiteson

    Abstract: While deep reinforcement learning (RL) has fueled multiple high-profile successes in machine learning, it is held back from more widespread adoption by its often poor data efficiency and the limited generality of the policies it produces. A promising approach for alleviating these limitations is to cast the development of better RL algorithms as a machine learning problem itself in a process calle… ▽ More

    Submitted 19 January, 2023; originally announced January 2023.

  13. Domain-Independent Dynamic Programming: Generic State Space Search for Combinatorial Optimization

    Authors: Ryo Kuroiwa, J. Christopher Beck

    Abstract: For combinatorial optimization problems, model-based approaches such as mixed-integer programming (MIP) and constraint programming (CP) aim to decouple modeling and solving a problem: the 'holy grail' of declarative problem solving. We propose domain-independent dynamic programming (DIDP), a new model-based paradigm based on dynamic programming (DP). While DP is not new, it has typically been impl… ▽ More

    Submitted 1 March, 2023; v1 submitted 25 November, 2022; originally announced November 2022.

    Comments: This paper was accepted at the 33rd International Conference on Automated Planning and Scheduling (ICAPS) 2023

    Journal ref: Proceedings of the International Conference on Automated Planning and Scheduling, 33(1), 2023, 236-244

  14. arXiv:2210.11915  [pdf, other

    cs.LG

    Efficient identification of informative features in simulation-based inference

    Authors: Jonas Beck, Michael Deistler, Yves Bernaerts, Jakob Macke, Philipp Berens

    Abstract: Simulation-based Bayesian inference (SBI) can be used to estimate the parameters of complex mechanistic models given observed model outputs without requiring access to explicit likelihood evaluations. A prime example for the application of SBI in neuroscience involves estimating the parameters governing the response dynamics of Hodgkin-Huxley (HH) models from electrophysiological measurements, by… ▽ More

    Submitted 25 November, 2022; v1 submitted 21 October, 2022; originally announced October 2022.

  15. arXiv:2210.11348  [pdf, other

    cs.LG cs.AI cs.RO

    Hypernetworks in Meta-Reinforcement Learning

    Authors: Jacob Beck, Matthew Thomas Jackson, Risto Vuorio, Shimon Whiteson

    Abstract: Training a reinforcement learning (RL) agent on a real-world robotics task remains generally impractical due to sample inefficiency. Multi-task RL and meta-RL aim to improve sample efficiency by generalizing over a distribution of related tasks. However, doing so is difficult in practice: In multi-task RL, state of the art methods often fail to outperform a degenerate solution that simply learns e… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: Published at CoRL 2022

  16. arXiv:2209.11303  [pdf, other

    cs.LG

    An Investigation of the Bias-Variance Tradeoff in Meta-Gradients

    Authors: Risto Vuorio, Jacob Beck, Shimon Whiteson, Jakob Foerster, Gregory Farquhar

    Abstract: Meta-gradients provide a general approach for optimizing the meta-parameters of reinforcement learning (RL) algorithms. Estimation of meta-gradients is central to the performance of these meta-algorithms, and has been studied in the setting of MAML-style short-horizon meta-RL problems. In this context, prior work has investigated the estimation of the Hessian of the RL objective, as well as tackli… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

  17. arXiv:2202.00082  [pdf, other

    cs.LG

    Trust Region Bounds for Decentralized PPO Under Non-stationarity

    Authors: Mingfei Sun, Sam Devlin, Jacob Beck, Katja Hofmann, Shimon Whiteson

    Abstract: We present trust region bounds for optimizing decentralized policies in cooperative Multi-Agent Reinforcement Learning (MARL), which holds even when the transition dynamics are non-stationary. This new analysis provides a theoretical understanding of the strong performance of two recent actor-critic methods for MARL, which both rely on independent ratios, i.e., computing probability ratios separat… ▽ More

    Submitted 15 February, 2023; v1 submitted 31 January, 2022; originally announced February 2022.

    Comments: AAMAS 2023

  18. arXiv:2112.00478  [pdf, other

    cs.LG cs.AI stat.ML

    On the Practical Consistency of Meta-Reinforcement Learning Algorithms

    Authors: Zheng Xiong, Luisa Zintgraf, Jacob Beck, Risto Vuorio, Shimon Whiteson

    Abstract: Consistency is the theoretical property of a meta learning algorithm that ensures that, under certain assumptions, it can adapt to any task at test time. An open question is whether and how theoretical consistency translates into practice, in comparison to inconsistent algorithms. In this paper, we empirically investigate this question on a set of representative meta-RL algorithms. We find that th… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

  19. arXiv:1908.08641  [pdf, other

    cs.HC cs.AI cs.GT

    Stackelberg Punishment and Bully-Proofing Autonomous Vehicles

    Authors: Matt Cooper, Jun Ki Lee, Jacob Beck, Joshua D. Fishman, Michael Gillett, Zoë Papakipos, Aaron Zhang, Jerome Ramos, Aansh Shah, Michael L. Littman

    Abstract: Mutually beneficial behavior in repeated games can be enforced via the threat of punishment, as enshrined in game theory's well-known "folk theorem." There is a cost, however, to a player for generating these disincentives. In this work, we seek to minimize this cost by computing a "Stackelberg punishment," in which the player selects a behavior that sufficiently punishes the other player while ma… ▽ More

    Submitted 22 August, 2019; originally announced August 2019.

    Comments: 10 pages, The 11th International Conference on Social Robotics

  20. arXiv:1908.05188  [pdf, other

    cs.HC eess.IV

    A Research Framework for Virtual Reality Neurosurgery Based on Open-Source Tools

    Authors: Lukas D. J. Fiederer, Hisham Alwanni, Martin Völker, Oliver Schnell, Jürgen Beck, Tonio Ball

    Abstract: Fully immersive virtual reality (VR) has the potential to improve neurosurgical planning. For example, it may offer 3D visualizations of relevant anatomical structures with complex shapes, such as blood vessels and tumors. However, there is a lack of research tools specifically tailored for this area. We present a research framework for VR neurosurgery based on open-source tools and preliminary ev… ▽ More

    Submitted 14 August, 2019; originally announced August 2019.

  21. arXiv:1903.11451  [pdf, other

    cs.SI cs.LG stat.ML

    Sensing Social Media Signals for Cryptocurrency News

    Authors: Johannes Beck, Roberta Huang, David Lindner, Tian Guo, Ce Zhang, Dirk Helbing, Nino Antulov-Fantulin

    Abstract: The ability to track and monitor relevant and important news in real-time is of crucial interest in multiple industrial sectors. In this work, we focus on the set of cryptocurrency news, which recently became of emerging interest to the general and financial audience. In order to track relevant news in real-time, we (i) match news from the web with tweets from social media, (ii) track their intrad… ▽ More

    Submitted 27 March, 2019; originally announced March 2019.

    Comments: full version of the paper, that is accepted at ACM WWW '19 Conference, MSM'19 Workshop

  22. Talking about interaction*

    Authors: Stuart Reeves, Jordan Beck

    Abstract: Recent research has exposed disagreements over the nature and usefulness of what may (or may not) be Human-Computer Interaction's fundamental phenomenon: 'interaction'. For some, HCI's theorising about interaction has been deficient, impacting its capacity to inform decisions in design, suggesting the need either to perform first-principles definition work or broader administrative clarification a… ▽ More

    Submitted 28 May, 2019; v1 submitted 8 March, 2019; originally announced March 2019.

  23. arXiv:1901.05101  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    ReNeg and Backseat Driver: Learning from Demonstration with Continuous Human Feedback

    Authors: Jacob Beck, Zoe Papakipos, Michael Littman

    Abstract: In autonomous vehicle (AV) control, allowing mistakes can be quite dangerous and costly in the real world. For this reason we investigate methods of training an AV without allowing the agent to explore and instead having a human explorer collect the data. Supervised learning has been explored for AV control, but it encounters the issue of the covariate shift. That is, training data collected from… ▽ More

    Submitted 15 January, 2019; originally announced January 2019.

  24. arXiv:1812.00148  [pdf, ps, other

    cs.HC

    Conversations for Vision: Remote Sighted Assistants Hel** People with Visual Impairments

    Authors: Sooyeon Lee, Madison Reddie, Krish Gurdasani, Xiying Wang, Jordan Beck, Mary Beth Rosson, John M. Carroll

    Abstract: People with visual impairment (PVI) must interact with a world they cannot see. Remote sighted assistance has emerged as a conversational/social support system. We interviewed participants who either provide or receive assistance via a conversational/social prosthetic called Aira (https://aira.io/). We identified four types of support provided: scene description, performance, social interaction, a… ▽ More

    Submitted 1 December, 2018; originally announced December 2018.

    Comments: 19 pages

  25. arXiv:1807.11121  [pdf, other

    cs.LG cs.AI stat.ML

    Neural Mesh: Introducing a Notion of Space and Conservation of Energy to Neural Networks

    Authors: Jacob Beck, Zoe Papakipos

    Abstract: Neural networks are based on a simplified model of the brain. In this project, we wanted to relax the simplifying assumptions of a traditional neural network by making a model that more closely emulates the low level interactions of neurons. Like in an RNN, our model has a state that persists between time steps, so that the energies of neurons persist. However, unlike an RNN, our state consists of… ▽ More

    Submitted 29 July, 2018; originally announced July 2018.

    Comments: 12 pages

  26. arXiv:1803.06775  [pdf, other

    quant-ph cs.AI cs.ET eess.SY

    Comparing and Integrating Constraint Programming and Temporal Planning for Quantum Circuit Compilation

    Authors: Kyle E. C. Booth, Minh Do, J. Christopher Beck, Eleanor Rieffel, Davide Venturelli, Jeremy Frank

    Abstract: Recently, the makespan-minimization problem of compiling a general class of quantum algorithms into near-term quantum processors has been introduced to the AI community. The research demonstrated that temporal planning is a strong approach for a class of quantum circuit compilation (QCC) problems. In this paper, we explore the use of constraint programming (CP) as an alternative and complementary… ▽ More

    Submitted 18 March, 2018; originally announced March 2018.

    Comments: 9 pages, 2 figures, Proceedings of the 28th International Conference of Automated Planning and Scheduling 2018 (ICAPS-18)

  27. Scheduling a Dynamic Aircraft Repair Shop with Limited Repair Resources

    Authors: Maliheh Aramon Bajestani, J. Christopher Beck

    Abstract: We address a dynamic repair shop scheduling problem in the context of military aircraft fleet management where the goal is to maintain a full complement of aircraft over the long-term. A number of flights, each with a requirement for a specific number and type of aircraft, are already scheduled over a long horizon. We need to assign aircraft to flights and schedule repair activities while consider… ▽ More

    Submitted 3 February, 2014; originally announced February 2014.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 47, pages 35-70, 2013

  28. A Constraint Programming Approach for Solving a Queueing Control Problem

    Authors: Daria Terekhov, J. Christopher Beck

    Abstract: In a facility with front room and back room operations, it is useful to switch workers between the rooms in order to cope with changing customer demand. Assuming stochastic customer arrival and service times, we seek a policy for switching workers such that the expected customer waiting time is minimized while the expected back room staffing is sufficient to perform all work. Three novel constrain… ▽ More

    Submitted 31 October, 2011; originally announced November 2011.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 32, pages 123-167, 2008

  29. Solution-Guided Multi-Point Constructive Search for Job Shop Scheduling

    Authors: J. C. Beck

    Abstract: Solution-Guided Multi-Point Constructive Search (SGMPCS) is a novel constructive search technique that performs a series of resource-limited tree searches where each search begins either from an empty solution (as in randomized restart) or from a solution that has been encountered during the search. A small number of these "elite solutions is maintained during the search. We introduce the techniqu… ▽ More

    Submitted 12 October, 2011; originally announced October 2011.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 29, pages 49-77, 2007

  30. Proactive Algorithms for Job Shop Scheduling with Probabilistic Durations

    Authors: J. C. Beck, N. Wilson

    Abstract: Most classical scheduling formulations assume a fixed and known duration for each activity. In this paper, we weaken this assumption, requiring instead that each duration can be represented by an independent random variable with a known mean and variance. The best solutions are ones which have a high probability of achieving a good makespan. We first create a theoretical framework, formally showi… ▽ More

    Submitted 12 October, 2011; originally announced October 2011.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 28, pages 183-232, 2007