Skip to main content

Showing 1–11 of 11 results for author: Tomlin, C

Searching in archive stat. Search in all archives.
.
  1. arXiv:2210.05015  [pdf, other

    cs.AI cs.RO eess.SY stat.ML

    Optimality Guarantees for Particle Belief Approximation of POMDPs

    Authors: Michael H. Lim, Tyler J. Becker, Mykel J. Kochenderfer, Claire J. Tomlin, Zachary N. Sunberg

    Abstract: Partially observable Markov decision processes (POMDPs) provide a flexible representation for real-world decision and control problems. However, POMDPs are notoriously difficult to solve, especially when the state and observation spaces are continuous or hybrid, which is often the case for physical systems. While recent online sampling-based POMDP algorithms that plan with observation likelihood w… ▽ More

    Submitted 19 October, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Journal ref: Journal of Artificial Intelligence Research, 77, 1591-1636 (2023)

  2. arXiv:2009.02874  [pdf

    cs.LG eess.SY math.DS stat.ML

    Dynamically Computing Adversarial Perturbations for Recurrent Neural Networks

    Authors: Shankar A. Deka, Dušan M. Stipanović, Claire J. Tomlin

    Abstract: Convolutional and recurrent neural networks have been widely employed to achieve state-of-the-art performance on classification tasks. However, it has also been noted that these networks can be manipulated adversarially with relative ease, by carefully crafted additive perturbations to the input. Though several experimentally established prior works exist on crafting and defending against attacks,… ▽ More

    Submitted 6 September, 2020; originally announced September 2020.

    Comments: Submitted to IEEE Transactions on Neural Networks and Learning Systems

    MSC Class: 68T07; 93B52; 93C10; 49N90 ACM Class: I.2.8

  3. arXiv:2006.13208  [pdf, other

    cs.RO cs.AI cs.HC cs.LG stat.ML

    Feature Expansive Reward Learning: Rethinking Human Input

    Authors: Andreea Bobu, Marius Wiggert, Claire Tomlin, Anca D. Dragan

    Abstract: When a person is not satisfied with how a robot performs a task, they can intervene to correct it. Reward learning methods enable the robot to adapt its reward function online based on such human input, but they rely on handcrafted features. When the correction cannot be explained by these features, recent work in deep Inverse Reinforcement Learning (IRL) suggests that the robot could ask for task… ▽ More

    Submitted 12 January, 2021; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: 13 pages, 14 figures

  4. arXiv:2004.02766  [pdf, other

    cs.LG math.DS math.OC stat.ML

    Technical Report: Adaptive Control for Linearizable Systems Using On-Policy Reinforcement Learning

    Authors: Tyler Westenbroek, Eric Mazumdar, David Fridovich-Keil, Valmik Prabhu, Claire J. Tomlin, S. Shankar Sastry

    Abstract: This paper proposes a framework for adaptively learning a feedback linearization-based tracking controller for an unknown system using discrete-time model-free policy-gradient parameter update rules. The primary advantage of the scheme over standard model-reference adaptive control techniques is that it does not require the learned inverse model to be invertible at all instances of time. This enab… ▽ More

    Submitted 6 April, 2020; originally announced April 2020.

  5. arXiv:1910.04332  [pdf, other

    cs.LG cs.RO eess.SY stat.ML

    Sparse tree search optimality guarantees in POMDPs with continuous observation spaces

    Authors: Michael H. Lim, Claire J. Tomlin, Zachary N. Sunberg

    Abstract: Partially observable Markov decision processes (POMDPs) with continuous state and observation spaces have powerful flexibility for representing real-world decision and control problems but are notoriously difficult to solve. Recent online sampling-based algorithms that use observation likelihood weighting have shown unprecedented effectiveness in domains with continuous observation spaces. However… ▽ More

    Submitted 5 June, 2023; v1 submitted 9 October, 2019; originally announced October 2019.

  6. arXiv:1902.08594  [pdf, other

    eess.SY cs.LG cs.MA stat.ML

    Regression-based Inverter Control for Decentralized Optimal Power Flow and Voltage Regulation

    Authors: Oscar Sondermeijer, Roel Dobbe, Daniel Arnold, Claire Tomlin, Tamás Keviczky

    Abstract: Electronic power inverters are capable of quickly delivering reactive power to maintain customer voltages within operating tolerances and to reduce system losses in distribution grids. This paper proposes a systematic and data-driven approach to determine reactive power inverter output as a function of local measurements in a manner that obtains near optimal results. First, we use a network model… ▽ More

    Submitted 20 February, 2019; originally announced February 2019.

    Comments: Cite as: Oscar Sondermeijer, Roel Dobbe, Daniel Arnold, Claire Tomlin and Tamás Keviczky, "Regression-based Inverter Control for Decentralized Optimal Power Flow and Voltage Regulation", IEEE Power & Energy Society General Meeting, Boston, July 2016

  7. arXiv:1902.07247  [pdf, other

    cs.LG stat.ML

    Fast Neural Network Verification via Shadow Prices

    Authors: Vicenc Rubies-Royo, Roberto Calandra, Dusan M. Stipanovic, Claire Tomlin

    Abstract: To use neural networks in safety-critical settings it is paramount to provide assurances on their runtime operation. Recent work on ReLU networks has sought to verify whether inputs belonging to a bounded box can ever yield some undesirable output. Input-splitting procedures, a particular type of verification mechanism, do so by recursively partitioning the input set into smaller sets. The efficie… ▽ More

    Submitted 21 June, 2021; v1 submitted 19 February, 2019; originally announced February 2019.

  8. arXiv:1809.10611  [pdf, other

    cs.LG cs.RO stat.ML

    A Successive-Elimination Approach to Adaptive Robotic Sensing

    Authors: Esther Rolf, David Fridovich-Keil, Max Simchowitz, Benjamin Recht, Claire Tomlin

    Abstract: We study an adaptive source seeking problem, in which a mobile robot must identify the strongest emitter(s) of a signal in an environment with background emissions. Background signals may be highly heterogeneous and can mislead algorithms that are based on receding horizon control. We propose AdaSearch, a general algorithm for adaptive source seeking in the face of heterogeneous background noise.… ▽ More

    Submitted 23 June, 2020; v1 submitted 27 September, 2018; originally announced September 2018.

    Journal ref: IEEE Transactions on Robotics Research, 2020

  9. arXiv:1806.06790  [pdf, other

    cs.LG cs.AI cs.IT eess.SY math.OC stat.ML

    Towards Distributed Energy Services: Decentralizing Optimal Power Flow with Machine Learning

    Authors: Roel Dobbe, Oscar Sondermeijer, David Fridovich-Keil, Daniel Arnold, Duncan Callaway, Claire Tomlin

    Abstract: The implementation of optimal power flow (OPF) methods to perform voltage and power flow regulation in electric networks is generally believed to require extensive communication. We consider distribution systems with multiple controllable Distributed Energy Resources (DERs) and present a data-driven approach to learn control policies for each DER to reconstruct and mimic the solution to a centrali… ▽ More

    Submitted 13 August, 2019; v1 submitted 14 June, 2018; originally announced June 2018.

    Comments: Accepted for publication. To appear in the IEEE Transactions on Smart Grid

  10. arXiv:1711.05928  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Budget-Constrained Multi-Armed Bandits with Multiple Plays

    Authors: Datong P. Zhou, Claire J. Tomlin

    Abstract: We study the multi-armed bandit problem with multiple plays and a budget constraint for both the stochastic and the adversarial setting. At each round, exactly $K$ out of $N$ possible arms have to be played (with $1\leq K \leq N$). In addition to observing the individual rewards for each arm played, the player also learns a vector of costs which has to be covered with an a-priori defined budget… ▽ More

    Submitted 16 November, 2017; originally announced November 2017.

    Comments: 20 pages

  11. arXiv:1609.09660  [pdf, other

    eess.SY cs.LG stat.ML

    On Identification of Sparse Multivariable ARX Model: A Sparse Bayesian Learning Approach

    Authors: J. **, Y. Yuan, W. Pan, D. L. T. Pham, C. J. Tomlin, A. Webb, J. Goncalves

    Abstract: This paper begins with considering the identification of sparse linear time-invariant networks described by multivariable ARX models. Such models possess relatively simple structure thus used as a benchmark to promote further research. With identifiability of the network guaranteed, this paper presents an identification method that infers both the Boolean structure of the network and the internal… ▽ More

    Submitted 30 September, 2016; originally announced September 2016.