Showing 1–9 of 9 results for author: Lowrey, K

  1. arXiv:2202.02405  [pdf, other]

    cs.LG stat.ML

    BAM: Bayes with Adaptive Memory

    Authors: Josue Nassar, Jennifer Brennan, Ben Evans, Kendall Lowrey

    Abstract: Online learning via Bayes' theorem allows new data to be continuously integrated into an agent's current beliefs. However, a naive application of Bayesian methods in non-stationary environments leads to slow adaptation and results in state estimates that may converge confidently to the wrong parameter value. A common solution when learning in changing environments is to discard/downweight past dat…

    Submitted 8 February, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: International Conference on Learning Representations (ICLR), 2022

  2. arXiv:2106.15775  [pdf, other]

    cs.LG cs.RO eess.SY

    Koopman Spectrum Nonlinear Regulators and Efficient Online Learning

    Authors: Motoya Ohnishi, Isao Ishikawa, Kendall Lowrey, Masahiro Ikeda, Sham Kakade, Yoshinobu Kawahara

    Abstract: Most modern reinforcement learning algorithms optimize a cumulative single-step cost along a trajectory. The optimized motions are often 'unnatural', representing, for example, behaviors with sudden accelerations that waste energy and lack predictability. In this work, we present a novel paradigm of controlling nonlinear systems via the minimization of the Koopman spectrum cost: a cost over the Ko…

    Submitted 2 July, 2024; v1 submitted 29 June, 2021; originally announced June 2021.

    Comments: 41 pages, 21 figures

    Journal ref: Transactions on Machine Learning Research (https://openreview.net/forum?id=thfoUZugvS), 2024

  3. arXiv:2012.06684  [pdf, other]

    cs.LG stat.ML

    Faster Policy Learning with Continuous-Time Gradients

    Authors: Samuel Ainsworth, Kendall Lowrey, John Thickstun, Zaid Harchaoui, Siddhartha Srinivasa

    Abstract: We study the estimation of policy gradients for continuous-time systems with known dynamics. By reframing policy learning in continuous time, we show that it is possible to construct a more efficient and accurate gradient estimator. The standard back-propagation through time estimator (BPTT) computes exact gradients for a crude discretization of the continuous-time system. In contrast, we approximate…

    Submitted 24 June, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

    Journal ref: L4DC 2021

  4. arXiv:2006.12466  [pdf, other]

    cs.LG cs.RO math.OC stat.ML

    Information Theoretic Regret Bounds for Online Nonlinear Control

    Authors: Sham Kakade, Akshay Krishnamurthy, Kendall Lowrey, Motoya Ohnishi, Wen Sun

    Abstract: This work studies the problem of sequential control in an unknown, nonlinear dynamical system, where we model the underlying system dynamics as an unknown function in a known Reproducing Kernel Hilbert Space. This framework yields a general setting that permits discrete and continuous control inputs as well as non-smooth, non-differentiable dynamics. Our main result, the Lower Confidence-based Con…

    Submitted 22 June, 2020; originally announced June 2020.

  5. Model-Based Generalization Under Parameter Uncertainty Using Path Integral Control

    Authors: Ian Abraham, Ankur Handa, Nathan Ratliff, Kendall Lowrey, Todd D. Murphey, Dieter Fox

    Abstract: This work addresses the problem of robot interaction in complex environments where online control and adaptation are necessary. By expanding the sample space in the free energy formulation of path integral control, we derive a natural extension to path integral control that embeds uncertainty into action and provides robustness for model-based robot planning. Our algorithm is applied to a diver…

    Submitted 4 June, 2020; originally announced June 2020.

    Journal ref: IEEE Robotics and Automation Letters (Volume: 5, Issue: 2, April 2020)

  6. arXiv:2001.07343  [pdf, other]

    cs.RO cs.AI cs.LG eess.SY

    Lyceum: An efficient and scalable ecosystem for robot learning

    Authors: Colin Summers, Kendall Lowrey, Aravind Rajeswaran, Siddhartha Srinivasa, Emanuel Todorov

    Abstract: We introduce Lyceum, a high-performance computational ecosystem for robot learning. Lyceum is built on top of the Julia programming language and the MuJoCo physics simulator, combining the ease-of-use of a high-level programming language with the performance of native C. In addition, Lyceum has a straightforward API to support parallel computation across multiple cores and machines. Overall, depen…

    Submitted 21 January, 2020; originally announced January 2020.

  7. arXiv:1811.01848  [pdf, other]

    cs.LG cs.AI cs.RO stat.ML

    Plan Online, Learn Offline: Efficient Learning and Exploration via Model-Based Control

    Authors: Kendall Lowrey, Aravind Rajeswaran, Sham Kakade, Emanuel Todorov, Igor Mordatch

    Abstract: We propose a plan online and learn offline (POLO) framework for the setting where an agent, with an internal model, needs to continually act and learn in the world. Our work builds on the synergistic relationship between local model-based control, global value function learning, and exploration. We study how local trajectory optimization can cope with approximation errors in the value function, an…

    Submitted 28 January, 2019; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: The first two authors contributed equally. Accepted at ICLR 2019. Supplementary videos available at: https://sites.google.com/view/polo-mpc

  8. arXiv:1803.10371  [pdf, other]

    cs.RO cs.LG eess.SY

    Reinforcement learning for non-prehensile manipulation: Transfer from simulation to physical system

    Authors: Kendall Lowrey, Svetoslav Kolev, Jeremy Dao, Aravind Rajeswaran, Emanuel Todorov

    Abstract: Reinforcement learning has emerged as a promising methodology for training robot controllers. However, most results have been limited to simulation due to the need for a large number of samples and the lack of automated-yet-safe data collection methods. Model-based reinforcement learning methods provide an avenue to circumvent these challenges, but the traditional concern has been the mismatch bet…

    Submitted 27 March, 2018; originally announced March 2018.

    Comments: Accepted at IEEE SIMPAR 2018. Project page: https://sites.google.com/view/phantomsim2real

  9. arXiv:1703.02660  [pdf, other]

    cs.LG cs.AI cs.RO eess.SY

    Towards Generalization and Simplicity in Continuous Control

    Authors: Aravind Rajeswaran, Kendall Lowrey, Emanuel Todorov, Sham Kakade

    Abstract: This work shows that policies with simple linear and RBF parameterizations can be trained to solve a variety of continuous control tasks, including the OpenAI gym benchmarks. The performance of these trained policies is competitive with state-of-the-art results obtained with more elaborate parameterizations such as fully connected neural networks. Furthermore, existing training and testing scena…

    Submitted 20 March, 2018; v1 submitted 7 March, 2017; originally announced March 2017.

    Comments: NIPS 2017. Project page: https://sites.google.com/view/simple-pol