Skip to main content

Showing 1–6 of 6 results for author: Simmons-Edler, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.01859  [pdf, other

    cs.CY cs.AI cs.LG cs.RO

    AI-Powered Autonomous Weapons Risk Geopolitical Instability and Threaten AI Research

    Authors: Riley Simmons-Edler, Ryan Badman, Shayne Longpre, Kanaka Rajan

    Abstract: The recent embrace of machine learning (ML) in the development of autonomous weapons systems (AWS) creates serious risks to geopolitical stability and the free exchange of ideas in AI research. This topic has received comparatively little attention of late compared to risks stemming from superintelligent artificial general intelligence (AGI), but requires fewer assumptions about the course of tech… ▽ More

    Submitted 31 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

    Comments: 9 pages, 1 figure, in ICML 2024

  2. arXiv:2108.04867  [pdf, other

    cs.RO cs.AI cs.CV cs.LG cs.SD

    AuraSense: Robot Collision Avoidance by Full Surface Proximity Detection

    Authors: Xiaoran Fan, Riley Simmons-Edler, Daewon Lee, Larry Jackel, Richard Howard, Daniel Lee

    Abstract: Perceiving obstacles and avoiding collisions is fundamental to the safe operation of a robot system, particularly when the robot must operate in highly dynamic human environments. Proximity detection using on-robot sensors can be used to avoid or mitigate impending collisions. However, existing proximity sensing methods are orientation and placement dependent, resulting in blind spots even with la… ▽ More

    Submitted 10 August, 2021; originally announced August 2021.

    Comments: Accepted to IROS 2021

  3. arXiv:2106.04499  [pdf, other

    cs.LG cs.AI

    Towards Practical Credit Assignment for Deep Reinforcement Learning

    Authors: Vyacheslav Alipov, Riley Simmons-Edler, Nikita Putintsev, Pavel Kalinin, Dmitry Vetrov

    Abstract: Credit assignment is a fundamental problem in reinforcement learning, the problem of measuring an action's influence on future rewards. Explicit credit assignment methods have the potential to boost the performance of RL algorithms on many tasks, but thus far remain impractical for general use. Recently, a family of methods called Hindsight Credit Assignment (HCA) was proposed, which explicitly as… ▽ More

    Submitted 11 February, 2022; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: 8 pages plus 8 page appendix

  4. arXiv:1906.08189  [pdf, other

    cs.LG stat.ML

    Reward Prediction Error as an Exploration Objective in Deep RL

    Authors: Riley Simmons-Edler, Ben Eisner, Daniel Yang, Anthony Bisulco, Eric Mitchell, Sebastian Seung, Daniel Lee

    Abstract: A major challenge in reinforcement learning is exploration, when local dithering methods such as epsilon-greedy sampling are insufficient to solve a given task. Many recent methods have proposed to intrinsically motivate an agent to seek novel states, driving the agent to discover improved reward. However, while state-novelty exploration methods are suitable for tasks where novel observations corr… ▽ More

    Submitted 13 January, 2021; v1 submitted 19 June, 2019; originally announced June 2019.

    Comments: Published at IJCAI 2020, camera-ready version

  5. arXiv:1903.10605  [pdf, other

    cs.AI

    Q-Learning for Continuous Actions with Cross-Entropy Guided Policies

    Authors: Riley Simmons-Edler, Ben Eisner, Eric Mitchell, Sebastian Seung, Daniel Lee

    Abstract: Off-Policy reinforcement learning (RL) is an important class of methods for many problem domains, such as robotics, where the cost of collecting data is high and on-policy methods are consequently intractable. Standard methods for applying Q-learning to continuous-valued action domains involve iteratively sampling the Q-function to find a good action (e.g. via hill-climbing), or by learning a poli… ▽ More

    Submitted 1 July, 2019; v1 submitted 25 March, 2019; originally announced March 2019.

  6. arXiv:1806.02932  [pdf, other

    cs.AI cs.LG cs.NE cs.PL

    Program Synthesis Through Reinforcement Learning Guided Tree Search

    Authors: Riley Simmons-Edler, Anders Miltner, Sebastian Seung

    Abstract: Program Synthesis is the task of generating a program from a provided specification. Traditionally, this has been treated as a search problem by the programming languages (PL) community and more recently as a supervised learning problem by the machine learning community. Here, we propose a third approach, representing the task of synthesizing a given program as a Markov decision process solvable v… ▽ More

    Submitted 7 June, 2018; originally announced June 2018.

    Comments: 9 pages, 5 figures, Submitted to NIPS 2018 conference