Skip to main content

Showing 1–3 of 3 results for author: Dunovan, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:1809.09147  [pdf, other

    cs.LG cs.AI stat.ML

    Better Safe than Sorry: Evidence Accumulation Allows for Safe Reinforcement Learning

    Authors: Akshat Agarwal, Abhinau Kumar V, Kyle Dunovan, Erik Peterson, Timothy Verstynen, Katia Sycara

    Abstract: In the real world, agents often have to operate in situations with incomplete information, limited sensing capabilities, and inherently stochastic environments, making individual observations incomplete and unreliable. Moreover, in many situations it is preferable to delay a decision rather than run the risk of making a bad decision. In such situations it is necessary to aggregate information befo… ▽ More

    Submitted 24 September, 2018; originally announced September 2018.

    Comments: 8 pages, 3 figures. Code available at https://github.com/agakshat/evidence-accumulation

  2. arXiv:1809.03406  [pdf, other

    cs.AI

    Combining imagination and heuristics to learn strategies that generalize

    Authors: Erik J Peterson, Necati Alp Müyesser, Timothy Verstynen, Kyle Dunovan

    Abstract: Deep reinforcement learning can match or exceed human performance in stable contexts, but with minor changes to the environment artificial networks, unlike humans, often cannot adapt. Humans rely on a combination of heuristics to simplify computational load and imagination to extend experiential learning to new and more challenging environments. Motivated by theories of the hierarchical organizati… ▽ More

    Submitted 11 June, 2020; v1 submitted 10 September, 2018; originally announced September 2018.

  3. arXiv:1801.06689  [pdf, other

    cs.AI

    Learning model-based strategies in simple environments with hierarchical q-networks

    Authors: Necati Alp Muyesser, Kyle Dunovan, Timothy Verstynen

    Abstract: Recent advances in deep learning have allowed artificial agents to rival human-level performance on a wide range of complex tasks; however, the ability of these networks to learn generalizable strategies remains a pressing challenge. This critical limitation is due in part to two factors: the opaque information representation in deep neural networks and the complexity of the task environments in w… ▽ More

    Submitted 20 January, 2018; originally announced January 2018.