Skip to main content

Showing 1–3 of 3 results for author: Navidi, N

.
  1. arXiv:2102.02639  [pdf, other

    cs.LG cs.AI cs.HC

    Improving Reinforcement Learning with Human Assistance: An Argument for Human Subject Studies with HIPPO Gym

    Authors: Matthew E. Taylor, Nicholas Nissen, Yuan Wang, Neda Navidi

    Abstract: Reinforcement learning (RL) is a popular machine learning paradigm for game playing, robotics control, and other sequential decision tasks. However, RL agents often have long learning times with high data requirements because they begin by acting randomly. In order to better learn in complex tasks, this article argues that an external teacher can often significantly help the RL agent learn. Open… ▽ More

    Submitted 2 February, 2021; originally announced February 2021.

  2. arXiv:2006.07301  [pdf, other

    cs.AI cs.HC cs.LG cs.MA

    Human and Multi-Agent collaboration in a human-MARL teaming framework

    Authors: Neda Navidi, Francoi Chabo, Saga Kurandwa, Iv Lutigma, Vincent Robt, Gregry Szrftgr, Andea Schuh

    Abstract: Reinforcement learning provides effective results with agents learning from their observations, received rewards, and internal interactions between agents. This study proposes a new open-source MARL framework, called COGMENT, to efficiently leverage human and agent interactions as a source of learning. We demonstrate these innovations by using a designed real-time environment with unmanned aerial… ▽ More

    Submitted 1 March, 2021; v1 submitted 12 June, 2020; originally announced June 2020.

  3. arXiv:2003.04203  [pdf, other

    cs.LG cs.AI stat.ML

    Human AI interaction loop training: New approach for interactive reinforcement learning

    Authors: Neda Navidi

    Abstract: Reinforcement Learning (RL) in various decision-making tasks of machine learning provides effective results with an agent learning from a stand-alone reward function. However, it presents unique challenges with large amounts of environment states and action spaces, as well as in the determination of rewards. This complexity, coming from high dimensionality and continuousness of the environments co… ▽ More

    Submitted 9 March, 2020; originally announced March 2020.