Showing 1–5 of 5 results for author: Valasek, J

Searching in archive cs.
  1. arXiv:2102.13008  [pdf, other]

    cs.LG cs.HC cs.RO

    Imitation Learning with Human Eye Gaze via Multi-Objective Prediction

    Authors: Ravi Kumar Thakur, MD-Nazmus Samin Sunbeam, Vinicius G. Goecks, Ellen Novoseller, Ritwik Bera, Vernon J. Lawhern, Gregory M. Gremillion, John Valasek, Nicholas R. Waytowich

    Abstract: Approaches for teaching learning agents via human demonstrations have been widely studied and successfully applied to multiple domains. However, the majority of imitation learning work utilizes only behavioral information from the demonstrator, i.e. which actions were taken, and ignores other useful information. In particular, eye gaze information can give valuable insight towards where the demons…

    Submitted 22 July, 2023; v1 submitted 25 February, 2021; originally announced February 2021.

    Comments: Paper accepted and selected as an oral presentation at Interactive Learning with Implicit Human Feedback Workshop at ICML 2023

    ACM Class: I.2.6; I.2.9; I.2.10

  2. arXiv:2003.12638  [pdf, other]

    cs.CV cs.LG eess.IV

    Combining Visible and Infrared Spectrum Imagery using Machine Learning for Small Unmanned Aerial System Detection

    Authors: Vinicius G. Goecks, Grayson Woods, John Valasek

    Abstract: Advances in machine learning and deep neural networks for object detection, coupled with lower cost and power requirements of cameras, led to promising vision-based solutions for sUAS detection. However, solely relying on the visible spectrum has previously led to reliability issues in low contrast scenarios such as sUAS flying below the treeline and against bright sources of light. Alternatively,…

    Submitted 2 April, 2020; v1 submitted 27 March, 2020; originally announced March 2020.

    Comments: Project page: https://sites.google.com/view/tamudrone-spie2020/

  3. arXiv:1911.00171  [pdf, other]

    cs.LG cs.AI stat.ML

    PODNet: A Neural Network for Discovery of Plannable Options

    Authors: Ritwik Bera, Vinicius G. Goecks, Gregory M. Gremillion, John Valasek, Nicholas R. Waytowich

    Abstract: Learning from demonstration has been widely studied in machine learning but becomes challenging when the demonstrated trajectories are unstructured and follow different objectives. This short-paper proposes PODNet, Plannable Option Discovery Network, addressing how to segment an unstructured set of demonstrated trajectories for option discovery. This enables learning from demonstration to perform…

    Submitted 28 February, 2020; v1 submitted 31 October, 2019; originally announced November 2019.

    ACM Class: I.2.0; I.2.6

  4. arXiv:1910.04281  [pdf, other]

    cs.LG cs.AI stat.ML

    Integrating Behavior Cloning and Reinforcement Learning for Improved Performance in Dense and Sparse Reward Environments

    Authors: Vinicius G. Goecks, Gregory M. Gremillion, Vernon J. Lawhern, John Valasek, Nicholas R. Waytowich

    Abstract: This paper investigates how to efficiently transition and update policies, trained initially with demonstrations, using off-policy actor-critic reinforcement learning. It is well-known that techniques based on Learning from Demonstrations, for example behavior cloning, can lead to proficient policies given limited data. However, it is currently unclear how to efficiently update that policy using r…

    Submitted 3 April, 2020; v1 submitted 9 October, 2019; originally announced October 2019.

    Comments: 9 pages, 5 Figures. AAMAS 2020

  5. arXiv:1810.11545  [pdf, other]

    cs.AI cs.HC cs.RO

    Efficiently Combining Human Demonstrations and Interventions for Safe Training of Autonomous Systems in Real-Time

    Authors: Vinicius G. Goecks, Gregory M. Gremillion, Vernon J. Lawhern, John Valasek, Nicholas R. Waytowich

    Abstract: This paper investigates how to utilize different forms of human interaction to safely train autonomous systems in real-time by learning from both human demonstrations and interventions. We implement two components of the Cycle-of-Learning for Autonomous Systems, which is our framework for combining multiple modalities of human interaction. The current effort employs human demonstrations to teach a…

    Submitted 28 November, 2018; v1 submitted 26 October, 2018; originally announced October 2018.

    Comments: 9 pages, 6 figures