Showing 1–3 of 3 results for author: Alcedo, K

  1. arXiv:2206.07520  [pdf, other]

    cs.GT cs.LG

    Principal Trade-off Analysis

    Authors: Alexander Strang, David SeWell, Alexander Kim, Kevin Alcedo, David Rosenbluth

    Abstract: How are the advantage relations between a set of agents playing a game organized and how do they reflect the structure of the game? In this paper, we illustrate "Principal Trade-off Analysis" (PTA), a decomposition method that embeds games into a low-dimensional feature space. We argue that the embeddings are more revealing than previously demonstrated by developing an analogy to Principal Compone…

    Submitted 16 August, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 17 pages, 8 figures

  2. arXiv:2202.02918  [pdf, other]

    cs.LG cs.AI cs.NE

    Soft Actor-Critic with Inhibitory Networks for Faster Retraining

    Authors: Jaime S. Ide, Daria Mićović, Michael J. Guarino, Kevin Alcedo, David Rosenbluth, Adrian P. Pope

    Abstract: Reusing previously trained models is critical in deep reinforcement learning to speed up training of new agents. However, it is unclear how to acquire new skills when objectives and constraints are in conflict with previously learned skills. Moreover, when retraining, there is an intrinsic conflict between exploiting what has already been learned and exploring new skills. In soft actor-critic (SAC…

    Submitted 7 February, 2022; v1 submitted 6 February, 2022; originally announced February 2022.

    Comments: 16 pages including Appendix

  3. arXiv:2105.00990  [pdf, other]

    cs.LG

    Hierarchical Reinforcement Learning for Air-to-Air Combat

    Authors: Adrian P. Pope, Jaime S. Ide, Daria Micovic, Henry Diaz, David Rosenbluth, Lee Ritholtz, Jason C. Twedt, Thayne T. Walker, Kevin Alcedo, Daniel Javorsek

    Abstract: Artificial Intelligence (AI) is becoming a critical component in the defense industry, as recently demonstrated by DARPA's AlphaDogfight Trials (ADT). ADT sought to vet the feasibility of AI algorithms capable of piloting an F-16 in simulated air-to-air combat. As a participant in ADT, Lockheed Martin's (LM) approach combines a hierarchical architecture with maximum-entropy reinforcement learning…

    Submitted 11 June, 2021; v1 submitted 3 May, 2021; originally announced May 2021.

    Comments: 10 pages, 10 figures, The 2021 International Conference on Unmanned Aircraft System (ICUAS 21), June 15-18, 2021, Athens, Greece