
Showing 1–16 of 16 results for author: Gaudet, B

  1. arXiv:2310.18509 [pdf]

    cs.AI cs.LG

    Deep Reinforcement Learning for Weapons to Targets Assignment in a Hypersonic strike

    Authors: Brian Gaudet, Kris Drozd, Roberto Furfaro

    Abstract: We use deep reinforcement learning (RL) to optimize a weapons to target assignment (WTA) policy for multi-vehicle hypersonic strike against multiple targets. The objective is to maximize the total value of destroyed targets in each episode. Each randomly generated episode varies the number and initial conditions of the hypersonic strike weapons (HSW) and targets, the value distribution of the targ…

    Submitted 27 October, 2023; originally announced October 2023.

  2. arXiv:2205.00085 [pdf]

    cs.RO eess.SY

    Line of Sight Curvature for Missile Guidance using Reinforcement Meta-Learning

    Authors: Brian Gaudet, Roberto Furfaro

    Abstract: We use reinforcement meta learning to optimize a line of sight curvature policy that increases the effectiveness of a guidance system against maneuvering targets. The policy is implemented as a recurrent neural network that maps navigation system outputs to a Euler 321 attitude representation. The attitude representation is then used to construct a direction cosine matrix that biases the observed…

    Submitted 29 April, 2022; originally announced May 2022.

    Comments: Submitted to 2023 Scitech Guidance and Control conference. arXiv admin note: substantial text overlap with arXiv:2109.03880; text overlap with arXiv:2004.09978

  3. arXiv:2112.08540 [pdf]

    eess.SY cs.AI cs.RO

    Integrated Guidance and Control for Lunar Landing using a Stabilized Seeker

    Authors: Brian Gaudet, Roberto Furfaro

    Abstract: We develop an integrated guidance and control system that in conjunction with a stabilized seeker and landing site detection software can achieve precise and safe planetary landing. The seeker tracks the designated landing site by adjusting seeker elevation and azimuth angles to center the designated landing site in the sensor field of view. The seeker angles, closing speed, and range to the desig…

    Submitted 15 December, 2021; originally announced December 2021.

    Comments: Accepted for 2022 AIAA Scitech GN&C. arXiv admin note: text overlap with arXiv:2107.14764, arXiv:2004.09978, arXiv:2110.00634, arXiv:2109.03880

  4. Terminal Adaptive Guidance for Autonomous Hypersonic Strike Weapons via Reinforcement Learning

    Authors: Brian Gaudet, Roberto Furfaro

    Abstract: An adaptive guidance system suitable for the terminal phase trajectory of a hypersonic strike weapon is optimized using reinforcement meta learning. The guidance system maps observations directly to commanded bank angle, angle of attack, and sideslip angle rates. Importantly, the observations are directly measurable from radar seeker outputs with minimal processing. The optimization framework impl…

    Submitted 16 October, 2021; v1 submitted 1 October, 2021; originally announced October 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2107.14764; text overlap with arXiv:2109.03880

  5. Integrated and Adaptive Guidance and Control for Endoatmospheric Missiles via Reinforcement Learning

    Authors: Brian Gaudet, Roberto Furfaro

    Abstract: We apply a reinforcement meta-learning framework to optimize an integrated and adaptive guidance and flight control system for an air-to-air missile. The system is implemented as a policy that maps navigation system outputs directly to commanded rates of change for the missile's control surface deflections. The system induces intercept trajectories against a maneuvering target that satisfy control…

    Submitted 3 May, 2022; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: Preprint for 2023 Scitech GN&C submission

  6. arXiv:2107.14764 [pdf]

    cs.RO cs.AI

    Adaptive Approach Phase Guidance for a Hypersonic Glider via Reinforcement Meta Learning

    Authors: Brian Gaudet, Kris Drozd, Ryan Meltzer, Roberto Furfaro

    Abstract: We use Reinforcement Meta Learning to optimize an adaptive guidance system suitable for the approach phase of a gliding hypersonic vehicle. Adaptability is achieved by optimizing over a range of off-nominal flight conditions including perturbation of aerodynamic coefficient parameters, actuator failure scenarios, and sensor noise. The system maps observations directly to commanded bank angle and a…

    Submitted 30 July, 2021; originally announced July 2021.

    Comments: Under review for 2021 AIAA scitech GN&C conference

  7. arXiv:2009.00975 [pdf, other]

    eess.SY

    Adaptive Scale Factor Compensation for Missiles with Strapdown Seekers via Predictive Coding

    Authors: Brian Gaudet

    Abstract: In this work we present a method to adaptively compensate for scale factor errors in both rotational velocity and seeker angle measurements. The adaptation scheme estimates the scale factor errors using a predictive coding model implemented as a deep neural network with recurrent layer, and then uses these estimates to compensate for the error. During training, the model learns over a wide range o…

    Submitted 15 September, 2020; v1 submitted 31 August, 2020; originally announced September 2020.

    Comments: arXiv admin note: text overlap with arXiv:2004.09978

  8. arXiv:2004.09978 [pdf, other]

    eess.SY cs.LG

    Reinforcement Meta-Learning for Interception of Maneuvering Exoatmospheric Targets with Parasitic Attitude Loop

    Authors: Brian Gaudet, Roberto Furfaro, Richard Linares, Andrea Scorsoglio

    Abstract: We use Reinforcement Meta-Learning to optimize an adaptive integrated guidance, navigation, and control system suitable for exoatmospheric interception of a maneuvering target. The system maps observations consisting of strapdown seeker angles and rate gyro measurements directly to thruster on-off commands. Using a high fidelity six degree-of-freedom simulator, we demonstrate that the optimized po…

    Submitted 18 April, 2020; originally announced April 2020.

    Comments: Under Consideration for publication in Journal of Spacecraft and Rockets. arXiv admin note: text overlap with arXiv:1906.02113

  9. Six Degree-of-Freedom Body-Fixed Hovering over Unmapped Asteroids via LIDAR Altimetry and Reinforcement Meta-Learning

    Authors: Brian Gaudet, Richard Linares, Roberto Furfaro

    Abstract: We optimize a six degrees of freedom hovering policy using reinforcement meta-learning. The policy maps flash LIDAR measurements directly to on/off spacecraft body-frame thrust commands, allowing hovering at a fixed position and attitude in the asteroid body-fixed reference frame. Importantly, the policy does not require position and velocity estimates, and can operate in environments with unknown…

    Submitted 8 February, 2020; v1 submitted 15 November, 2019; originally announced November 2019.

    Comments: Earlier version presented at 2020 AIAA Scitech conference. arXiv admin note: substantial text overlap with arXiv:1907.06098, arXiv:1906.02113

  10. arXiv:1907.13188 [pdf, other]

    cs.SD cs.LG eess.AS stat.ML

    Marine Mammal Species Classification using Convolutional Neural Networks and a Novel Acoustic Representation

    Authors: Mark Thomas, Bruce Martin, Katie Kowarski, Briand Gaudet, Stan Matwin

    Abstract: Research into automated systems for detecting and classifying marine mammals in acoustic recordings is expanding internationally due to the necessity to analyze large collections of data for conservation purposes. In this work, we present a Convolutional Neural Network that is capable of classifying the vocalizations of three species of whales, non-biological sources of noise, and a fifth class pe…

    Submitted 30 July, 2019; originally announced July 2019.

    Comments: 16 pages, To appear in ECML-PKDD 2019

  11. arXiv:1907.06098 [pdf, other]

    eess.SY astro-ph.IM cs.LG

    Seeker based Adaptive Guidance via Reinforcement Meta-Learning Applied to Asteroid Close Proximity Operations

    Authors: Brian Gaudet, Richard Linares, Roberto Furfaro

    Abstract: Current practice for asteroid close proximity maneuvers requires extremely accurate characterization of the environmental dynamics and precise spacecraft positioning prior to the maneuver. This creates a delay of several months between the spacecraft's arrival and the ability to safely complete close proximity maneuvers. In this work we develop an adaptive integrated guidance, navigation, and cont…

    Submitted 13 July, 2019; originally announced July 2019.

    Comments: Accepted for 2020 AAS Conference

  12. Reinforcement Learning for Angle-Only Intercept Guidance of Maneuvering Targets

    Authors: Brian Gaudet, Roberto Furfaro, Richard Linares

    Abstract: We present a novel guidance law that uses observations consisting solely of seeker line of sight angle measurements and their rate of change. The policy is optimized using reinforcement meta-learning and demonstrated in a simulated terminal phase of a mid-course exo-atmospheric interception. Importantly, the guidance law does not require range estimation, making it particularly suitable for passiv…

    Submitted 15 November, 2019; v1 submitted 5 June, 2019; originally announced June 2019.

    Comments: Also in 2020 AIAA Scitech Guidance Navigation and Control Conference

  13. Adaptive Guidance and Integrated Navigation with Reinforcement Meta-Learning

    Authors: Brian Gaudet, Richard Linares, Roberto Furfaro

    Abstract: This paper proposes a novel adaptive guidance system developed using reinforcement meta-learning with a recurrent policy and value function approximator. The use of recurrent network layers allows the deployed policy to adapt real time to environmental forces acting on the agent. We compare the performance of the DR/DV guidance law, an RL agent with a non-recurrent policy, and an RL agent with a r…

    Submitted 18 April, 2019; originally announced April 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1901.04473

  14. arXiv:1901.04473 [pdf, other]

    eess.SY cs.RO

    Adaptive Guidance with Reinforcement Meta-Learning

    Authors: Brian Gaudet, Richard Linares

    Abstract: This paper proposes a novel adaptive guidance system developed using reinforcement meta-learning with a recurrent policy and value function approximator. The use of recurrent network layers allows the deployed policy to adapt real time to environmental forces acting on the agent. We compare the performance of the DR/DV guidance law, an RL agent with a non-recurrent policy, and an RL agent with a r…

    Submitted 12 January, 2019; originally announced January 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1810.08719

  15. arXiv:1901.03895 [pdf, other]

    cs.LG eess.SY

    Learning Accurate Extended-Horizon Predictions of High Dimensional Trajectories

    Authors: Brian Gaudet, Richard Linares, Roberto Furfaro

    Abstract: We present a novel predictive model architecture based on the principles of predictive coding that enables open loop prediction of future observations over extended horizons. There are two key innovations. First, whereas current methods typically learn to make long-horizon open-loop predictions using a multi-step cost function, we instead run the model open loop in the forward pass during training…

    Submitted 12 January, 2019; originally announced January 2019.

  16. arXiv:1810.08719 [pdf, other]

    eess.SY

    Deep Reinforcement Learning for Six Degree-of-Freedom Planetary Powered Descent and Landing

    Authors: Brian Gaudet, Richard Linares, Roberto Furfaro

    Abstract: Future Mars missions will require advanced guidance, navigation, and control algorithms for the powered descent phase to target specific surface locations and achieve pinpoint accuracy (landing error ellipse $<$ 5 m radius). The latter requires both a navigation system capable of estimating the lander's state in real-time and a guidance and control system that can map the estimated lander state to…

    Submitted 19 October, 2018; originally announced October 2018.

    Comments: 37 pages