A model free deep reinforcement learning approach to autonomous undersea vehicle control with mixed numerical precision: a dissertation in Engineering and Applied Science

Christopher J. Hixenbaugh

doi:10.62791/19726

Back

Dissertation

Open access

A model free deep reinforcement learning approach to autonomous undersea vehicle control with mixed numerical precision: a dissertation in Engineering and Applied Science

Christopher J. Hixenbaugh

Doctor of Philosophy (PHD), University of Massachusetts Dartmouth

2023

DOI:

https://doi.org/10.62791/19726

Abstract

Deep Reinforcement Learning (RL) shows promising results for control problems with continuous action spaces. A drawback to Deep RL is that it can be very computationally intensive; this is particularly concerning when considering fielding Deep RL applications on computational and power-constrained edge computing hardware typically implemented onboard autonomous vehicle platforms. Another drawback to using Deep RL to learn optimal control strategies is that Deep RL agents can learn control strategies that exhibit high frequency and amplitude oscillations, which can negatively affect performance and cause damage to real-world systems. The first part of this thesis focuses on improving the computational efficiency of the Deep Deterministic Policy Gradient (DDPG) algorithm using mixed numerical precision methods. Mixed numerical precision methods are an active research area that is helping to make progress toward improving the computational efficiency of Deep Learning methods. While mixed-precision approaches are well understood for supervised learning tasks, this area is relatively unexplored for Deep RL. We aim to fill this gap in the research by presenting a method to improve the computational efficiency of the DDPG algorithm using mixed numerical precision and loss scaling. Then this thesis presents a numerical study investigating the impact of different neural network architectures on oscillations in the control signals output by DDPG agents when used for a complex continuous control problem. The neural network architectures considered in this study are commonly used in Deep RL literature. This study will first present numerical cases to compare the performance and computational improvements of DDPG agents trained with mixed-precision to those trained with single-precision in the context of continuous control of a complex Autonomous Undersea Vehicle model for various levels of the control system and Deep RL model complexity. Then, a numerical study will be presented to examine the effects of different DDPG actor and critic neural network architectures on action selection to minimize undesirable oscillations in the control signals output by DDPG agents.

Files and links (1)

pdf

Hixenbaugh C.J. COE PhD Dissertation 20232.06 MBDownload View

CC BY-NC-ND V4.0, Open Access

Metrics

11 File views/ downloads

31 Record Views

Details

Title: A model free deep reinforcement learning approach to autonomous undersea vehicle control with mixed numerical precision
Creators: Christopher J. Hixenbaugh
ORCID: 0000-0003-3348-9052
Contributors: Alfa R.H. Heryudono (Advisor) - University of Massachusetts Dartmouth, Department of Mathematics
Eugene Chabot (Committee Member) - University of Rhode Island
Firas Khatib (Committee Member) - University of Massachusetts Dartmouth, Department of Computer and Information Science
Ming Shao (Committee Member) - University of Massachusetts Dartmouth, Department of Computer and Information Science
Scott E Field (Committee Member) - University of Massachusetts Dartmouth, Department of Mathematics
Number of pages: x, 104 pages
Illustrations: illustrations (some color)
Table of contents: List of figures -- List of tables -- Chapter 1. Introduction -- Research background -- Autonomous undersea vehicle technology -- Deep learning for autonomous undersea vehicle applications -- History of autonomous undersea vehicle simulation -- Control system simulation -- Chapter 2. Our contributions -- Chapter 3. Technical knowledge -- Reinforcement learning -- DDPG algorithm -- Mixed numerical precision -- Naval post graduate school autonomous undersea vehicle dynamics -- PID control -- Data driven control -- Chapter 4. Computationally efficient deep reinforcement learning for AUV control -- Computationally efficient deep reinforcement learning method -- Experimental setup - continuous control of the NPSAUV -- Experimental results - continuous control of the NPSAUV -- Deep RL benchmark environments -- Chapter 5. Minimizing oscillatory signals in deep reinforcement learning control of AUVs -- Abstract of technical report -- Introduction -- Problem formulation -- Experimental analysis -- Conclusions and future work -- Chapter 6. Conclusions and next steps -- References.
References: Includes bibliographical references (pages 96-104).
Awarding Institution: University of Massachusetts Dartmouth
Degree Awarded: Doctor of Philosophy (PHD)
Degree in: Engineering and Applied Science
Academic Unit: College of Engineering
Language: English
Resource Type: Dissertation
DOI: https://doi.org/10.62791/19726
Record Identifier: 9914424902501301