Risk Averse Bayesian Reward Learning for Autonomous Navigation from Human Demonstration

Christian Ellis; Maggie Wigness; John Rogers; Craig Lennon; Lance Fiondella

doi:10.1109/IROS51168.2021.9635835

Back

Conference proceeding

Risk Averse Bayesian Reward Learning for Autonomous Navigation from Human Demonstration

Christian Ellis, Maggie Wigness, John Rogers, Craig Lennon and Lance Fiondella

2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp.8928-8935

09/27/2021

DOI: https://doi.org/10.1109/IROS51168.2021.9635835

Abstract

Bayes methods

Robot sensing systems

Three-dimensional displays

Training

Training data

Uncertainty

Semantics

Traditional imitation learning provides a set of methods and algorithms to learn a reward function or policy from expert demonstrations. Learning from demonstration has been shown to be advantageous for navigation tasks as it allows for machine learning non-experts to quickly provide information needed to learn complex traversal behaviors. However, a minimal set of demonstrations is unlikely to capture all relevant information needed to achieve the desired behavior in every possible future operational environment. Due to distributional shift among environments, a robot may encounter features that were rarely or never observed during training for which the appropriate reward value is uncertain, leading to undesired outcomes. This paper proposes a Bayesian technique which quantifies uncertainty over the weights of a linear reward function given a dataset of minimal human demonstrations to operate safely in dynamic environments. This uncertainty is quantified and incorporated into a risk averse set of weights used to generate cost maps for planning. Experiments in a 3-D environment with a simulated robot show that our proposed algorithm enables a robot to avoid dangerous terrain completely in two out of three test scenarios and accumulates a lower amount of risk than related approaches in all scenarios without requiring any additional demonstrations.

Metrics

5 Record Views

Details

Title: Risk Averse Bayesian Reward Learning for Autonomous Navigation from Human Demonstration
Creators: Christian Ellis - University of Massachusetts Dartmouth
Maggie Wigness - DEVCOM Army Research Laboratory
John Rogers - DEVCOM Army Research Laboratory
Craig Lennon - DEVCOM Army Research Laboratory
Lance Fiondella - University of Massachusetts Dartmouth
Publication Details: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp.8928-8935
Publisher: IEEE
Number of pages: 8
Grant note: Army Research Laboratory (10.13039/100006754)
Academic Unit: Department of Electrical and Computer Engineering
Language: English
Resource Type: Conference proceeding
ISBN: 1665417145; 9781665417143; 1665417145; 9781665417143
DOI: https://doi.org/10.1109/IROS51168.2021.9635835
Record Identifier: 9914432397901301

Risk Averse Bayesian Reward Learning for Autonomous Navigation from Human Demonstration

Abstract

Related links

Metrics

Related content

Details