An examination of adversarial machine learning on connected and autonomous vehicles: a thesis in Computer Engineering

David M. Austin

doi:10.62791/20107

Thesis

Open access

Master of Science (MS), University of Massachusetts Dartmouth

2020

Abstract

Connected and Autonomous Vehicles (CAVs), also known as driverless cars, offer a path to safer and more efficient surface transportation. Yet CAV technology introduces new security risks beyond standard vehicle vulnerabilities. Machine Learning (ML) promises to secure CAVs by using telemetry data to validate legitimacy among CAVs as a method of Misbehavior Detection. However, most ML models rest on the mistaken assumption of a benign environment in which training and test data share the same distribution, which leaves them vulnerable to adversarial perturbations. Furthermore, limited knowledge of CAV behavior opens new attack surfaces through which adversaries can invalidate ML models. This thesis examines an attacker's mindset in fooling an ML system for Misbehavior Detection in CAVs, so that CAV defense strategies can be made robust to adversarial ML. Three software packages, OMNeT++, SUMO, and Veins, are integrated to simulate CAVs. The ML dataset comes from the Vehicular Reference Misbehavior Dataset (VeReMi), which this work extends with new attacks. The work then evaluates the accuracy of K-Nearest Neighbors and Random Forest as ML algorithms, and of a Logistic Regression Neural Network and a Recurrent Neural Network with Long Short-Term Memory (LSTM) as Deep Learning (DL) algorithms, all coded in Python. Random Walk movement theory is used to generate synthetic data following a normal distribution to further evaluate the robustness of the ML and DL models, that is, the effectiveness of spoofed data from the attacker's point of view. The work concludes that DL models perform better than ML models in most cases. It contributes to ML-based Misbehavior Detection in CAVs by exposing the challenges posed by adversarial ML and recommending DL over ML against adversaries, thereby strengthening CAV defense with robust DL models. Future work will address the limitations of DL, such as the need for large quantities of data and balanced classes.
To correct unbalanced classes, synthetic data can be generated with techniques such as SMOTE, for which Python packages are available. Undersampling the majority class or oversampling the minority class could be another solution. Pairing the models with feature analysis, such as Exploratory Data Analysis, to identify the most relevant features would help train them more effectively and accurately. Other future work includes feature engineering on the datasets and the use of real-world datasets from CAV pilot cities.

Files and links (1)

pdf

Austin D.M. COE MS Thesis 2020, 2.32 MB

CC BY-NC-ND V4.0, Open Access

Details

Title: An examination of adversarial machine learning on connected and autonomous vehicles
Creators: David M. Austin
ORCID: 0000-0003-0783-4545
Contributors: Hong Liu (Advisor) - University of Massachusetts Dartmouth, Department of Electrical and Computer Engineering
Benjamin Viall (Committee Member) - University of Massachusetts Dartmouth
Liudong Xing (Committee Member) - University of Massachusetts Dartmouth, Department of Electrical and Computer Engineering
Number of pages: xviii, 281 pages
Illustrations: illustrations (some color)
Table of contents: List of figures -- List of tables -- List of acronyms -- Chapter 1. Introduction -- Chapter 2. Background -- Driverless cars -- CAV vulnerability and security -- Machine learning -- Chapter 3. Problem statement -- Chapter 4. State of the art -- Machine learning models for misbehavior detection -- Adversarial machine learning -- Chapter 5. Traditional attacks with De Facto standard -- VeReMi architecture -- VeReMi datasets -- Extension of VeReMi dataset -- Chapter 6. Evaluation of machine learning defense models -- Data pre-processing -- ML defense models -- Validate ML defense model -- Chapter 7. Evaluation of deep learning defense models -- DL defense models -- Validate DL defense models -- Chapter 8. Applying models on synthetically generated data -- Random walk -- ML attack -- DL attack -- Chapter 9. Conclusion and future work -- Conclusion -- Future work -- References -- Appendix A VeReMi configuration and pre-processing files -- Source file setup.h -- Source file GroundTruthScript -- Source file csvCombine -- Source file csvCombine_cond -- Source file csvCombine_cond -- Source file json2csvFullConvert -- Source file json2csvCondConvert -- Source file headerFileFull.csv -- Source file headerFile.csv -- Source file GroundTruthHeader.csv -- Source file nonCleanMatlab.m -- Source file cleanMatlab.m -- Appendix B ML and DL model code and data importance -- Source file 12_7gennonandatthst19_datafeatimportance.py -- Source file 12_7att1datafeatimportance_heatmap.py -- Source file 10_28attack_2_data_featimportance.py -- Appendix C ML and DL models source-code and testing files -- Source file myhst_knn12_7.py -- Source file my_hst_rf12_7.py -- Source file 10_28hst_lstm.py -- Source file 10_28hst_lr.py -- Source file 12_7testinghst19lstm.py -- Source file 12_7testinghst19lr.py -- Source file scaled12_7att1_knn.py -- Source file 10_28att1_rf.py -- Source file att1_10_19lstm_wsmote.py -- Source file att1_10_19logregressionnscaledwsmote.py -- Source file testingAtt1LSTM12_7.py -- Source file testingAtt1LR12_7.py -- Source file scaled10_28att2_knn.py -- Source file scaled12_7att2_rf.py -- Source file 10_14att2lstm_wsmote.py -- Source file att2_10_14logregressionnscaledwsmote.py -- Source file testingAtt2LSTM12_7.py -- Source file testingAtt2LR12_7.py.
References: Includes bibliographical references (pages 86-89).
Awarding Institution: University of Massachusetts Dartmouth
Degree Awarded: Master of Science (MS)
Degree in: Computer Engineering
Academic Unit: Department of Electrical and Computer Engineering
Language: English
Resource Type: Thesis
DOI: https://doi.org/10.62791/20107
Record Identifier: 9914424879601301