An automated approach to optimizing the number of hidden neurons in partially connected artificial neural networks: a thesis in Computer Science

Shannon Elizabeth Gibbs

doi:10.62791/20278

Back

Thesis

Open access

An automated approach to optimizing the number of hidden neurons in partially connected artificial neural networks: a thesis in Computer Science

Shannon Elizabeth Gibbs

Master of Science (MS), University of Massachusetts Dartmouth

2023

DOI:

https://doi.org/10.62791/20278

Abstract

Despite the tremendous advances in artificial neural networks, a systematic and efficient method for hyperparameter tuning, including optimizing the number of hidden neurons, is still lacking. Today, the method for determining the appropriate number of hidden neurons involves trial and error, i.e., randomly selecting the number of hidden neurons, looking for possible overfitting or underfitting, and adjusting the hidden neurons accordingly. There are guidelines that can be used to help approach the optimal number of hidden neurons, such as using a number between the size of the input and output layers or using two-thirds the size of the input layer plus the output layer. However, they still need to be tested, trained, and the number of neurons must be adjusted by trial and error. The method proposed in this thesis uses an automated approach to optimizing the number of hidden neurons in a partially connected neural network (PCNN). The method starts with a fully connected neural network (FCNN) with a small number of hidden neurons, and then increases the number of hidden neurons until the accuracy starts to decrease due to overfitting. At this point, a certain percentage of weak links and weak neurons are identified and removed from the FCNN, resulting in a PCNN. We continue to fine-tune the number of hidden neurons in the PCNN until an optimal number of hidden neurons is reached, while accuracy decreases but remains above a threshold value as a performance requirement. To illustrate the feasibility and effectiveness of our method, we used three datasets to show that our PCNN-based approach can efficiently find the optimal number of hidden neurons without significant accuracy loss. The results also show that the use of PCNN results in a modest overhead and a shorter training time than the FCNN-only approach.

Files and links (1)

pdf

Gibbs S.E. COE MS Thesis 2023779.64 kBDownload View

CC BY-NC-ND V4.0, Open Access

Metrics

6 File views/ downloads

17 Record Views

Details

Title: An automated approach to optimizing the number of hidden neurons in partially connected artificial neural networks
Creators: Shannon Elizabeth Gibbs
ORCID: 0009-0006-1803-1229
Contributors: Haiping Xu (Advisor) - University of Massachusetts Dartmouth, Department of Computer and Information Science
Clinton Louis Rogers (Committee Member) - University of Massachusetts Dartmouth, Department of Computer and Information Science
Ashokkumar Ratilal Patel (Committee Member) - University of Massachusetts Dartmouth, Department of Computer and Information Science
Number of pages: ix, 67 pages
Illustrations: illustrations (some color)
Table of contents: List of figures -- List of tables -- Abbreviations -- Chapter 1. Introduction -- Chapter 2. Related work -- Trial and error methods -- Deconstructive methods -- Constructive methods -- Chapter 3. Automated approach to optimizing the number of hidden neurons -- Removing weak links from the neural network -- Removing weak neurons from the neural network -- Removing a specific number of links and neurons from an ANN -- Finding the optimal number of hidden neurons -- Chapter 4. Implementation of a neural network in Java -- Artificial neural network structure -- Backpropagation algorithm -- Neural network functions -- Chapter 5. Case study -- Breast cancer dataset -- Estimation of obesity levels based on eating habits and physical condition dataset -- Thyroid disease dataset -- Chapter 6. Conclusion and future work -- References.
References: Includes bibliographical references (pages 66-67).
Awarding Institution: University of Massachusetts Dartmouth
Degree Awarded: Master of Science (MS)
Degree in: Computer Science
Academic Unit: Department of Computer and Information Science
Language: English
Resource Type: Thesis
DOI: https://doi.org/10.62791/20278
Record Identifier: 9914424800201301