Reliability and performance modeling and enhancement for storage area networks: a dissertation in Electrical Engineering

Guixiang Lyu

doi:10.62791/20444

Back

Dissertation

Open access

Reliability and performance modeling and enhancement for storage area networks: a dissertation in Electrical Engineering

Guixiang Lyu

Doctor of Philosophy (PHD), University of Massachusetts Dartmouth

2025

DOI:

https://doi.org/10.62791/20444

Abstract

Storage area networks (SAN) provide an effective solution to the significant growth issue in remote data storage and access. To deliver the desired quality of service, the reliability challenges of SANs must be addressed. A major threat to SAN reliability and performance is cascading failures, where a single incident triggers a chain reaction, causing extensive damage and even crash of the entire system. In this dissertation research, we focus on overload-triggered cascading failures, where the overloading of one device (e.g., a switch) causes it to fail, reallocating its workload to other devices, which in turn become overloaded, leading to further failures in a domino effect. We first investigate the effects of data loading on the reliability of an individual switch device in SANs using the proportional-hazards model and accelerated failure time model. We then investigate the effects of loading on the reliability of an entire SAN through dynamic fault trees and binary decision diagrams-based analysis. Furthermore, to enhance SAN reliability, we design proactive load redistribution-based mitigation strategies that aim to prevent cascading failures during the specified mission time, or at least alleviate the consequence of such failures. Two triggering mechanisms, based on the overall SAN reliability and switch loading, are considered. Load-based and reliability-based node selection rules are explored. Additionally, traffic reallocation strategies are investigated to enhance SAN performance in terms of load balancing and overall response time. The performance metrics of switch utilization, switch response time, and overall response time are analyzed using Jackson queueing networks. The application and effectiveness of the proposed mitigation strategies are demonstrated and compared through detailed case studies of SANs with a mesh topology.

Files and links (1)

pdf

Lyu G. COE PhD Dissertation 20253.93 MBDownload View

Open Access CC BY-NC-ND V4.0

Metrics

11 File views/ downloads

28 Record Views

Details

Title: Reliability and performance modeling and enhancement for storage area networks
Creators: Guixiang Lyu
ORCID: 0000-0003-2587-0443
Contributors: Liudong Xing (Advisor) - University of Massachusetts Dartmouth, Department of Electrical and Computer Engineering
Hong Liu (Committee Member) - University of Massachusetts Dartmouth, Department of Electrical and Computer Engineering
Honggang Wang (Committee Member) - University of Massachusetts Dartmouth, Department of Electrical and Computer Engineering
Haining Meng (Committee Member) - Xi'an University of Technology
Number of pages: xvii, 153 pages
Illustrations: illustrations (some color)
Table of contents: Abstract -- Acknowledgements -- Table of contents -- List of figures -- List of tables -- Chapter 1. Introduction -- Reliability versus resilience -- Storage area networks -- Relevant works -- Motivations -- Organization of proposal -- Chapter 2. Background -- FT method -- BDD -- Load failure rate relationship models -- Chapter 3. Problem statement -- Chapter 4. Influence of load on reliability of SANs -- An illustrative Mesh SAN -- SAN reliability modeling and analysis -- Influence of loading on switch reliability and SAN reliability -- Summary -- Chapter 5. Load redistribution-based reliability enhancement for SANs -- Proposed reliability enhancement strategies -- An illustrative Mesh SAN -- Influence of reliability threshold on redistribution performance -- Comparison of four schemes -- Application -- Summary -- Chapter 6. Static and dynamic load-triggered cascading failure mitigation -- Introduction -- Proposed load threshold-triggered mitigation strategies -- Performance comparisons under proportional reallocation (Scheme 1-4) -- Effects of step value in dynamic threshold mitigation schemes -- Performance evaluation under inverse-proportional reallocation (Scheme 5-8) -- Effects of step value in dynamic threshold mitigation schemes -- Comparisons and discussions -- Chapter 7. Performance modeling and enhancement of storage area networks -- Introduction -- Jackson queuing network-based performance evaluation -- Utilization-driven performance enhancement strategies -- Illustrative example -- Experiments and results -- Summary -- Chapter 8. Conclusion and directions for future research -- Summary of contributions -- Directions for future research -- References.
References: Includes bibliographical references (pages 147-153).
Awarding Institution: University of Massachusetts Dartmouth
Degree Awarded: Doctor of Philosophy (PHD)
Degree in: Electrical Engineering
Academic Unit: Department of Electrical and Computer Engineering
Language: English
Resource Type: Dissertation
DOI: https://doi.org/10.62791/20444
Record Identifier: 9914444129501301