Developing a scalable solution for storing on-chain big data in a consortium blockchain network: a thesis in Computer Science

Marcos A. Felipe

doi:10.62791/20297

Back

Thesis

Open access

Developing a scalable solution for storing on-chain big data in a consortium blockchain network: a thesis in Computer Science

Marcos A. Felipe

Master of Science (MS), University of Massachusetts Dartmouth

2023

DOI:

https://doi.org/10.62791/20297

Abstract

The concern for blockchain scalability is the main reason for many studies on consortium blockchain storage management. However, most of the proposed solutions use various off-chain storage strategies, such as InterPlanetary File System and cloud storage. Although off-chain approaches can mitigate the scalability issues of blockchain storage, the benefits of using blockchain technology are compromised when the data is moved off the chain and new issues regarding the security and maintainability of off-chain data can be introduced. In this thesis, we propose a novel scalable storage solution for a consortium blockchain network to manage blockchain data. To reduce the storage burden of most peers in a blockchain network, we establish network nodes as super peers or regular peers, where super peers have greater resources and computing power. In our approach, regular peers maintain only a lightweight blockchain, called the current blockchain, which can be split and transfer the old data to a historical blockchain, thereby reducing the size of the current blockchain by half. When the current blockchain have grown after a given period of time, it can be split again, generating multiple historical blockchains. The current blockchain and the historical blockchains are maintained by super peers in the network; while regular peers can retrieve historical data by making queries to the super peers. We present procedures for generating historical blockchains, dynamically balancing the data retrieval workload of super peers, and concurrently retrieving historical blockchain data in response to queries. To demonstrate the feasibility and effectiveness of our approach, we provide a case study of storing healthcare big data using a consortium blockchain. The simulation results show that our scalable storage solution supports efficient access and sharing of big data on the chain for a consortium blockchain network.

Files and links (1)

pdf

Felipe M.A. COE MS Thesis 2023718.62 kBDownload View

CC BY-NC-ND V4.0, Open Access

Metrics

4 File views/ downloads

14 Record Views

Details

Title: Developing a scalable solution for storing on-chain big data in a consortium blockchain network
Creators: Marcos A. Felipe
ORCID: 0009-0007-1952-7530
Contributors: Haiping Xu (Advisor) - University of Massachusetts Dartmouth, Department of Computer and Information Science
Firas Khatib (Committee Member) - University of Massachusetts Dartmouth, Department of Computer and Information Science
Clinton Louis Rogers (Committee Member) - University of Massachusetts Dartmouth, Department of Computer and Information Science
Number of pages: x, 52 pages
Illustrations: illustrations (some color)
Table of contents: List of figures -- List of tables -- Abbreviations -- Chapter 1. Introduction -- Chapter 2. Related work -- Chapter 3. Background knowledge -- Blockchain functionality -- Consensus process - Public, consortium, and private blockchains -- Chapter 4. Scalable storage using historical blockchains -- A framework for scalable blockchain networks -- The block structure -- The structure of a meta-block -- Generation of a historical blockchain -- Chapter 5. Centralized, course-grained retrieval of historical blockchain data -- CCG load balancing data retrieval requests -- CCG retrieval of historical blockchain data -- Chapter 6. Decentralized, fine-grained retrieval of historical blockchain data -- Load balancing data retrieval requests -- The structure of shared assignment table -- Retrieval of historical blockchain data -- Chapter 7. Simulation of historical blockchain scheme in consortium hospital network -- Simulation construction -- Estimation of blockchain size -- Data retrieval time for a single request -- Data retrieval time for concurrent requests -- Weight distribution across network for concurrent requests -- Chapter 8. Conclusions and future work -- References.
References: Includes bibliographical references (pages 48-52).
Awarding Institution: University of Massachusetts Dartmouth
Degree Awarded: Master of Science (MS)
Degree in: Computer Science
Academic Unit: Department of Computer and Information Science
Language: English
Resource Type: Thesis
DOI: https://doi.org/10.62791/20297
Record Identifier: 9914424900301301