Logo image
Performance of distributed join algorithms on large data sets: a thesis in Computer Science
Thesis   Open access

Performance of distributed join algorithms on large data sets: a thesis in Computer Science

Felicia A. Imbriano
Master of Science (MS), University of Massachusetts Dartmouth
2019
DOI:
https://doi.org/10.62791/20052

Abstract

Database management. Querying (Computer science) Data structures (Computer science) Algorithms. Relational Databases.
In query processing, computing joins is a common and expensive operation. When working with larger tables that also reside on different network nodes, performance becomes even more important. The simplest strategy for computing joins sends the smaller table across the network and then a standard join algorithm is executed. The Semi-Join algorithm is a well-known algorithm that attempts to improve the overall performance by reducing the amount of data transferred across the network. CNF Join is a new algorithm that takes a different approach to reducing network traffic, and CNF-Semi-Join combines the Semi-Join and CNF Join techniques. This project will begin to empirically compare the performance of these algorithms under a variety of circumstances.
pdf
Imbriano F.A. COE MS Thesis 2019616.12 kBDownloadView
Open Access CC BY-NC-ND V4.0

Metrics

1 File views/ downloads
5 Record Views

Details

Logo image