Developing Random Compaction Strategy for Apache Cassandra database and Evaluating performance of the strategy
2021 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE credits
Student thesis
Abstract [en]
Introduction: Nowadays, the data generated by global communication systems is enormously increasing. There is a need by Telecommunication Industries to monitor and manage this data generation efficiently. Apache Cassandra is a NoSQL database that manages any formatted data and a massive amount of data flow efficiently.
Aim: This project is focused on developing a new random compaction strategy and evaluating this random compaction strategy's performance. In this study, limitations of generic compaction strategies Size Tiered Compaction Strategy and Leveled Compaction Strategy will be investigated. A new random compaction strategy will be developed to address the limitations of the generic Compaction Strategies. Important performance metrics required for the evaluation of the strategy will be studied.
Method: In this study, a grey literature review is done to understand the working of Apache Cassandra, different compaction strategies' APIs. A random compaction strategy is developed in two phases of development. A testing environment is created consisting of a 4-node cluster and a simulator. Evaluated the performance by stress-testing the cluster using different workloads.
Conclusions: A stable RCS artifact is developed. This artifact also includes the support of generating random threshold from any user-defined distribution. Currently, only Uniform, Geometric, and Poisson distributions are supported. The RCS-Uniform's performance is found to be better than both STCS and LCS. The RCS-Poisson's performance is found to be not better than both STCS and LCS. The RCS-Geometric's performance is found to be better than STCS.
Place, publisher, year, edition, pages
2021. , p. 69
Keywords [en]
Apache Cassandra, Compaction, Random Probability Distributions, IBM Cloud, NoSQL databases
National Category
Telecommunications
Identifiers
URN: urn:nbn:se:bth-21246OAI: oai:DiVA.org:bth-21246DiVA, id: diva2:1537752
External cooperation
Ericsson AB, Karlskrona
Subject / course
ET2606 Masterarbete i elektroteknik med inriktning mot telekommunikationssystem 30,0 hp
Educational program
ETADT Plan för kvalifikation till masterexamen inom elektroteknik med inr mot telekommunikationssystem 120,0 hp
Presentation
, Karlskrona (English)
Supervisors
Examiners
2021-03-182021-03-162021-03-18Bibliographically approved