Developing Random Compaction Strategy for Apache Cassandra database and Evaluating performance of the strategy
Blekinge Institute of Technology.
2021 (English). Independent thesis, Advanced level (degree of Master (Two Years)), 20 credits / 30 HE credits. Student thesis
Abstract [en]

Introduction: The volume of data generated by global communication systems is growing rapidly, and the telecommunications industry needs to monitor and manage this data efficiently. Apache Cassandra is a NoSQL database that efficiently handles data of any format as well as massive data flows.

Aim: This project develops a new random compaction strategy (RCS) and evaluates its performance. The study investigates the limitations of the generic compaction strategies, Size Tiered Compaction Strategy (STCS) and Leveled Compaction Strategy (LCS), develops the random compaction strategy to address those limitations, and studies the performance metrics needed to evaluate it.
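For readers unfamiliar with size-tiered compaction, the following is a minimal Python sketch of the general idea behind STCS: SSTables of similar size are grouped into buckets, and a bucket becomes a compaction candidate once it holds enough tables. It is a simplified illustration, not Cassandra's implementation and not code from the thesis. The function names are hypothetical; the default values mirror the documented STCS options (`bucket_low`, `bucket_high`, `min_threshold`, `max_threshold`, `min_sstable_size`), with sizes assumed to be in MB.

```python
def bucket_by_size(sizes, bucket_low=0.5, bucket_high=1.5, min_sstable_size=50):
    """Group SSTable sizes into buckets of similar size (simplified sketch)."""
    buckets = []  # each entry is [average_size, [member sizes]]
    for size in sorted(sizes):
        for bucket in buckets:
            avg = bucket[0]
            # A table joins a bucket if it is close to the bucket average...
            similar = avg * bucket_low < size < avg * bucket_high
            # ...or if both the table and the bucket are "small".
            both_small = size < min_sstable_size and avg < min_sstable_size
            if similar or both_small:
                bucket[1].append(size)
                bucket[0] = sum(bucket[1]) / len(bucket[1])
                break
        else:
            # No existing bucket fits, so start a new one.
            buckets.append([size, [size]])
    return [members for _, members in buckets]


def compaction_candidates(sizes, min_threshold=4, max_threshold=32):
    """Return buckets with at least min_threshold tables, capped at max_threshold."""
    return [
        bucket[:max_threshold]
        for bucket in bucket_by_size(sizes)
        if len(bucket) >= min_threshold
    ]


candidates = compaction_candidates([100, 110, 95, 105, 1000, 5, 6])
```

With these inputs only the four mid-sized tables (95, 100, 105 and 110 MB) form a bucket large enough to be compacted; the two small tables and the single large one are left alone until more tables of similar size arrive.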

Method: A grey literature review is conducted to understand how Apache Cassandra works and the APIs of its different compaction strategies. The random compaction strategy is developed in two phases. A testing environment consisting of a 4-node cluster and a simulator is created, and performance is evaluated by stress-testing the cluster with different workloads.

Conclusions: A stable RCS artifact is developed. The artifact also supports generating the random threshold from a user-defined distribution; currently, only the Uniform, Geometric, and Poisson distributions are supported. RCS-Uniform performs better than both STCS and LCS. RCS-Poisson does not perform better than STCS and LCS. RCS-Geometric performs better than STCS.
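As an illustration of drawing a random threshold from the three supported distributions, here is a minimal Python sketch using only the standard library. The abstract does not say what the threshold controls or how the distributions are parameterised, so everything specific here is an assumption: the threshold is taken to be a count of SSTables clamped to a hypothetical range `[low, high]`, and the function name `sample_threshold` and the parameters `p` and `lam` are invented for the example. It is not the thesis's implementation.

```python
import math
import random


def sample_threshold(distribution, rng, low=4, high=32, p=0.2, lam=8.0):
    """Draw an integer threshold and clamp it to [low, high] (hypothetical sketch)."""
    if distribution == "uniform":
        # Every integer in [low, high] is equally likely.
        value = rng.randint(low, high)
    elif distribution == "geometric":
        # Inverse-transform sampling: trials until first success, support 1, 2, ...
        u = 1.0 - rng.random()  # in (0, 1], avoids log(0)
        value = int(math.log(u) / math.log(1.0 - p)) + 1
    elif distribution == "poisson":
        # Knuth's multiplication method, adequate for small lam.
        limit = math.exp(-lam)
        value = 0
        product = rng.random()
        while product > limit:
            value += 1
            product *= rng.random()
    else:
        raise ValueError(f"unsupported distribution: {distribution}")
    # Assumed clamping so the threshold always stays in a usable range.
    return max(low, min(high, value))


rng = random.Random(42)  # seeded for repeatable experiments
thresholds = {
    name: [sample_threshold(name, rng) for _ in range(5)]
    for name in ("uniform", "geometric", "poisson")
}
```

Passing the generator in explicitly keeps runs repeatable, which matters when comparing strategies under the same workload. Supporting a further distribution would only require another branch that returns an integer.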

Place, publisher, year, edition, pages
2021, p. 69
Keywords [en]
Apache Cassandra, Compaction, Random Probability Distributions, IBM Cloud, NoSQL databases
National Category
Telecommunications
Identifiers
URN: urn:nbn:se:bth-21246
OAI: oai:DiVA.org:bth-21246
DiVA, id: diva2:1537752
External cooperation
Ericsson AB, Karlskrona
Subject / course
ET2606 Master's Thesis in Electrical Engineering with emphasis on Telecommunication Systems, 30.0 HE credits
Educational program
ETADT Plan for qualification for a Master's degree in Electrical Engineering with emphasis on Telecommunication Systems, 120.0 HE credits
Presentation
Karlskrona (English)
Available from: 2021-03-18. Created: 2021-03-16. Last updated: 2021-03-18. Bibliographically approved.

Open Access in DiVA

Developing Random Compaction Strategy... (2201 kB), 304 downloads
File information
File name: FULLTEXT02.pdf
File size: 2201 kB
Checksum (SHA-512): 6cf4e7cb82f43e0adfea9a66e3b87a9c4ac3c2e6bc53cd1545fd74777ddeb35356d0f7b3456db9cbe587ad8a4b12ee7ed1408be6d9e4392eeda5472c25e0a1be
Type: fulltext
Mimetype: application/pdf
