Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Evaluating machine learning strategies for classification of large-scale Kubernetes cluster logs
Blekinge Institute of Technology, Faculty of Computing, Department of Computer Science.
2022 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

Kubernetes is a free, open-source container orchestration system for deploying and managing Docker containers that host microservices. Its cluster logs are extremely helpful in determining the root cause of a failure. However, as systems become more complex, locating failures becomes more difficult and time-consuming. This study aims to identify the classification algorithms that accurately classify the given log data and, at the same time, require fewer computational resources. Because the data is quite large, we begin with expert-based feature selection to reduce the data size. Following that, TF-IDF feature extraction is performed, and finally, we compare five classification algorithms, SVM, KNN, random forest, gradient boosting and MLP using several metrics. The results show that Random forest produces good accuracy while requiring fewer computational resources compared to other algorithms. 

Place, publisher, year, edition, pages
2022. , p. 70
Keywords [en]
Kubernetes logs, feature selection, feature extraction, multi-class classification, Computational cost
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:bth-23934OAI: oai:DiVA.org:bth-23934DiVA, id: diva2:1711100
External cooperation
Ericsson
Subject / course
DV2572 Master´s Thesis in Computer Science
Educational program
DVACO Master's program in computer science 120,0 hp
Presentation
2022-09-27, Zoom, Karlskrona, 10:00 (English)
Supervisors
Examiners
Available from: 2022-11-16 Created: 2022-11-15 Last updated: 2025-09-30Bibliographically approved

Open Access in DiVA

Evaluating machine learning strategies for classification of large-scale Kubernetes cluster logs(5076 kB)235 downloads
File information
File name FULLTEXT02.pdfFile size 5076 kBChecksum SHA-512
0f9ec8776b29e9e7e1d8c1e540113bc2d9d014d952183deee90b3ce006832f0eb2fcf3dff3ffb7b751e9ea11260e33acae6f2173f9b3e9829e459ee102c4bb8d
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Sarika, Pawan
By organisation
Department of Computer Science
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 236 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 560 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf