Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
On Evaluation of Data Stream Clustering Algorithms: A Survey
Blekinge Institute of Technology, Faculty of Computing, Department of Computer Science.ORCID iD: 0000-0001-7199-8080
Blekinge Institute of Technology, Faculty of Computing, Department of Computer Science.ORCID iD: 0000-0003-3128-191x
Blekinge Institute of Technology, Faculty of Computing, Department of Computer Science.ORCID iD: 0000-0001-9947-1088
Blekinge Institute of Technology, Faculty of Computing, Department of Computer Science.
2025 (English)In: IEEE Access, E-ISSN 2169-3536, Vol. 13, p. 139524-139546Article, review/survey (Refereed) Published
Abstract [en]

Data stream mining is a research area that has grown enormously in recent years. The main challenge is extracting knowledge in real-time from a possibly unbounded data stream. Clustering, a process in which groupings within the data are identified, is a valuable technique to extract and identify underlying structures of the data. An open question in stream clustering is how to evaluate the proposed algorithms. In this survey, we review the literature in the domain to identify common methodologies, datasets, and evaluation measures, used to evaluate the algorithms. We provide a short summary of the stream clustering algorithms in the literature, but our primary focus lies in the survey of cluster validation relevant to the evaluation of data stream clustering algorithms. We begin our literature review with the inception of clustering incrementally, namely with the introduction of the balanced iterative reducing and clustering using hierarchies (BIRCH) algorithm. We identify that the evaluation methodologies primarily focus on performance, and that aspects such as cluster quality are rarely considered. Performance has been the focal point of all evaluations, both in terms of computational performance and accuracy, since the inception of clustering data streams. We also identify that issues in the conventional clustering domain are present in the data stream clustering. However, minor additions to the evaluation methods can improve both the applicability and usefulness of the algorithms. 

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2025. Vol. 13, p. 139524-139546
Keywords [en]
Cluster analysis, cluster validation indices, cluster validation measures, clustering, data stream clustering, data stream mining, data streams, evaluation, review, streaming data, Clustering algorithms, Data mining, Iterative methods, Quality control, Cluster validation, Cluster validation index, Cluster validation measure, Clusterings, Data stream, Data streams mining, Validation index, Reviews
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:bth-28542DOI: 10.1109/ACCESS.2025.3596435ISI: 001550799400033Scopus ID: 2-s2.0-105013054673OAI: oai:DiVA.org:bth-28542DiVA, id: diva2:1993712
Part of project
HINTS - Human-Centered Intelligent Realities
Funder
Knowledge Foundation, 20220068Available from: 2025-09-01 Created: 2025-09-01 Last updated: 2025-09-30Bibliographically approved

Open Access in DiVA

fulltext(4078 kB)353 downloads
File information
File name FULLTEXT01.pdfFile size 4078 kBChecksum SHA-512
87f95eec4ed0702f0bbc7442ece40c5213f5d6864fa33ae99932774ca98b396aaa67550144078cc7096409b6f71b3046203ec31343857d77db8a9385d0f85edc
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Nordahl, ChristianBoeva, VeselkaGrahn, HåkanNetz Persson, Marie

Search in DiVA

By author/editor
Nordahl, ChristianBoeva, VeselkaGrahn, HåkanNetz Persson, Marie
By organisation
Department of Computer Science
In the same journal
IEEE Access
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 353 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 1193 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf