89101112131411 of 38
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Comparative analysis of text mining and clustering techniques for assessing functional dependency between manual test cases
Ericsson AB, Stockholm, Sweden.
Mälardalen University.
German Aerospace Center, Cologne, Germany.
Chalmers University of Technology.
Show others and affiliations
2025 (English)In: Software quality journal, ISSN 0963-9314, E-ISSN 1573-1367, Vol. 33, no 2, article id 24Article in journal (Refereed) Published
Abstract [en]

Text mining techniques, particularly those leveraging machine learning for natural language processing, have gained significant attention for qualitative data analysis in software testing. However, their complexity and lack of transparency can pose challenges, especially in safety-critical domains where simpler, interpretable solutions are often preferred unless accuracy is heavily compromised. This study investigates the trade-offs between complexity, effort, accuracy, and utility in text mining and clustering techniques, focusing on their application for detecting functional dependencies among manual integration test cases in safety-critical systems. Using empirical data from an industrial testing project at ALSTOM Sweden, we evaluate various string distance methods, NCD compressors, and machine learning approaches. The results highlight the impact of preprocessing techniques, such as tokenization, and intrinsic factors, such as text length, on algorithm performance. Findings demonstrate how text mining and clustering can be optimized for safety-critical contexts, offering actionable insights for researchers and practitioners aiming to balance simplicity and effectiveness in their testing workflows. 

Place, publisher, year, edition, pages
Springer, 2025. Vol. 33, no 2, article id 24
Keywords [en]
Artificial intelligence, Clustering, Natural language processing, Software testing, Text mining, Cluster analysis, Natural language processing systems, Verification, Clustering techniques, Clusterings, Functional dependency, Language processing, Natural languages, Software testings, Text Clustering, Text mining techniques, Text-mining, Integration testing
National Category
Computer Sciences Artificial Intelligence
Identifiers
URN: urn:nbn:se:bth-27917DOI: 10.1007/s11219-025-09722-7ISI: 001489598700001Scopus ID: 2-s2.0-105005412458OAI: oai:DiVA.org:bth-27917DiVA, id: diva2:1961992
Available from: 2025-05-28 Created: 2025-05-28 Last updated: 2025-06-02Bibliographically approved

Open Access in DiVA

fulltext(1396 kB)14 downloads
File information
File name FULLTEXT01.pdfFile size 1396 kBChecksum SHA-512
dd49cddd7b68cc9146ae49ac2e4ccda9b741691906d791132cf54f2b5ca9841c31043944bc12d5df9082ad46770f7c6dfc6ef2de55a3b154e66577708b0707cd
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Feldt, Robert

Search in DiVA

By author/editor
Feldt, Robert
By organisation
Department of Software Engineering
In the same journal
Software quality journal
Computer SciencesArtificial Intelligence

Search outside of DiVA

GoogleGoogle Scholar
Total: 14 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 120 hits
89101112131411 of 38
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf