System disruptions
We are currently experiencing disruptions on the search portals due to high traffic. We are working to resolve the issue, you may temporarily encounter an error message.
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Industrial adoption of machine learning techniques for early identification of invalid bug reports
Blekinge Institute of Technology, Faculty of Computing, Department of Software Engineering.ORCID iD: 0000-0002-5964-5554
Blekinge Institute of Technology, Faculty of Computing, Department of Software Engineering.ORCID iD: 0000-0001-7266-5632
Blekinge Institute of Technology, Faculty of Computing, Department of Software Engineering.ORCID iD: 0000-0003-0639-4234
Lund University.
2024 (English)In: Empirical Software Engineering, ISSN 1382-3256, E-ISSN 1573-7616, Vol. 29, no 5, article id 130Article in journal (Refereed) Published
Abstract [en]

Despite the accuracy of machine learning (ML) techniques in predicting invalid bug reports, as shown in earlier research, and the importance of early identification of invalid bug reports in software maintenance, the adoption of ML techniques for this task in industrial practice is yet to be investigated. In this study, we used a technology transfer model to guide the adoption of an ML technique at a company for the early identification of invalid bug reports. In the process, we also identify necessary conditions for adopting such techniques in practice. We followed a case study research approach with various design and analysis iterations for technology transfer activities. We collected data from bug repositories, through focus groups, a questionnaire, and a presentation and feedback session with an expert. As expected, we found that an ML technique can identify invalid bug reports with acceptable accuracy at an early stage. However, the technique’s accuracy drops over time in its operational use due to changes in the product, the used technologies, or the development organization. Such changes may require retraining the ML model. During validation, practitioners highlighted the need to understand the ML technique’s predictions to trust the predictions. We found that a visual (using a state-of-the-art ML interpretation framework) and descriptive explanation of the prediction increases the trustability of the technique compared to just presenting the results of the validity predictions. We conclude that trustability, integration with the existing toolchain, and maintaining the techniques’ accuracy over time are critical for increasing the likelihood of adoption. © The Author(s) 2024.

Place, publisher, year, edition, pages
Springer, 2024. Vol. 29, no 5, article id 130
Keywords [en]
Concept drift, Defect classification, Invalid bug reports, Machine learning, Software maintenance, Software quality, Computer software maintenance, Computer software selection and evaluation, Industrial research, Technology transfer, Bug reports, Concept drifts, Industrial adoption, Industrial practices, Invalid bug report, Machine learning techniques, Machine-learning, Transfer models, Forecasting
National Category
Software Engineering
Identifiers
URN: urn:nbn:se:bth-26802DOI: 10.1007/s10664-024-10502-3ISI: 001283245300001Scopus ID: 2-s2.0-85200034314OAI: oai:DiVA.org:bth-26802DiVA, id: diva2:1888898
Part of project
GIST – Gaining actionable Insights from Software Testing, Knowledge Foundation
Funder
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile CommunicationsKnowledge Foundation, 20220235Available from: 2024-08-14 Created: 2024-08-14 Last updated: 2024-12-03Bibliographically approved
In thesis
1. Software Analytics for Supporting Practitioners in Bug Management
Open this publication in new window or tab >>Software Analytics for Supporting Practitioners in Bug Management
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Context: In large-scale software development, a large number of bug reports are submitted during software development and maintenance. Practitioners need the ability to analyze this abundant data to make data-driven decisions about bug management tasks.

Objective: This thesis aims to utilize software analytics (SA) to support practitioners in bug management. The objectives of this thesis are (1) to identify and structure the knowledge on the use of SA for software engineering (SE) tasks and (2) to investigate and evaluate the practical application of SA to support practitioners in managing invalid bug reports (IBRs).

Method: We conducted a tertiary review and systematic mapping study to achieve the first objective and comparative experiments and two industrial case studies to achieve the second objective. Throughout the thesis work, we relied on a technology transfer model to guide the research and facilitate the adoption of ML techniques for the early identification of IBRs at the case company.

Results: We provide a comprehensive map of various SA applications for SE tasks and a decision matrix that can assist in selecting the most appropriate ML technique for bug report classification for a given context. Our results indicate that an ML technique can identify IBRs with acceptable accuracy at an early stage in practice. Furthermore, the results of an SA-based approach indicate that it can support practitioners in devising preventive measures for IBRs.

Conclusion: Through industrial validations, this thesis provides evidence of the usefulness of SA in bug management, particularly in supporting practitioners in managing IBRs in large-scale software development.

Place, publisher, year, edition, pages
Karlskrona: Blekinge Tekniska Högskola, 2025. p. 231
Series
Blekinge Institute of Technology Doctoral Dissertation Series, ISSN 1653-2090 ; 2025:02
Keywords
Issue management, Bug reports, Invalid bug reports, Software analytics, Machine learning, AutoML, Large language models
National Category
Software Engineering
Research subject
Software Engineering
Identifiers
urn:nbn:se:bth-27197 (URN)978-91-7295-494-6 (ISBN)
Public defence
2025-02-13, J1630, Campus Gräsvik, Karlskrona, 13:15 (English)
Opponent
Supervisors
Funder
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile Communications
Available from: 2024-12-04 Created: 2024-12-03 Last updated: 2025-01-08Bibliographically approved

Open Access in DiVA

fulltext(3443 kB)59 downloads
File information
File name FULLTEXT01.pdfFile size 3443 kBChecksum SHA-512
a73c997c81223f3981b46724f90f2c409011a733ca2a5a51d41349faa30ef0db66bff2f69f504fb99b4674de0a163d9efa6c93584f31fcbf54ea8fff24660b36
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Laiq, MuhammadAli, Nauman binBörstler, Jürgen

Search in DiVA

By author/editor
Laiq, MuhammadAli, Nauman binBörstler, Jürgen
By organisation
Department of Software Engineering
In the same journal
Empirical Software Engineering
Software Engineering

Search outside of DiVA

GoogleGoogle Scholar
Total: 59 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 236 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf