12342 of 4
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Software Analytics for Supporting Practitioners in Bug Management
Blekinge Institute of Technology, Faculty of Computing, Department of Software Engineering.ORCID iD: 0000-0002-5964-5554
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Context: In large-scale software development, a large number of bug reports are submitted during software development and maintenance. Practitioners need the ability to analyze this abundant data to make data-driven decisions about bug management tasks.

Objective: This thesis aims to utilize software analytics (SA) to support practitioners in bug management. The objectives of this thesis are (1) to identify and structure the knowledge on the use of SA for software engineering (SE) tasks and (2) to investigate and evaluate the practical application of SA to support practitioners in managing invalid bug reports (IBRs).

Method: We conducted a tertiary review and systematic mapping study to achieve the first objective and comparative experiments and two industrial case studies to achieve the second objective. Throughout the thesis work, we relied on a technology transfer model to guide the research and facilitate the adoption of ML techniques for the early identification of IBRs at the case company.

Results: We provide a comprehensive map of various SA applications for SE tasks and a decision matrix that can assist in selecting the most appropriate ML technique for bug report classification for a given context. Our results indicate that an ML technique can identify IBRs with acceptable accuracy at an early stage in practice. Furthermore, the results of an SA-based approach indicate that it can support practitioners in devising preventive measures for IBRs.

Conclusion: Through industrial validations, this thesis provides evidence of the usefulness of SA in bug management, particularly in supporting practitioners in managing IBRs in large-scale software development.

Place, publisher, year, edition, pages
Karlskrona: Blekinge Tekniska Högskola, 2025. , p. 231
Series
Blekinge Institute of Technology Doctoral Dissertation Series, ISSN 1653-2090 ; 2025:02
Keywords [en]
Issue management, Bug reports, Invalid bug reports, Software analytics, Machine learning, AutoML, Large language models
National Category
Software Engineering
Research subject
Software Engineering
Identifiers
URN: urn:nbn:se:bth-27197ISBN: 978-91-7295-494-6 (print)OAI: oai:DiVA.org:bth-27197DiVA, id: diva2:1917756
Public defence
2025-02-13, J1630, Campus Gräsvik, Karlskrona, 13:15 (English)
Opponent
Supervisors
Funder
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile CommunicationsAvailable from: 2024-12-04 Created: 2024-12-03 Last updated: 2025-01-08Bibliographically approved
List of papers
1. Software analytics for software engineering: A tertiary review
Open this publication in new window or tab >>Software analytics for software engineering: A tertiary review
2024 (English)Report (Other academic)
Abstract [en]

Software analytics (SA) is frequently proposed as a tool to support practitioners in software engineering (SE) tasks. We have observed that several secondary studies on SA have been published. Some of these studies have overlapping aims and some have even been published in the same calendar year. This presents an opportunity to analyze the congruence or divergence of the conclusions in these studies. Such an analysis can help identify broader generalizations beyond any of the individual secondary studies. We identified five secondary studies on the use of SA for SE. These secondary studies cover primary research from 2000 to 2021. Despite the overlapping objectives and search time frames of these secondary studies, there is negligible overlap of primary studies between these secondary studies. Thus, each of them provides an isolated view, and together, they provide a fragmented view, i.e., there is no “common picture” of the area. Thus, we conclude that an overview of the literature identified by these secondary studies would be useful in providing a more comprehensive overview of the topic.

Publisher
p. 14
Keywords
Software engineering, Software analytics, Tertiary review, Machine learning, Data analytics, Visual analytics
National Category
Software Engineering
Research subject
Software Engineering; Software Engineering
Identifiers
urn:nbn:se:bth-27192 (URN)
Funder
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile Communications
Available from: 2024-12-03 Created: 2024-12-03 Last updated: 2025-01-15Bibliographically approved
2. Software Analytics for Software Engineering: A Systematic Mapping Study
Open this publication in new window or tab >>Software Analytics for Software Engineering: A Systematic Mapping Study
(English)Manuscript (preprint) (Other academic)
Keywords
Software engineering, Software analytics, Systematic mapping study, Scopic review, Literature review, Machine learning, Data analytics, Visual analytics
National Category
Software Engineering
Identifiers
urn:nbn:se:bth-27194 (URN)
Funder
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile Communications
Available from: 2024-12-03 Created: 2024-12-03 Last updated: 2024-12-03Bibliographically approved
3. A comparative analysis of ML techniques for bug report classification
Open this publication in new window or tab >>A comparative analysis of ML techniques for bug report classification
(English)Manuscript (preprint) (Other academic)
Keywords
Software Maintenance, Issue Classification, Bug Report Classification, Natural Language Processing, BERT, RoBERTa, Large Language Models, Automated Machine Learning, AutoML, Software Analytics
National Category
Software Engineering
Research subject
Software Engineering
Identifiers
urn:nbn:se:bth-27193 (URN)
Funder
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile Communications
Available from: 2024-12-03 Created: 2024-12-03 Last updated: 2024-12-03Bibliographically approved
4. An Intelligent Tool for Classifying Issue Reports
Open this publication in new window or tab >>An Intelligent Tool for Classifying Issue Reports
2023 (English)In: Proceedings - 2023 IEEE/ACM 2nd International Workshop on Natural Language-Based Software Engineering, NLBSE 2023, Institute of Electrical and Electronics Engineers (IEEE), 2023, p. 13-15Conference paper, Published paper (Refereed)
Abstract [en]

A considerable amount of issue reports are submitted daily in large-scale software development. Manually reviewing and classifying each issue report is challenging and error-prone. Thus, to assist practitioners, in this paper, we propose and evaluate an automatic supervised machine learning-based approach that can automatically predict the newly submitted issue report type (i.e., bug, feature, question, or documentation). We applied the supervised machine learning-based approach to over 1.4 million issue reports data from real open-source projects. We performed our experiments using Stochastic Gradient Descent (SGD)-based classifier and achieved an F1 micro average score of 0.8523. © 2023 IEEE.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2023
Keywords
Bug Reports, Issue Classification, Natural Language Processing, Software Analytics, Software Maintenance, Gradient methods, Learning algorithms, Learning systems, Natural language processing systems, Open source software, Software design, Supervised learning, Intelligent tools, Language processing, Large-scales, Learning-based approach, Natural languages, Software analytic, Supervised machine learning, Stochastic systems
National Category
Software Engineering
Identifiers
urn:nbn:se:bth-25325 (URN)10.1109/NLBSE59153.2023.00010 (DOI)001039169700004 ()2-s2.0-85167946589 (Scopus ID)9798350301786 (ISBN)
Conference
2nd IEEE/ACM International Workshop on Natural Language-Based Software Engineering, NLBSE 2023, Melbourne, 20 May 2023
Funder
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile Communications
Available from: 2023-08-25 Created: 2023-08-25 Last updated: 2024-12-03Bibliographically approved
5. Industrial adoption of machine learning techniques for early identification of invalid bug reports
Open this publication in new window or tab >>Industrial adoption of machine learning techniques for early identification of invalid bug reports
2024 (English)In: Empirical Software Engineering, ISSN 1382-3256, E-ISSN 1573-7616, Vol. 29, no 5, article id 130Article in journal (Refereed) Published
Abstract [en]

Despite the accuracy of machine learning (ML) techniques in predicting invalid bug reports, as shown in earlier research, and the importance of early identification of invalid bug reports in software maintenance, the adoption of ML techniques for this task in industrial practice is yet to be investigated. In this study, we used a technology transfer model to guide the adoption of an ML technique at a company for the early identification of invalid bug reports. In the process, we also identify necessary conditions for adopting such techniques in practice. We followed a case study research approach with various design and analysis iterations for technology transfer activities. We collected data from bug repositories, through focus groups, a questionnaire, and a presentation and feedback session with an expert. As expected, we found that an ML technique can identify invalid bug reports with acceptable accuracy at an early stage. However, the technique’s accuracy drops over time in its operational use due to changes in the product, the used technologies, or the development organization. Such changes may require retraining the ML model. During validation, practitioners highlighted the need to understand the ML technique’s predictions to trust the predictions. We found that a visual (using a state-of-the-art ML interpretation framework) and descriptive explanation of the prediction increases the trustability of the technique compared to just presenting the results of the validity predictions. We conclude that trustability, integration with the existing toolchain, and maintaining the techniques’ accuracy over time are critical for increasing the likelihood of adoption. © The Author(s) 2024.

Place, publisher, year, edition, pages
Springer, 2024
Keywords
Concept drift, Defect classification, Invalid bug reports, Machine learning, Software maintenance, Software quality, Computer software maintenance, Computer software selection and evaluation, Industrial research, Technology transfer, Bug reports, Concept drifts, Industrial adoption, Industrial practices, Invalid bug report, Machine learning techniques, Machine-learning, Transfer models, Forecasting
National Category
Software Engineering
Identifiers
urn:nbn:se:bth-26802 (URN)10.1007/s10664-024-10502-3 (DOI)001283245300001 ()2-s2.0-85200034314 (Scopus ID)
Funder
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile CommunicationsKnowledge Foundation, 20220235
Available from: 2024-08-14 Created: 2024-08-14 Last updated: 2024-12-03Bibliographically approved
6. Early Identification of Invalid Bug Reports in Industrial Settings – A Case Study
Open this publication in new window or tab >>Early Identification of Invalid Bug Reports in Industrial Settings – A Case Study
2022 (English)In: Product-Focused Software Process Improvement / [ed] Taibi D., Kuhrmann M., Mikkonen T., Abrahamsson P., Klünder J., Springer Science+Business Media B.V., 2022, p. 497-507Conference paper, Published paper (Refereed)
Abstract [en]

Software development companies spend considerable time resolving bug reports. However, bug reports might be invalid, i.e., not point to a valid flaw. Expensive resources and time might be expended on invalid bug reports before discovering that they are invalid. In this case study, we explore the impact of invalid bug reports and develop and assess the use of machine learning (ML) to indicate whether a bug report is likely invalid. We found that about 15% of bug reports at the case company are invalid, and that their resolution time is similar to valid bug reports. Among the ML-based techniques we used, logistic regression and SVM show promising results. In the feedback, practitioners indicated an interest in using the tool to identify invalid bug reports at early stages. However, they emphasized the need to improve the explainability of ML-based recommendations and to reduce the maintenance cost of the tool. © 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.

Place, publisher, year, edition, pages
Springer Science+Business Media B.V., 2022
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 13709
Keywords
Bug classification, Bug reports, Invalid bugs, Machine learning, Software analytics, Valid bugs, Program debugging, Software design, Case-studies, Industrial settings, Invalid bug, Logistics regressions, Machine-learning, Resolution time, Software analytic, Valid bug, Support vector machines
National Category
Software Engineering
Identifiers
urn:nbn:se:bth-24056 (URN)10.1007/978-3-031-21388-5_34 (DOI)000897035000034 ()2-s2.0-85142737900 (Scopus ID)9783031213878 (ISBN)
Conference
23rd International Conference on Product-Focused Software Process Improvement, PROFES 2022, Jyväskylä, 21 November through 23 November 2022
Funder
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile Communications
Available from: 2022-12-12 Created: 2022-12-12 Last updated: 2024-12-03Bibliographically approved
7. A data-driven approach for understanding invalid bug reports: An industrial case study
Open this publication in new window or tab >>A data-driven approach for understanding invalid bug reports: An industrial case study
2023 (English)In: Information and Software Technology, ISSN 0950-5849, E-ISSN 1873-6025, Vol. 164, article id 107305Article in journal (Refereed) Published
Abstract [en]

Context: Bug reports created during software development and maintenance do not always describe deviations from a system's valid behavior. Such invalid bug reports may consume significant resources and adversely affect the prioritization and resolution of valid bug reports. There is a need to identify preventive actions to reduce the inflow of invalid bug reports. Existing research has shown that manually analyzing invalid bug report descriptions provides cues regarding preventive actions. However, such a manual approach is not cost-effective due to the time required to analyze a sufficiently large number of bug reports needed to identify useful patterns. Furthermore, the analysis needs to be repeated as the underlying causes of invalid bug reports change over time. Objective: In this study, we propose and evaluate the use of Latent Dirichlet Allocation (LDA), a topic modeling approach, to support practitioners in suggesting preventive actions to avoid the creation of similar invalid bug reports in the future. Method: In an industrial case study, we first manually analyzed descriptions of invalid bug reports to identify common patterns in their descriptions. We further investigated to what extent LDA can support this manual process. We used expert-based validation to evaluate the relevance of identified common patterns and their usefulness in suggesting preventive measures. Results: We found that invalid bug reports have common patterns that are perceived as relevant, and they can be used to devise preventive measures. Furthermore, the identification of common patterns can be supported with automation. Conclusion: Using LDA, practitioners can effectively identify representative groups of bug reports (i.e., relevant common patterns) from a large number of bug reports and analyze them further to devise preventive measures. © 2023 The Author(s)

Place, publisher, year, edition, pages
Elsevier, 2023
Keywords
Bug classification, Bug management, Invalid bug reports, LDA, Software analytics, Software maintenance, Topic modeling, Cost effectiveness, Software design, Statistics, Bug managements, Bug reports, Industrial case study, Invalid bug report, Latent Dirichlet allocation, Preventive action, Preventive measures, Software analytic, Computer software maintenance
National Category
Software Engineering
Identifiers
urn:nbn:se:bth-25295 (URN)10.1016/j.infsof.2023.107305 (DOI)001053508400001 ()2-s2.0-85166970380 (Scopus ID)
Funder
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile CommunicationsKnowledge Foundation, 20220235
Available from: 2023-08-18 Created: 2023-08-18 Last updated: 2024-12-03Bibliographically approved

Open Access in DiVA

fulltext(3239 kB)73 downloads
File information
File name FULLTEXT01.pdfFile size 3239 kBChecksum SHA-512
51561f5e6fbb1a72573dca39a28f462716fdb6679499ab7000f3764b3c7e831ab1f0fc0906e071a294b39e59508038fdababadf6b0a84f3f4f4233b34bf6cf53
Type fulltextMimetype application/pdf

Authority records

Laiq, Muhammad

Search in DiVA

By author/editor
Laiq, Muhammad
By organisation
Department of Software Engineering
Software Engineering

Search outside of DiVA

GoogleGoogle Scholar
Total: 75 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 1045 hits
12342 of 4
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf