Supporting the identification of prevalent quality issues in code changes by analyzing reviewers’ feedback
2025 (English)In: Software quality journal, ISSN 0963-9314, E-ISSN 1573-1367, Vol. 33, no 2, article id 22Article in journal (Refereed) Published
Abstract [en]
Context: Code reviewers provide valuable feedback during the code review. Identifying common issues described in the reviewers’ feedback can provide input for devising context-specific software development improvements. However, the use of reviewer feedback for this purpose is currently less explored.
Objective: In this study, we assess how automation can derive more interpretable and informative themes in reviewers’ feedback and whether these themes help to identify recurring quality-related issues in code changes.
Method: We conducted a participatory case study using the JabRef system to analyze reviewers’ feedback on merged and abandoned code changes. We used two promising topic modeling methods (GSDMM and BERTopic) to identify themes in 5,560 code review comments. The resulting themes were analyzed and named by a domain expert from JabRef.
Results: The domain expert considered the identified themes from the two topic models to represent quality-related issues. Different quality issues are pointed out in code reviews for merged and abandoned code changes. While BERTopic provides higher objective coherence, the domain expert considered themes from short-text topic modeling more informative and easy to interpret than BERTopic-based topic modeling.
Conclusions: The identified prevalent code quality issues aim to address the maintainability-focused issues. The analysis of code review comments can enhance the current practices for JabRef by improving the guidelines for new developers and focusing discussions in the developer forums. The topic model choice impacts the interpretability of the generated themes, and a higher coherence (based on objective measures) of generated topics did not lead to improved interpretability by a domain expert.
Place, publisher, year, edition, pages
Springer, 2025. Vol. 33, no 2, article id 22
Keywords [en]
Modern code review, Natural language processing, Open-source systems, Software quality improvement, Computer software selection and evaluation, Open source software, Software design, Code changes, Code review, Domain experts, Language processing, Natural languages, Open source system, Software quality improvements, Topic Modeling, Software quality
National Category
Software Engineering
Identifiers
URN: urn:nbn:se:bth-27789DOI: 10.1007/s11219-025-09720-9ISI: 001473057800001Scopus ID: 2-s2.0-105003288015OAI: oai:DiVA.org:bth-27789DiVA, id: diva2:1955876
Part of project
GIST – Gaining actionable Insights from Software Testing, Knowledge Foundation
Funder
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile CommunicationsKnowledge Foundation, 202202352025-05-022025-05-022025-05-02Bibliographically approved