System disruptions
We are currently experiencing disruptions on the search portals due to high traffic. We are working to resolve the issue, you may temporarily encounter an error message.
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Case study identification with GPT-4 and implications for mapping studies
Blekinge Institute of Technology, Faculty of Computing, Department of Software Engineering.ORCID iD: 0000-0002-1532-8223
2024 (English)In: Information and Software Technology, ISSN 0950-5849, E-ISSN 1873-6025, Vol. 171, article id 107452Article in journal (Refereed) Published
Abstract [en]

Context: Rainer and Wohlin showed that case studies are not well understood by reviewers and authors and thus they say that a given research is a case study when it is not. Objective: Rainer and Wohlin proposed a smell indicator (inspired by code smells) to identify case studies based on the frequency of occurrences of words, which performed better than human classifiers. With the emergence of ChatGPT, we evaluate ChatGPT to assess its performance in accurately identifying case studies. We also reflect on the results’ implications for mapping studies, specifically data extraction. Method: We used ChatGPT with the model GPT-4 to identify case studies and compared the result with the smell indicator for precision, recall, and accuracy. Results: GPT-4 and the smell indicator perform similarly, with GPT-4 performing slightly better in some instances and the smell indicator (SI) in others. The advantage of GPT-4 is that it is based on the definition of case studies and provides traceability on how it reaches its conclusions. Conclusion: As GPT-4 performed well on the task and provides traceability, we should use and, with that, evaluate it on data extraction tasks, supporting us as authors. © 2024 The Author(s)

Place, publisher, year, edition, pages
Elsevier, 2024. Vol. 171, article id 107452
Keywords [en]
Case study, Data extraction, GPT-4, Systematic mapping studies, Data mining, Extraction, Case-studies, Code smell, Mapping studies, Performance, Mapping
National Category
Software Engineering
Identifiers
URN: urn:nbn:se:bth-26102DOI: 10.1016/j.infsof.2024.107452ISI: 001205391500001Scopus ID: 2-s2.0-85189468248OAI: oai:DiVA.org:bth-26102DiVA, id: diva2:1851067
Available from: 2024-04-12 Created: 2024-04-12 Last updated: 2024-05-07Bibliographically approved

Open Access in DiVA

fulltext(1034 kB)107 downloads
File information
File name FULLTEXT01.pdfFile size 1034 kBChecksum SHA-512
ba67a77ac0f1ff6b99c8b8258c75bdb3ffa9fd2d295c5ce397f088caed171034a00ac52b97e03866c3e2a180e9db49236fe51ebc69d2890013d9fc1a9ec3ab13
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Petersen, Kai

Search in DiVA

By author/editor
Petersen, Kai
By organisation
Department of Software Engineering
In the same journal
Information and Software Technology
Software Engineering

Search outside of DiVA

GoogleGoogle Scholar
Total: 107 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 257 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf