Automatic Detection of Causality in Requirement Artifacts: The CiRA ApproachShow others and affiliations
2021 (English)In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) / [ed] Dalpiaz F., Spoletini P., Springer Science and Business Media Deutschland GmbH , 2021, Vol. 12685, p. 19-36Conference paper, Published paper (Refereed)
Abstract [en]
[Context & motivation:] System behavior is often expressed by causal relations in requirements (e.g., If event 1, then event 2). Automatically extracting this embedded causal knowledge supports not only reasoning about requirements dependencies, but also various automated engineering tasks such as seamless derivation of test cases. However, causality extraction from natural language (NL) is still an open research challenge as existing approaches fail to extract causality with reasonable performance. [Question/problem:] We understand causality extraction from requirements as a two-step problem: First, we need to detect if requirements have causal properties or not. Second, we need to understand and extract their causal relations. At present, though, we lack knowledge about the form and complexity of causality in requirements, which is necessary to develop a suitable approach addressing these two problems. [Principal ideas/results:] We conduct an exploratory case study with 14,983 sentences from 53 requirements documents originating from 18 different domains and shed light on the form and complexity of causality in requirements. Based on our findings, we develop a tool-supported approach for causality detection (CiRA, standing for Causality in Requirement Artifacts). This constitutes a first step towards causality extraction from NL requirements. [Contribution:] We report on a case study and the resulting tool-supported approach for causality detection in requirements. Our case study corroborates, among other things, that causality is, in fact, a widely used linguistic pattern to describe system behavior, as about a third of the analyzed sentences are causal. We further demonstrate that our tool CiRA achieves a macro-F 1 score of 82% on real word data and that it outperforms related approaches with an average gain of 11.06% in macro-Recall and 11.43% in macro-Precision. Finally, we disclose our open data sets as well as our tool to foster the discourse on the automatic detection of causality in the RE community. © 2021, Springer Nature Switzerland AG.
Place, publisher, year, edition, pages
Springer Science and Business Media Deutschland GmbH , 2021. Vol. 12685, p. 19-36
Series
Lecture Notes in Computer Science, ISSN 03029743, E-ISSN 16113349 ; 12685
Keywords [en]
Case study, Causality, Natural Language Processing, Requirements engineering, Computer software selection and evaluation, Extraction, Macros, Automatic Detection, Different domains, Exploratory case studies, Knowledge supports, Linguistic patterns, Requirements dependencies, Requirements document, Research challenges, Open Data
National Category
Computer Sciences Software Engineering
Identifiers
URN: urn:nbn:se:bth-21703DOI: 10.1007/978-3-030-73128-1_2ISI: 000788007000002Scopus ID: 2-s2.0-85107442991ISBN: 9783030731274 (print)OAI: oai:DiVA.org:bth-21703DiVA, id: diva2:1568890
Conference
27th International Working Conference on Requirements Engineering: Foundation for Software Quality, REFSQ 2021, 12 April 2021 - 15 April 2021
Part of project
SERT- Software Engineering ReThought, Knowledge Foundation
Funder
Knowledge Foundation, 201800102021-06-182021-06-182022-05-13Bibliographically approved