From User Stories to Models: A Machine Learning Empowered Automation

Takwa Kochbati, Shuai Li, Sébastien Gérard, Chokri Mraidha

Abstract

In modern software development, manually deriving architecture models from software requirements expressed in natural language becomes a tedious and time-consuming task particularly for more complex systems. Moreover, the increase in size of the developed systems raises the need to decompose the software system into sub-systems at early stages since such decomposition aids to better design the system architecture. In this paper, we propose a machine learning based approach to automatically break-down the system into sub-systems and generate preliminary architecture models from natural language user stories in the Scrum process. Our approach consists of three pillars. Firstly, we compute word level similarity of requirements using word2vec as a prediction model. Secondly, we extend it to the requirement level similarity computation, using a scoring formula. Thirdly, we employ the Hierarchical Agglomerative Clustering algorithm to group the semantically similar requirements and provide an early decomposition of the system. Finally, we implement a set of specific Natural Language Processing heuristics in order to extract relevant elements that are needed to build models from the identified clusters. Ultimately, we illustrate our approach by the generation of sub-systems expressed as UML use-case models and demonstrate its applicability using three case studies.

Download


Paper Citation


in Harvard Style

Kochbati T., Li S., Gérard S. and Mraidha C. (2021). From User Stories to Models: A Machine Learning Empowered Automation.In Proceedings of the 9th International Conference on Model-Driven Engineering and Software Development - Volume 1: MODELSWARD, ISBN 978-989-758-487-9, pages 28-40. DOI: 10.5220/0010197800280040


in Bibtex Style

@conference{modelsward21,
author={Takwa Kochbati and Shuai Li and Sébastien Gérard and Chokri Mraidha},
title={From User Stories to Models: A Machine Learning Empowered Automation},
booktitle={Proceedings of the 9th International Conference on Model-Driven Engineering and Software Development - Volume 1: MODELSWARD,},
year={2021},
pages={28-40},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010197800280040},
isbn={978-989-758-487-9},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 9th International Conference on Model-Driven Engineering and Software Development - Volume 1: MODELSWARD,
TI - From User Stories to Models: A Machine Learning Empowered Automation
SN - 978-989-758-487-9
AU - Kochbati T.
AU - Li S.
AU - Gérard S.
AU - Mraidha C.
PY - 2021
SP - 28
EP - 40
DO - 10.5220/0010197800280040