
language models be too big? In Proceedings of the
2021 ACM Conference on Fairness, Accountability,
and Transparency, FAccT ’21, pages 610–51998623.
ACM.
Bender, E. and Koller, A. (2020). Climbing towards nlu:
On meaning, form, andunderstanding in the age of
data. In Proceedings of the 58th Annual Meeting of
the Association for Computational Linguistics, pages
5185–51998. ACL.
Binamungu, L. P. and Maro, S. (2023). Behaviour driven
development: a systematic mapping study. Journal of
Systems and Software, 203:111749.
Bruschi, S., Xiao, L., Kavatkar, M., et al. (2019). Be-
havior Driven-Development (BDD): a case study in
healthtech. In Pacific NW Software Quality Confer-
ence.
Couto, T., dos Santos Marczak, S., Callegari, D. A., M
´
ora,
M., and Rocha, F. (2022). On the Characterization
of Behavior-Driven Development Adoption Benefits:
A Multiple Case Study of Novice Software Teams.
Anais do XXI Simp
´
osio Brasileiro de Qualidade de
Software, 2022, Brasil.
Guerra-Garcia, C., Nikiforova, A., Jim
´
enez, S., Perez-
Gonzalez, H. G., Ram
´
ırez-Torres, M. T., and
Onta
˜
non-Garc
´
ıa, L. (2023). ISO/IEC 25012 - Based
methodology for managing data quality requirements
in the development of information systems: Towards
Data Quality by Design . Data and Knowledge Engi-
neering, 145:102152–102152.
Karpurapu, S., Myneni, S., Nettur, U., Gajja, L. S., Burke,
D., Stiehm, T., and Payne, J. (2024). Comprehensive
evaluation and insights into the use of large language
models in the automation of behavior-driven develop-
ment acceptance test formulation. IEEE Access.
Lee, E., Gong, J., and Cao, Q. (2023). Object oriented bdd
and executable human-language module specification.
In 2023 26th ACIS International Winter Conference
on Software Engineering, Artificial Intelligence, Net-
working and Parallel/Distributed Computing (SNPD-
Winter), pages 127–133. IEEE.
Ma, S.-P., Chen, Y.-A., Guo, Y.-J., and Su, Y.-S. (2023).
Semi-automated behavior-driven testing for the web
front-ends. In 2023 IEEE International Conference
on e-Business Engineering (ICEBE), pages 225–230.
IEEE.
Mock, M., Melegati, J., and Russo, B. (2024). Generative
ai for test driven development: Preliminary results.
arXiv preprint arXiv:2405.10849.
North, D. (2006). Introducing BDD.
https://dannorth.net/introducing-bdd/.
North, D. et al. (2019). What’s in a story? Dosegljivo:
https://dannorth. net/whats-in-a-story/[Dostopano 4.
5. 2016].
Oliveira, G., Marczak, S., and Moralles, C. (2019). How
to evaluate bdd scenarios’ quality? In Proceedings
of the XXXIII Brazilian Symposium on Software Engi-
neering, pages 481–490.
Pereira, L., Sharp, H., de Souza, C., Oliveira, G., Marczak,
S., and Bastos, R. (2018). Behavior-Driven Develop-
ment benefits and challenges: reports from an indus-
trial study. In Proceedings of the 19th International
Conference on Agile Software Development: Com-
panion, pages 1–4.
Rajbhoj, A., Somase, A., Kulkarni, P., and Kulkarni, V.
(2024). Accelerating software development using
generative ai: Chatgpt case study. In Proceedings of
the 17th Innovations in Software Engineering Confer-
ence, pages 1–11.
Runeson, P. and H
¨
ost, M. (2009). Guidelines for conduct-
ing and reporting case study research in software en-
gineering. Empirical software engineering. Springer,
V.14:131–164.
Sauvola, J., Tarkoma, S., Klemettinen, M., Riekki, J., and
Doermann, D. (2024). Future of software develop-
ment with generative ai. Automated Software Engi-
neering, 31(1):26.
Silva, T. R. and Fitzgerald, B. (2021). Empirical findings on
BDD story parsing to support consistency assurance
between requirements and artifacts. In Evaluation and
Assessment in Software Engineering, pages 266–271.
Smart, J. (2014). BDD in Action: Behavior-Driven Devel-
opment for the Whole Software Lifecycle. Manning
Publications, Shelter Island, NY.
Takerngsaksiri, W., Charakorn, R., Tantithamthavorn, C.,
and Li, Y.-F. (2024). Tdd without tears: Towards test
case generation from requirements through deep rein-
forcement learning. arXiv preprint arXiv:2401.07576.
Wohlin, C., H
¨
ost, M., and Henningsson, K. (2003). Empir-
ical research methods in software engineering. Em-
pirical methods and studies in software engineering:
Experiences from ESERNET. Springer, pages 7–23.
Zameni, T., van Den Bos, P., Tretmans, J., Foederer, J., and
Rensink, A. (2023). From bdd scenarios to test case
generation. In 2023 IEEE International Conference
on Software Testing, Verification and Validation Work-
shops (ICSTW), pages 36–44. IEEE.
Zhang, L., Wang, Y., and Li, X. (2023). Enhancing bdd
test generation with large language models. Journal
of Software Engineering Research and Development,
11(2):75–90.
Zhou, X., Jin, Y., Zhang, H., Li, S., and Huang, X. (2016).
A map of threats to validity of Systematic Literature
Reviews in Software Engineering. In 2016 23rd Asia-
Pacific Software Engineering Conference (APSEC),
pages 153–160. IEEE.
WEBIST 2025 - 21st International Conference on Web Information Systems and Technologies
58