
tion: the analyst must have previously written the re-
quirements document, as the text must be copied and
pasted into the tool. Furthermore, MUCE does not
generate use case or class diagrams, which reduces
its usefulness in an automated workflow.
Another approach was presented by Herwanto
(Herwanto, 2024), who explored the generation of
Data Flow Diagrams (DFDs) using ChatGPT. The ex-
periment was based on a structured prompt composed
of four parts: task description, detailed instructions,
input consisting of user stories, and an example using
few-shot learning. The generated output included a
CSV-formatted syntax, which was then imported into
draw.io for visualization and editing of the diagram.
Although the study confirms the potential of ChatGPT
to assist in visual modeling within software engineer-
ing, the process still requires significant manual in-
tervention. The development team must first define
the user stories, and the tool is not integrated with
any specific methodology, making the process more
labor-intensive.
While the potential benefits of using ChatGPT in
requirements engineering are clear, there is still sig-
nificant room for exploration and improvement. Our
goal is to leverage this technology to automate the re-
quirements engineering process as much as possible,
covering everything from stakeholder interviews to
model generation. We believe that combining LLMs
with the SofIA methodology and effective prompt en-
gineering will allow us to develop a more precise and
autonomous tool, reducing the engineers’ workload
and optimising the transition from requirements anal-
ysis to modeling.
3 MATERIALS AND METHODS
OpenAI provides accessible and well-documented
APIs that allow developers to integrate large language
models into custom applications, services, or sys-
tems. These APIs work similarly to ChatGPT (Ope-
nAI, 2024b), where natural language prompts are sent
to the model, which then generates context-based re-
sponses.
The API offers access to various models designed
for different use cases. In this project, we will specif-
ically work with the o4-mini model, the latest small
model in OpenAI’s ”o” series. It is optimised for
fast and effective reasoning, with exceptional per-
formance in programming and visual tasks (OpenAI,
2024a). Despite being smaller than other models, it
provides an excellent balance of cost, performance,
and versatility.
To use the API effectively, it is essential to un-
derstand the concept of tokens, the smallest units of
text the model processes. Tokens are used both in
the input (prompt) and the output (response), mean-
ing that cost and performance are directly tied to to-
ken usage. Optimizing prompts and managing token
counts is crucial for efficiency and cost reduction.
Before integrating OpenAI’s APIs into any soft-
ware methodology, developers must obtain an API
key. This key functions as a unique identifier that au-
thenticates requests, controls access, and logs usage
(Auger and Saroyan, 2024).
Access to the API is managed through a pay-as-
you-go pricing model, where the total cost depends
on the selected model and the number of tokens used
for input and output. OpenAI regularly publishes its
pricing on its official website, allowing developers to
manage budgets, set usage limits, and plan resource
consumption efficiently.
During the development of this project and the
execution of the tests, approximately 100.000 tokens
were used, resulting in a total cost of only $0.02 This
highlights the remarkable cost-efficiency of OpenAI’s
o4-mini model. Despite the intensive use of the model
to generate and process complex information, the eco-
nomic impact was negligible. This demonstrates one
of the key advantages of using optimised language
models: the ability to integrate AI into software en-
gineering processes without incurring high costs an
especially valuable benefit for academic, research-
based, or budget-constrained projects.
The SofIA methodology (Software Methodology
for Industrial Applications) was developed by the ES3
research group (Engineering and Science for Soft-
ware Systems) at the University of Seville. It is
a model-based methodological framework supported
by a CASE tool also named SofIA. This methodol-
ogy originates from a previous proposal, NDT (Navi-
gational Development Techniques), and its associated
tool, NDT-Suite (Garc
´
ıa-Garc
´
ıa et al., 2014). The cre-
ation of SofIA stems from over two decades of expe-
rience applying NDT-Suite in industrial contexts.
The SofIA tool was developed by extending func-
tionalities from platforms such as Enterprise Archi-
tect (Sparx Systems, 2022) and Draw.io, resulting in
a powerful CASE tool for software application design
and development. It provides high flexibility in the
design starting points and offers automated support
for bidirectional traceability, which is crucial for soft-
ware lifecycle management. SofIA focuses particu-
larly on the early stages of development, including
requirements engineering, prototyping, use cases, and
data structure modeling (Escalona et al., 2023).
SofIA is defined as an MDE (Model-Driven En-
gineering) methodology (Escalona et al., 2023), al-
Towards an Integrative Approach Between the SofIA Methodology and ChatGPT for the Extraction of Requirements from User Interviews
167