loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Matheus O. Silva 1 ; Eduardo Nascimento 1 ; 2 ; Yenier Izquierdo 2 ; Melissa Lemos 1 ; 2 and Marco Casanova 2 ; 1

Affiliations: 1 Department of Informatics, PUC-Rio, Rio de Janeiro, RJ, Brazil ; 2 Tecgraf Institute, PUC-Rio, Rio de Janeiro, RJ, Brazil

Keyword(s): Conversational Agents, Database Interfaces, ReAcT, LLM.

Abstract: Database conversational agents support dialogues to help users interact with databases in their jargon. A strategy to construct such agents is to adopt an LLM-based architecture. However, evaluating agent-based systems is complex and lacks a definitive solution, as responses from such systems are open-ended, with no direct relationship between input and the expected response. This paper then focuses on the problem of evaluating LLM-based database conversational agents. It first introduces a tool to construct test datasets for such agents that explores the schema and the data values of the underlying database. The paper then describes an evaluation agent that behaves like a human user to assess the responses of a database conversational agent on a test dataset. Finally, the paper includes a proof-of-concept experiment with an implementation of a database conversational agent over two databases, the Mondial database and an industrial database in production at an energy company.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 216.73.216.141

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Silva, M. O., Nascimento, E., Izquierdo, Y., Lemos, M. and Casanova, M. (2025). Automated Evaluation of Database Conversational Agents. In Proceedings of the 21st International Conference on Web Information Systems and Technologies - WEBIST; ISBN 978-989-758-772-6; ISSN 2184-3252, SciTePress, pages 277-288. DOI: 10.5220/0013732900003985

@conference{webist25,
author={Matheus O. Silva and Eduardo Nascimento and Yenier Izquierdo and Melissa Lemos and Marco Casanova},
title={Automated Evaluation of Database Conversational Agents},
booktitle={Proceedings of the 21st International Conference on Web Information Systems and Technologies - WEBIST},
year={2025},
pages={277-288},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013732900003985},
isbn={978-989-758-772-6},
issn={2184-3252},
}

TY - CONF

JO - Proceedings of the 21st International Conference on Web Information Systems and Technologies - WEBIST
TI - Automated Evaluation of Database Conversational Agents
SN - 978-989-758-772-6
IS - 2184-3252
AU - Silva, M.
AU - Nascimento, E.
AU - Izquierdo, Y.
AU - Lemos, M.
AU - Casanova, M.
PY - 2025
SP - 277
EP - 288
DO - 10.5220/0013732900003985
PB - SciTePress