Automated Evaluation of Database Conversational Agents

Matheus Silva; Eduardo Nascimento; Yenier Izquierdo; Melissa Lemos; Marco Casanova

Topics: Generative AI Application Development and LLM Engineering; Natural Language Processing

In Proceedings of the 21st International Conference on Web Information Systems and Technologies (WEBIST) - Volume 1, pages 277-288, 2025, Marbella, Spain

Authors: Matheus O. Silva ¹; Eduardo Nascimento ¹,²; Yenier Izquierdo ²; Melissa Lemos ¹,² and Marco Casanova ²,¹

Affiliations: ¹ Department of Informatics, PUC-Rio, Rio de Janeiro, RJ, Brazil; ² Tecgraf Institute, PUC-Rio, Rio de Janeiro, RJ, Brazil

Keywords: Conversational Agents, Database Interfaces, ReAct, LLM.

Abstract: Database conversational agents support dialogues to help users interact with databases in their jargon. A strategy to construct such agents is to adopt an LLM-based architecture. However, evaluating agent-based systems is complex and lacks a definitive solution, as responses from such systems are open-ended, with no direct relationship between input and the expected response. This paper therefore focuses on the problem of evaluating LLM-based database conversational agents. It first introduces a tool to construct test datasets for such agents that explores the schema and the data values of the underlying database. The paper then describes an evaluation agent that behaves like a human user to assess the responses of a database conversational agent on a test dataset. Finally, the paper includes a proof-of-concept experiment with an implementation of a database conversational agent over two databases, the Mondial database and an industrial database in production at an energy company.

CC BY-NC-ND 4.0

Paper citation in several formats:

Silva, M. O., Nascimento, E., Izquierdo, Y., Lemos, M. and Casanova, M. (2025). Automated Evaluation of Database Conversational Agents. In Proceedings of the 21st International Conference on Web Information Systems and Technologies - WEBIST; ISBN 978-989-758-772-6; ISSN 2184-3252, SciTePress, pages 277-288. DOI: 10.5220/0013732900003985

@conference{webist25,
author={Matheus O. Silva and Eduardo Nascimento and Yenier Izquierdo and Melissa Lemos and Marco Casanova},
title={Automated Evaluation of Database Conversational Agents},
booktitle={Proceedings of the 21st International Conference on Web Information Systems and Technologies - WEBIST},
year={2025},
pages={277-288},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013732900003985},
isbn={978-989-758-772-6},
issn={2184-3252},
}

TY - CONF
JO - Proceedings of the 21st International Conference on Web Information Systems and Technologies - WEBIST
TI - Automated Evaluation of Database Conversational Agents
SN - 978-989-758-772-6
IS - 2184-3252
AU - Silva, M.
AU - Nascimento, E.
AU - Izquierdo, Y.
AU - Lemos, M.
AU - Casanova, M.
PY - 2025
SP - 277
EP - 288
DO - 10.5220/0013732900003985
PB - SciTePress