Using Retrieval Augmented Generation to Build the Context for Data-Driven Stories

Angelica Lo Duca

2024

Abstract

Data Storytelling (DS) is building data-driven stories to communicate the result of a data analysis process effectively. However, it may happen that data storytellers lack the competences to build compelling texts to include in the data-driven stories. In this paper, we propose a novel strategy to enhance DS by automatically generating context for data-driven stories, leveraging the capabilities of Generative AI (GenAI). This contextual information provides the background knowledge necessary for the audience to understand the story’s message fully. Our approach uses Retrieval Augmented Generation (RAG), which adapts large language models (LLMs), the core concept behind GenAI, to the specific domain required by a data-driven story. We demonstrate the effectiveness of our method through a practical case study on salmon aquaculture, showcasing the ability of GenAI to enrich DS with relevant context. We also describe some possible strategies to evaluate the generated context and ethical issues may raise when using GenAI in DS.

Download


Paper Citation


in Harvard Style

Lo Duca A. (2024). Using Retrieval Augmented Generation to Build the Context for Data-Driven Stories. In Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 1: IVAPP; ISBN 978-989-758-679-8, SciTePress, pages 690-696. DOI: 10.5220/0012419700003660


in Bibtex Style

@conference{ivapp24,
author={Angelica Lo Duca},
title={Using Retrieval Augmented Generation to Build the Context for Data-Driven Stories},
booktitle={Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 1: IVAPP},
year={2024},
pages={690-696},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012419700003660},
isbn={978-989-758-679-8},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 1: IVAPP
TI - Using Retrieval Augmented Generation to Build the Context for Data-Driven Stories
SN - 978-989-758-679-8
AU - Lo Duca A.
PY - 2024
SP - 690
EP - 696
DO - 10.5220/0012419700003660
PB - SciTePress