Exploring the Impact of Toxic Comments in Code Quality

Jaime Sayago-Heredia, Gustavo Chango, Ricardo Pérez-Castillo, Mario Piattini

2022

Abstract

Software development has an important human-side, which implies that developers' feelings have a significant impact to software development and could affect developers' quality, productivity, and performance. In this paper, we explore the process to find, understand and relate the effects of toxic emotions on code quality. We propose a tool and sentiments dataset, a clean set of commit messages, extracted from SonarQube code quality metrics and toxic comments obtained from GitHub. Moreover, we perform a preliminary statistical analysis of the dataset. We apply natural language processing techniques to identify toxic developer sentiments on commits that could impact code quality. Our study describes data retrieval process along with tools used for performing a preliminary analysis. The preliminary dataset is available in CSV format to facilitate queries on the data and to investigate in depth factors that impact developer emotions. Preliminary results imply that there is a relationship between toxic comments and code quality that may affect the quality of the software project. Future research will be the development of a complete dataset and an in-depth analysis for efficiency validation experiments along with a linear regression. Finally, we will estimate the code quality as a function of developers' toxic comments.

Download


Paper Citation


in Harvard Style

Sayago-Heredia J., Chango G., Pérez-Castillo R. and Piattini M. (2022). Exploring the Impact of Toxic Comments in Code Quality. In Proceedings of the 17th International Conference on Evaluation of Novel Approaches to Software Engineering - Volume 1: ENASE, ISBN 978-989-758-568-5, pages 335-343. DOI: 10.5220/0011039700003176


in Bibtex Style

@conference{enase22,
author={Jaime Sayago-Heredia and Gustavo Chango and Ricardo Pérez-Castillo and Mario Piattini},
title={Exploring the Impact of Toxic Comments in Code Quality},
booktitle={Proceedings of the 17th International Conference on Evaluation of Novel Approaches to Software Engineering - Volume 1: ENASE,},
year={2022},
pages={335-343},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011039700003176},
isbn={978-989-758-568-5},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 17th International Conference on Evaluation of Novel Approaches to Software Engineering - Volume 1: ENASE,
TI - Exploring the Impact of Toxic Comments in Code Quality
SN - 978-989-758-568-5
AU - Sayago-Heredia J.
AU - Chango G.
AU - Pérez-Castillo R.
AU - Piattini M.
PY - 2022
SP - 335
EP - 343
DO - 10.5220/0011039700003176