Towards an Accurate Prediction of the Question Quality on Stack Overflow using a Deep-Learning-Based NLP Approach

László Tóth, Balázs Nagy, Dávid Janthó, László Vidács, Tibor Gyimóthy

2019

Abstract

Online question answering (Q&A) forums like Stack Overflow have been playing an increasingly important role in supporting the daily tasks of developers. Stack Overflow can be considered as a meeting point of experienced developers and those who are looking for a solution for a specific problem. Since anyone with any background and experience level can ask and respond to questions, the community tries to use different solutions to maintain quality, such as closing and deleting inappropriate posts. As over 8,000 posts arrive on Stack Overflow every day, the effective automatic filtering of them is essential. In this paper, we present a novel approach for classifying questions based exclusively on their linguistic and semantic features using deep learning method. Our binary classifier relying on the textual properties of posts can predict whether the question is to be closed with an accuracy of 74% similar to the results of previous metrics-based models. In accordance with our findings we conclude that by combining deep learning and natural language processing methods, the maintenance of quality at Q&A forums could be supported using only the raw text of posts.

Download


Paper Citation


in Harvard Style

Tóth L., Nagy B., Janthó D., Vidács L. and Gyimóthy T. (2019). Towards an Accurate Prediction of the Question Quality on Stack Overflow using a Deep-Learning-Based NLP Approach.In Proceedings of the 14th International Conference on Software Technologies - Volume 1: ICSOFT, ISBN 978-989-758-379-7, pages 631-639. DOI: 10.5220/0007971306310639


in Bibtex Style

@conference{icsoft19,
author={László Tóth and Balázs Nagy and Dávid Janthó and László Vidács and Tibor Gyimóthy},
title={Towards an Accurate Prediction of the Question Quality on Stack Overflow using a Deep-Learning-Based NLP Approach},
booktitle={Proceedings of the 14th International Conference on Software Technologies - Volume 1: ICSOFT,},
year={2019},
pages={631-639},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007971306310639},
isbn={978-989-758-379-7},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 14th International Conference on Software Technologies - Volume 1: ICSOFT,
TI - Towards an Accurate Prediction of the Question Quality on Stack Overflow using a Deep-Learning-Based NLP Approach
SN - 978-989-758-379-7
AU - Tóth L.
AU - Nagy B.
AU - Janthó D.
AU - Vidács L.
AU - Gyimóthy T.
PY - 2019
SP - 631
EP - 639
DO - 10.5220/0007971306310639