Authors:
Melissa Silva
1
and
João Faria
2
;
1
Affiliations:
1
Faculty of Engineering, University of Porto, Porto, Portugal
;
2
INESC TEC, Porto, Portugal
Keyword(s):
Requirements Engineering, Social Media, Requirements Mining.
Abstract:
Requirements Engineering (RE) is crucial for product success but challenging for software with a broad user base, such as streaming platforms. Developers must analyze vast user feedback, but manual methods are impractical due to volume and diversity. This research addresses these challenges by automating the collection, filtering, summarization, and clustering of user feedback from social media, suggesting feature requests and bug fixes through an interactive platform. Data from Reddit, Twitter, iTunes, and Google Play is gathered via web crawlers and APIs and processed using a novel combination of natural language processing (NLP), machine learning (ML), large language models (LLMs), and incremental clustering. We evaluated our approach with a partner company in the streaming industry, extracting 66,168 posts related to 10 streaming services and identifying 22,847 as relevant with an ML classifier (75.5% precision, 74.2% recall). From the top 100 posts, a test user found 89 relevant
and generated 47 issues in 80 minutes—a significant reduction in effort compared to a manual process. A usability study with six specialists yielded a SUS score of 83.33 (“Good”) and very positive feedback. The platform reduces cognitive overload by prioritizing high-impact posts and suggesting structured issue details, ensuring focus on insights while supporting scalability.
(More)