Exploring Text Classification and Emotion Detection Using Keras Models on Reddit Data

Yue Shao

2024

Abstract

Human communication has evolved alongside technology, with the internet playing a pivotal role in contemporary digital communications. This research focuses on the emotional aspect of text communication by developing a machine learning model capable of classifying emotional polarity within Reddit posts. Using a TensorFlow Keras model, the study explores how varying the number of epochs and batch sizes influences model accuracy. TextBlob was used to generate polarity labels for the large Reddit dataset, providing a supervised learning framework for the study. Despite initial issues with a Keras layer incompatibility and processing limitation, the final model achieved an accuracy of 0.8619 on a test sample of 24,053 Reddit posts. The research highlights the challenges encountered during model development, particularly related to time constraints and the computational limitations of Google Colab. The findings suggest that further optimization and larger datasets could improve performance in future iterations. This study demonstrates the potential of AI to analyze emotional content in large-scale communication data, contributing to the growing field of sentiment analysis and emotion classification in social media contexts.

Download


Paper Citation


in Harvard Style

Shao Y. (2024). Exploring Text Classification and Emotion Detection Using Keras Models on Reddit Data. In Proceedings of the 1st International Conference on Modern Logistics and Supply Chain Management - Volume 1: MLSCM; ISBN 978-989-758-738-2, SciTePress, pages 413-416. DOI: 10.5220/0013337100004558


in Bibtex Style

@conference{mlscm24,
author={Yue Shao},
title={Exploring Text Classification and Emotion Detection Using Keras Models on Reddit Data},
booktitle={Proceedings of the 1st International Conference on Modern Logistics and Supply Chain Management - Volume 1: MLSCM},
year={2024},
pages={413-416},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013337100004558},
isbn={978-989-758-738-2},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 1st International Conference on Modern Logistics and Supply Chain Management - Volume 1: MLSCM
TI - Exploring Text Classification and Emotion Detection Using Keras Models on Reddit Data
SN - 978-989-758-738-2
AU - Shao Y.
PY - 2024
SP - 413
EP - 416
DO - 10.5220/0013337100004558
PB - SciTePress