loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Nina Rizun 1 and Wojciech Waloszek 2

Affiliations: 1 Department of Applied Informatics in Management, Gdansk University of Technology, Gdansk and Poland ; 2 Department of Software Engineering, Gdansk University of Technology, Gdansk and Poland

Keyword(s): Textual Content Classification, Hierarchical Sentiment Dictionary, Text Tonality, Evaluation the Quality, Bigrams, Polarity Scores.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Mining Text and Semi-Structured Data ; Symbolic Systems

Abstract: This paper presents the methodology of Textual Content Classification, which is based on a combination of algorithms: preliminary formation of a contextual framework for the texts in particular problem area; manual creation of the Hierarchical Sentiment Dictionary (HSD) on the basis of a topically-oriented Corpus; tonality texts recognition via using HSD for analysing the documents as a collection of topically completed fragments (paragraphs). For verification of the proposed methodology a case study of Polish-language film reviews Corpora was used. The main scientific contributions of this research are: writing style of the analyzed text determines the possibility of adaptation of the Texts Classification algorithms; Hierarchically-oriented Structure of the HSD allows customizing the classification process to qualitative recognition of text tonality in the context of individual paragraphs topics; texts of Persuasive style most often are initially empowered by authors with a certain tonality. The tone, expressed in the author's opinion, effects the qualitative indicators of sentiment recognition. Negative emotions of the author usually reduce the level of vocabulary variability as well as the variety of topics raised in the document, but simultaneously increase the level of unpredictability of words contextually used with both positive and negative emotional coloring. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 44.222.129.73

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Rizun, N. and Waloszek, W. (2018). Methodology for Text Classification using Manually Created Corpora-based Sentiment Dictionary. In Proceedings of the 10th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2018) - KDIR; ISBN 978-989-758-330-8; ISSN 2184-3228, SciTePress, pages 212-220. DOI: 10.5220/0006932602120220

@conference{kdir18,
author={Nina Rizun. and Wojciech Waloszek.},
title={Methodology for Text Classification using Manually Created Corpora-based Sentiment Dictionary},
booktitle={Proceedings of the 10th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2018) - KDIR},
year={2018},
pages={212-220},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006932602120220},
isbn={978-989-758-330-8},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the 10th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2018) - KDIR
TI - Methodology for Text Classification using Manually Created Corpora-based Sentiment Dictionary
SN - 978-989-758-330-8
IS - 2184-3228
AU - Rizun, N.
AU - Waloszek, W.
PY - 2018
SP - 212
EP - 220
DO - 10.5220/0006932602120220
PB - SciTePress