Authors: Diogo Campos 1 ; Rodrigo Rocha Silva 2 and Jorge Bernardino 3

Affiliations: 1 Polytechnic of Coimbra - ISEC, Rua Pedro Nunes, Quinta da Nora, 3030-199 Coimbra, Portugal ; 2 Centre of Informatics and Systems of University of Coimbra, Pinhal de Marrocos, 3030-290 Coimbra, Portugal, and FATEC Mogi das Cruzes, São Paulo Technological College, 08773-600 Mogi das Cruzes, Brazil ; 3 Polytechnic of Coimbra - ISEC, Rua Pedro Nunes, Quinta da Nora, 3030-199 Coimbra, Portugal, and Centre of Informatics and Systems of University of Coimbra, Pinhal de Marrocos, 3030-290 Coimbra, Portugal

Keyword(s): Text Mining, Sentiment Analysis, Text Cube, Machine Learning, Stemming.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Clustering and Classification Methods ; Computational Intelligence ; Evolutionary Computing ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Machine Learning ; Mining Text and Semi-Structured Data ; Pre-Processing and Post-Processing for Data Mining ; Soft Computing ; Symbolic Systems

Abstract: Text Mining is the process of extracting interesting and non-trivial patterns or knowledge from unstructured text documents. Hotels use reviews to assess client satisfaction with their services and facilities. However, this kind of large, unstructured data cannot be handled manually, so we use OLAP techniques and Text Cube to model and manage the text data. This raises a problem: the reviews must be separated into two classes, positive and negative, and for that we use Sentiment Analysis. Nevertheless, do we really need all the words of a review to make the right classification? In this paper, we study the impact of word restriction on text classification. To do that, we create word domains (sets of words that belong to the Hotel Domain). First, we use an algorithm that pre-processes the text, using our created domains like stop words. In the experimental evaluation, we use four classifiers to classify the text: Naïve Bayes, Decision Tree, Random Forest, and Support Vector Machine.
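
The pre-processing and classification steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the word list, the toy reviews, and the names HOTEL_DOMAIN_WORDS, preprocess and NaiveBayes are all hypothetical, and it assumes that "using the domains like stop words" means filtering the domain words out of each review. Only one of the four classifiers, Naïve Bayes, is shown, written in plain Python with add-one smoothing.

```python
import math
from collections import Counter, defaultdict

# Hypothetical hotel-domain word list; the paper's actual lists are not given here.
HOTEL_DOMAIN_WORDS = {"hotel", "room", "staff", "breakfast", "pool"}


def preprocess(text, restricted_words):
    """Lowercase, keep alphabetic tokens, and drop the restricted words."""
    tokens = (tok.strip(".,!?;:()\"'") for tok in text.lower().split())
    return [tok for tok in tokens if tok.isalpha() and tok not in restricted_words]


class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, docs, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for tokens, label in zip(docs, labels):
            self.word_counts[label].update(tokens)
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def predict(self, tokens):
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(n_docs / total_docs)
            total_words = sum(self.word_counts[label].values())
            for tok in tokens:
                if tok in self.vocab:  # ignore words never seen in training
                    count = self.word_counts[label][tok]
                    score += math.log((count + 1) / (total_words + len(self.vocab)))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy reviews, invented for illustration only.
train = [
    ("The room was clean and the staff were friendly", "positive"),
    ("Great breakfast and a lovely pool", "positive"),
    ("The room was dirty and the staff were rude", "negative"),
    ("Terrible breakfast and a noisy hotel", "negative"),
]

docs = [preprocess(text, HOTEL_DOMAIN_WORDS) for text, _ in train]
labels = [label for _, label in train]
model = NaiveBayes().fit(docs, labels)

tokens = preprocess("Friendly staff and a clean room", HOTEL_DOMAIN_WORDS)
prediction = model.predict(tokens)
print(tokens, prediction)
```

In this toy setting the restricted words never reach the classifier, so the decision rests on the remaining sentiment-bearing words such as "friendly" and "clean". Measuring the effect of the restriction would mean running the same experiment with and without the filter and comparing the results.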

CC BY-NC-ND 4.0


Paper citation in several formats:
Campos, D.; Silva, R. and Bernardino, J. (2019). Text Mining in Hotel Reviews: Impact of Words Restriction in Text Classification. In Proceedings of the 11th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2019) - KDIR; ISBN 978-989-758-382-7; ISSN 2184-3228, SciTePress, pages 442-449. DOI: 10.5220/0008346904420449

@conference{kdir19,
author={Diogo Campos and Rodrigo Rocha Silva and Jorge Bernardino},
title={Text Mining in Hotel Reviews: Impact of Words Restriction in Text Classification},
booktitle={Proceedings of the 11th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2019) - KDIR},
year={2019},
pages={442-449},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0008346904420449},
isbn={978-989-758-382-7},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the 11th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2019) - KDIR
TI - Text Mining in Hotel Reviews: Impact of Words Restriction in Text Classification
SN - 978-989-758-382-7
IS - 2184-3228
AU - Campos, D.
AU - Silva, R.
AU - Bernardino, J.
PY - 2019
SP - 442
EP - 449
DO - 10.5220/0008346904420449
PB - SciTePress