Authors: Carlos-Emiliano González-Gallardo 1 ; Juan-Manuel Torres-Moreno 2 ; Azucena Montes Rendón 3 and Gerardo Sierra 4

Affiliations: 1 Université d'Avignon et des Pays de Vaucluse, France ; 2 École Polytechnique de Montréal and Université d'Avignon et des Pays de Vaucluse, Canada ; 3 Centro Nacional de Investigación y Desarrollo Tecnológico, Mexico ; 4 GIL-Instituto de Ingeniería and Universidad Nacional Autónoma de México, Mexico

Keyword(s): Text Mining, Machine Learning, Classification, n-grams, POS, Blogs, Tweets, Social Network.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Clustering and Classification Methods ; Computational Intelligence ; Evolutionary Computing ; Information Extraction ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Machine Learning ; Mining Text and Semi-Structured Data ; Soft Computing ; Symbolic Systems

Abstract: In this paper we describe a dynamic normalization process applied to multilingual social network documents (Facebook and Twitter) to improve performance on the author profiling task for short texts. After normalization, character n-grams and POS-tag n-grams are extracted to capture the stylistic information encoded in the documents (emoticons, character flooding, capital letters, references to other users, hyperlinks, hashtags, etc.). In experiments with an SVM classifier, performance reached up to 90%.
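
The abstract does not spell out the normalization rules, so the following Python sketch is only an illustration of the general idea, not the authors' implementation. It assumes that hyperlinks, user references, and hashtags are mapped to placeholder tokens, that character flooding is collapsed to two repetitions, and that character n-grams are then counted as features. The placeholder tokens (`<url>`, `<user>`, `<tag>`) and the function names `normalize` and `char_ngrams` are hypothetical.

```python
import re
from collections import Counter

# Hypothetical patterns and placeholder tokens; the paper's actual
# normalization rules are not given in the abstract.
URL_RE = re.compile(r"https?://\S+")
MENTION_RE = re.compile(r"@\w+")
HASHTAG_RE = re.compile(r"#\w+")
FLOOD_RE = re.compile(r"(.)\1{2,}")  # the same character three or more times in a row


def normalize(text):
    """Replace social-network-specific elements with placeholder tokens."""
    text = URL_RE.sub("<url>", text)        # hyperlinks first, so '#' or '@' inside URLs is untouched
    text = MENTION_RE.sub("<user>", text)   # references to other users
    text = HASHTAG_RE.sub("<tag>", text)    # hashtags
    text = FLOOD_RE.sub(r"\1\1", text)      # character flooding: keep two repetitions
    return text


def char_ngrams(text, n):
    """Count overlapping character n-grams of length n."""
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


if __name__ == "__main__":
    doc = normalize("Sooooo cool @bob http://x.co #fun")
    print(doc)
    print(char_ngrams(doc, 3).most_common(5))
```

Counts such as these, possibly combined with n-grams over POS tags, could then be fed as a feature vector to a classifier such as an SVM. Keeping two repetitions rather than one preserves the fact that flooding occurred, which is itself a stylistic cue.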

CC BY-NC-ND 4.0

Paper citation in several formats:
González-Gallardo, C.; Torres-Moreno, J.; Montes Rendón, A. and Sierra, G. (2016). Efficient Social Network Multilingual Classification using Character, POS n-grams and Dynamic Normalization. In Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2016) - KDIR; ISBN 978-989-758-203-5; ISSN 2184-3228, SciTePress, pages 307-314. DOI: 10.5220/0006052803070314

@conference{kdir16,
author={Carlos{-}Emiliano González{-}Gallardo and Juan{-}Manuel Torres{-}Moreno and Azucena {Montes Rendón} and Gerardo Sierra},
title={Efficient Social Network Multilingual Classification using Character, POS n-grams and Dynamic Normalization},
booktitle={Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2016) - KDIR},
year={2016},
pages={307--314},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006052803070314},
isbn={978-989-758-203-5},
issn={2184-3228},
}

TY - CONF
JO - Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2016) - KDIR
TI - Efficient Social Network Multilingual Classification using Character, POS n-grams and Dynamic Normalization
SN - 978-989-758-203-5
IS - 2184-3228
AU - González-Gallardo, C.
AU - Torres-Moreno, J.
AU - Montes Rendón, A.
AU - Sierra, G.
PY - 2016
SP - 307
EP - 314
DO - 10.5220/0006052803070314
PB - SciTePress
ER -