loading
Papers

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Carlos-Emiliano González-Gallardo 1 ; Juan-Manuel Torres-Moreno 2 ; Azucena Montes Rendón 3 and Gerardo Sierra 4

Affiliations: 1 Université d'Avignon et des Pays de Vaucluse, France ; 2 École Polytechnique de Montréal and Université d'Avignon et des Pays de Vaucluse, Canada ; 3 Centro Nacional de Investigación y Desarrollo Tecnológico, Mexico ; 4 GIL-Instituto de Ingeniería and Universidad Nacional Autónoma de México, Mexico

ISBN: 978-989-758-203-5

Keyword(s): Text Mining, Machine Learning, Classification, n-grams, POS, Blogs, Tweets, Social Network.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Clustering and Classification Methods ; Computational Intelligence ; Evolutionary Computing ; Information Extraction ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Machine Learning ; Mining Text and Semi-Structured Data ; Soft Computing ; Symbolic Systems

Abstract: In this paper we describe a dynamic normalization process applied to social network multilingual documents (Facebook and Twitter) to improve the performance of the Author profiling task for short texts. After the normalization process, n-grams of characters and n-grams of POS tags are obtained to extract all the possible stylistic information encoded in the documents (emoticons, character flooding, capital letters, references to other users, hyperlinks, hashtags, etc.). Experiments with SVM showed up to 90% of performance.

PDF ImageFull Text

Download
CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.93.75.242

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
González-Gallardo, C.; Torres-Moreno, J.; Montes Rendón, A. and Sierra, G. (2016). Efficient Social Network Multilingual Classification using Character, POS n-grams and Dynamic Normalization.In Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2016) ISBN 978-989-758-203-5, pages 307-314. DOI: 10.5220/0006052803070314

@conference{kdir16,
author={Carlos{-}Emiliano González{-}Gallardo. and Juan{-}Manuel Torres{-}Moreno. and Azucena Montes Rendón. and Gerardo Sierra.},
title={Efficient Social Network Multilingual Classification using Character, POS n-grams and Dynamic Normalization},
booktitle={Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2016)},
year={2016},
pages={307-314},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006052803070314},
isbn={978-989-758-203-5},
}

TY - CONF

JO - Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2016)
TI - Efficient Social Network Multilingual Classification using Character, POS n-grams and Dynamic Normalization
SN - 978-989-758-203-5
AU - González-Gallardo, C.
AU - Torres-Moreno, J.
AU - Montes Rendón, A.
AU - Sierra, G.
PY - 2016
SP - 307
EP - 314
DO - 10.5220/0006052803070314

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.