loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Arsenii Rasov 1 ; Ilya Obabkov 1 ; Eckehard Olbrich 2 and Ivan P. Yamshchikov 2

Affiliations: 1 Ural Federal University, Mira Street, 19, Yekaterinburg, Russia ; 2 Max Planck Institute for Mathematics in the Sciences, Inselstrasse 22, Leipzig, Germany

Keyword(s): Electoral Programs, Text Corpus, Classification of Political Texts.

Abstract: In this position paper, we implement an automatic coding algorithm for electoral programs from the Manifesto Project Database. We propose a new approach that works with new words that are out of the training vocabulary, replacing them with the words from training vocabulary that are the closest neighbors in the space of word embeddings. A set of simulations demonstrates that the proposed algorithm shows classification accuracy comparable to the state-of-the-art benchmarks for monolingual multi-label classification. The agreement levels for the algorithm is comparable with manual labeling. The results for a broad set of model hyperparam-eters are compared to each other.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.237.65.102

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Rasov, A.; Obabkov, I.; Olbrich, E. and Yamshchikov, I. (2020). Text Classification for Monolingual Political Manifestos with Words Out of Vocabulary. In Proceedings of the 5th International Conference on Complexity, Future Information Systems and Risk - COMPLEXIS; ISBN 978-989-758-427-5; ISSN 2184-5034, SciTePress, pages 149-154. DOI: 10.5220/0009792101490154

@conference{complexis20,
author={Arsenii Rasov. and Ilya Obabkov. and Eckehard Olbrich. and Ivan P. Yamshchikov.},
title={Text Classification for Monolingual Political Manifestos with Words Out of Vocabulary},
booktitle={Proceedings of the 5th International Conference on Complexity, Future Information Systems and Risk - COMPLEXIS},
year={2020},
pages={149-154},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0009792101490154},
isbn={978-989-758-427-5},
issn={2184-5034},
}

TY - CONF

JO - Proceedings of the 5th International Conference on Complexity, Future Information Systems and Risk - COMPLEXIS
TI - Text Classification for Monolingual Political Manifestos with Words Out of Vocabulary
SN - 978-989-758-427-5
IS - 2184-5034
AU - Rasov, A.
AU - Obabkov, I.
AU - Olbrich, E.
AU - Yamshchikov, I.
PY - 2020
SP - 149
EP - 154
DO - 10.5220/0009792101490154
PB - SciTePress