loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Alaa El-Ebshihy ; Nagwa El-Makky and Khaled Nagi

Affiliation: Dept. of Computer and Systems Engineering, Faculty of Engineering, Alexandria University and Egypt

Keyword(s): Linguistic Shift, Semantic Change, Google Books Ngram, FastText, Time Series Analysis, Computational Linguistics.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Computational Intelligence ; Evolutionary Computing ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Machine Learning ; Mining Text and Semi-Structured Data ; Soft Computing ; Structured Data Analysis and Statistical Methods ; Symbolic Systems

Abstract: The availability of large historical corpora, such as Google Books Ngram, makes it possible to extract various meta information about the evolution of human languages. Together with advances in machine learning techniques, researchers recently use the huge corpora to track cultural and linguistic shifts in words and terms over time. In this paper, we develop a new approach to quantitatively recognize semantic changes of words during the period between 1800 and 1990. We use the state-of-the-art FastText approach to construct word embedding for Google Books Ngram corpus for the decades within the time period 1800-1990. We use a time series analysis to identify words that have a statistically significant change in the period between 1900 and 1990. We conduct a performance evaluation study to compare our approach against related work, we show that our system is more robust against morphological language variations.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.137.170.183

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
El-Ebshihy, A.; El-Makky, N. and Nagi, K. (2018). Using Google Books Ngram in Detecting Linguistic Shifts over Time. In Proceedings of the 10th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2018) - KDIR; ISBN 978-989-758-330-8; ISSN 2184-3228, SciTePress, pages 332-339. DOI: 10.5220/0007188703320339

@conference{kdir18,
author={Alaa El{-}Ebshihy. and Nagwa El{-}Makky. and Khaled Nagi.},
title={Using Google Books Ngram in Detecting Linguistic Shifts over Time},
booktitle={Proceedings of the 10th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2018) - KDIR},
year={2018},
pages={332-339},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007188703320339},
isbn={978-989-758-330-8},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the 10th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2018) - KDIR
TI - Using Google Books Ngram in Detecting Linguistic Shifts over Time
SN - 978-989-758-330-8
IS - 2184-3228
AU - El-Ebshihy, A.
AU - El-Makky, N.
AU - Nagi, K.
PY - 2018
SP - 332
EP - 339
DO - 10.5220/0007188703320339
PB - SciTePress