loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Kostas Fragos 1 ; Yannis Maistros 1 and Christos Skourlas 2

Affiliations: 1 National Technical University of Athens, Greece ; 2 Technical Educational Institute of Athens, Greece

Abstract: In this paper two statistical methods for extracting collocations from text corpora written in Modern Greek are described, the mean and variance method and a method based on the X2 test. The mean and variance method calculates distances (“offsets”) between words in a corpus and looks for specific patterns of distance. The X2 test is combined with the formulation of a null hypothesis H0 for a sample of occurrences and we check if there are associations between the words. The X2 testing does not assume that the words in the corpus have normally distributed probabilities and hence it seems to be more flexible. The two methods extract interesting collocations that are useful in various applications e.g. computational lexicography, language generation and machine translation.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.191.211.66

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Fragos, K.; Maistros, Y. and Skourlas, C. (2004). Descovering Collocations in Modern Greek Language. In Proceedings of the 1st International Workshop on Natural Language Understanding and Cognitive Science (ICEIS 2004) - NLUCS; ISBN 972-8865-05-8, SciTePress, pages 151-158. DOI: 10.5220/0002667101510158

@conference{nlucs04,
author={Kostas Fragos. and Yannis Maistros. and Christos Skourlas.},
title={Descovering Collocations in Modern Greek Language},
booktitle={Proceedings of the 1st International Workshop on Natural Language Understanding and Cognitive Science (ICEIS 2004) - NLUCS},
year={2004},
pages={151-158},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002667101510158},
isbn={972-8865-05-8},
}

TY - CONF

JO - Proceedings of the 1st International Workshop on Natural Language Understanding and Cognitive Science (ICEIS 2004) - NLUCS
TI - Descovering Collocations in Modern Greek Language
SN - 972-8865-05-8
AU - Fragos, K.
AU - Maistros, Y.
AU - Skourlas, C.
PY - 2004
SP - 151
EP - 158
DO - 10.5220/0002667101510158
PB - SciTePress