loading
Papers

Research.Publish.Connect.

Paper

Authors: Rui Portocarrero Sarmento 1 ; Mário Cordeiro 1 ; Pavel Brazdil 2 and João Gama 2

Affiliations: 1 University of Porto, Portugal ; 2 LIAAD-INESC TEC, Portugal

ISBN: 978-989-758-298-1

Keyword(s): Automatic Keyword Extraction, Incremental PageRank, Data Streams, Text Mining, Incremental TextRank.

Abstract: Text Mining and NLP techniques are a hot topic nowadays. Researchers thrive to develop new and faster algorithms to cope with larger amounts of data. Particularly, text data analysis has been increasing in interest due to the growth of social networks media. Given this, the development of new algorithms and/or the upgrade of existing ones is now a crucial task to deal with text mining problems under this new scenario. In this paper, we present an update to TextRank, a well-known implementation used to do automatic keyword extraction from text, adapted to deal with streams of text. In addition, we present results for this implementation and compare them with the batch version. Major improvements are lowest computation times for the processing of the same text data, in a streaming environment, both in sliding window and incremental setups. The speedups obtained in the experimental results are significant. Therefore the approach was considered valid and useful to the research co mmunity. (More)

PDF ImageFull Text

Download
CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.227.249.234

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Sarmento, R.; Cordeiro, M.; Brazdil, P. and Gama, J. (2018). Incremental TextRank - Automatic Keyword Extraction for Text Streams.In Proceedings of the 20th International Conference on Enterprise Information Systems - Volume 1: ICEIS, ISBN 978-989-758-298-1, pages 363-370. DOI: 10.5220/0006639703630370

@conference{iceis18,
author={Rui Portocarrero Sarmento. and Mário Cordeiro. and Pavel Brazdil. and João Gama.},
title={Incremental TextRank - Automatic Keyword Extraction for Text Streams},
booktitle={Proceedings of the 20th International Conference on Enterprise Information Systems - Volume 1: ICEIS,},
year={2018},
pages={363-370},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006639703630370},
isbn={978-989-758-298-1},
}

TY - CONF

JO - Proceedings of the 20th International Conference on Enterprise Information Systems - Volume 1: ICEIS,
TI - Incremental TextRank - Automatic Keyword Extraction for Text Streams
SN - 978-989-758-298-1
AU - Sarmento, R.
AU - Cordeiro, M.
AU - Brazdil, P.
AU - Gama, J.
PY - 2018
SP - 363
EP - 370
DO - 10.5220/0006639703630370

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.