Authors: Georges Dubus, Mathieu Bruyen and Nacéra Bennacer

Affiliation: E3S - SUPELEC, France

Keyword(s): Information retrieval, Text mining, Partitioning clustering, k-means, RSS feeds, XML, TFIDF.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence; Data Engineering; Knowledge Discovery and Information Retrieval; Knowledge-Based Systems; Ontologies and the Semantic Web; Soft Computing; Symbolic Systems; Web Information Systems and Technologies; Web Interfaces and Applications; Web Mining; Web Personalization

Abstract: Really Simple Syndication (RSS) feeds present new challenges to information retrieval technologies. In this paper we propose an RSS feed retrieval approach that aims to give the user a personalized view of feed items and to make their content easier to access. We define several filters to construct the vocabulary used in the text describing feed items. This filtering takes into account both the lexical category and the frequency of terms. The set of feed items is then represented in an m-dimensional vector space. The k-means clustering algorithm, with an adapted centroid computation and distance measure, is applied to find clusters automatically. The clusters, indexed by relevant terms, can then be refined, labeled and browsed by the user. We evaluate the approach on a collection of feed items gathered from news sites. The resulting clusters show good cohesion and separation, providing meaningful classes for organizing the information and classifying new feed items.
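The pipeline outlined in the abstract (vocabulary filtering, vector-space representation, k-means clustering, term-based labeling) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not specify the lexical filters, the term weighting, the adapted centroid computation or the distance measure, so the sketch assumes a small stop-word list, a document-frequency threshold, TF-IDF weights (suggested by the TFIDF keyword), cosine distance and a renormalized mean centroid. All function names and the sample items are hypothetical.

```python
import math
from collections import Counter

# Assumption: a tiny stop-word list stands in for the paper's lexical-category filter.
STOPWORDS = {"the", "a", "an", "of", "in", "on", "to", "and", "for", "is"}

def tokenize(text):
    """Lowercase, split on whitespace, keep alphabetic non-stop-words."""
    return [w for w in text.lower().split() if w.isalpha() and w not in STOPWORDS]

def tfidf_vectors(docs, min_df=2):
    """Build a frequency-filtered vocabulary and unit-length TF-IDF vectors."""
    df = Counter(t for d in docs for t in set(d))
    vocab = sorted(t for t, n in df.items() if n >= min_df)  # frequency filter
    vectors = []
    for d in docs:
        tf = Counter(d)
        vec = [tf[t] * math.log(len(docs) / df[t]) for t in vocab]
        norm = math.sqrt(sum(x * x for x in vec))
        vectors.append([x / norm for x in vec] if norm else vec)
    return vocab, vectors

def cosine_distance(u, v):
    """Cosine distance for vectors that are already unit length."""
    return 1.0 - sum(a * b for a, b in zip(u, v))

def kmeans(vectors, k, max_iter=20):
    """k-means with cosine distance and deterministic farthest-first seeding."""
    centroids = [vectors[0]]
    while len(centroids) < k:
        centroids.append(max(
            vectors, key=lambda v: min(cosine_distance(v, c) for c in centroids)))
    labels = None
    for _ in range(max_iter):
        new_labels = [min(range(k), key=lambda j: cosine_distance(v, centroids[j]))
                      for v in vectors]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            if members:  # an empty cluster keeps its previous centroid
                mean = [sum(col) / len(members) for col in zip(*members)]
                norm = math.sqrt(sum(x * x for x in mean))
                centroids[j] = [x / norm for x in mean] if norm else mean
    return labels, centroids

def top_terms(centroid, vocab, n=2):
    """Label a cluster with its n highest-weighted vocabulary terms."""
    return [t for _, t in sorted(zip(centroid, vocab), reverse=True)[:n]]

# Hypothetical feed item titles, used only to exercise the sketch.
items = [
    "football match ends in draw",
    "football team wins match",
    "election results announced today",
    "voters await election results",
]
vocab, vectors = tfidf_vectors([tokenize(t) for t in items])
labels, centroids = kmeans(vectors, k=2)
cluster_terms = [top_terms(c, vocab) for c in centroids]
```

In this toy run the two sports items and the two election items fall into separate clusters, and each cluster is indexed by its shared terms. A real collection would need a richer lexical filter (for example part-of-speech tagging) and a principled choice of k.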

CC BY-NC-ND 4.0

Paper citation in several formats:
Dubus, G.; Bruyen, M. and Bennacer, N. (2010). SUPPORTING INFORMATION RETRIEVAL IN RSS FEEDS. In Proceedings of the 6th International Conference on Web Information Systems and Technology - Volume 1: WEBIST; ISBN 978-989-674-025-2; ISSN 2184-3252, SciTePress, pages 307-312. DOI: 10.5220/0002809103070312

@conference{webist10,
author={Georges Dubus and Mathieu Bruyen and Nacéra Bennacer},
title={SUPPORTING INFORMATION RETRIEVAL IN RSS FEEDS},
booktitle={Proceedings of the 6th International Conference on Web Information Systems and Technology - Volume 1: WEBIST},
year={2010},
pages={307-312},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002809103070312},
isbn={978-989-674-025-2},
issn={2184-3252},
}

TY - CONF

JO - Proceedings of the 6th International Conference on Web Information Systems and Technology - Volume 1: WEBIST
TI - SUPPORTING INFORMATION RETRIEVAL IN RSS FEEDS
SN - 978-989-674-025-2
IS - 2184-3252
AU - Dubus, G.
AU - Bruyen, M.
AU - Bennacer, N.
PY - 2010
SP - 307
EP - 312
DO - 10.5220/0002809103070312
PB - SciTePress