SUMMARIZING SETS OF CATEGORICAL SEQUENCES - Selecting and Visualizing Representative Sequences

Alexis Gabadinho; Gilbert Ritschard; Matthias Studer; Nicolas S. Müller

Research.Publish.Connect.

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

SUMMARIZING SETS OF CATEGORICAL SEQUENCES - Selecting and Visualizing Representative Sequences

Topics: Bioinformatics & Pattern Discovery; Clustering and Classification Methods; Data Reduction and Quality Assessment; Information Extraction; Mining High-Dimensional Data; Visual Data Mining and Data Visualization

In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 0IC3K, 62-69, 2009 , Funchal - Madeira, Portugal

Authors: Alexis Gabadinho ; Gilbert Ritschard ; Matthias Studer and Nicolas S. Müller

Affiliation: University of Geneva, Switzerland

Keyword(s): Categorical sequence data, Representativeness, Dissimilarity, Discrepancy of sequences, Summarizing sets of sequences, Visualization.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; BioInformatics & Pattern Discovery ; Clustering and Classification Methods ; Data Reduction and Quality Assessment ; Information Extraction ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Mining High-Dimensional Data ; Symbolic Systems ; Visual Data Mining and Data Visualization

Abstract: This paper is concerned with the summarization of a set of categorical sequence data. More specifically, the problem studied is the determination of the smallest possible number of representative sequences that ensure a given coverage of the whole set, i.e. that have together a given percentage of sequences in their neighborhood. The goal is to yield a representative set that exhibits the key features of the whole sequence data set and permits easy sounded interpretation. We propose an heuristic for determining the representative set that first builds a list of candidates using a representativeness score and then eliminates redundancy. We propose also a visualization tool for rendering the results and quality measures for evaluating them. The proposed tools have been implemented in TraMineR our R package for mining and visualizing sequence data and we demonstrate their efficiency on a real world example from social sciences. The methods are nonetheless by no way limited to social sci ence data and should prove useful in many other domains. (More)

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 3.139.97.157

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Gabadinho, A.; Ritschard, G.; Studer, M. and Müller, N. (2009). SUMMARIZING SETS OF CATEGORICAL SEQUENCES - Selecting and Visualizing Representative Sequences. In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2009) - KDIR; ISBN 978-989-674-011-5; ISSN 2184-3228, SciTePress, pages 62-69. DOI: 10.5220/0002300400620069

@conference{kdir09,
author={Alexis Gabadinho. and Gilbert Ritschard. and Matthias Studer. and Nicolas S. Müller.},
title={SUMMARIZING SETS OF CATEGORICAL SEQUENCES - Selecting and Visualizing Representative Sequences},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2009) - KDIR},
year={2009},
pages={62-69},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002300400620069},
isbn={978-989-674-011-5},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2009) - KDIR
TI - SUMMARIZING SETS OF CATEGORICAL SEQUENCES - Selecting and Visualizing Representative Sequences
SN - 978-989-674-011-5
IS - 2184-3228
AU - Gabadinho, A.
AU - Ritschard, G.
AU - Studer, M.
AU - Müller, N.
PY - 2009
SP - 62
EP - 69
DO - 10.5220/0002300400620069
PB - SciTePress