Authors: Sebastian Lindner and Winfried Höhn

Affiliation: University of Würzburg, Germany

ISBN: 978-989-8565-29-7

Keyword(s): References Parsing, Bibliography, Conditional Random Fields (CRFs), Constraint-based Learning, Information Extraction, Information Retrieval, Machine Learning, Sequence Labeling, Semi-supervised Learning.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence; Clustering and Classification Methods; Computational Intelligence; Data Reduction and Quality Assessment; Evolutionary Computing; Information Extraction; Knowledge Discovery and Information Retrieval; Knowledge-Based Systems; Machine Learning; Soft Computing; Symbolic Systems

Abstract: This paper presents key components of our workflow for handling bibliographic information. We compare several approaches for parsing bibliographic references using conditional random fields (CRFs), concentrating on cases where only a few labeled training instances are available. To improve labeling results, prior knowledge about the bibliography domain is used when training the CRFs with different constraint models. We show that our labeling approach achieves results comparable to, and in some cases better than, other state-of-the-art approaches. Finally, we show that for about half of our reference strings a correlation between journal title, volume, and publishing year could be used to identify the correct journal, even when the journal title abbreviation was ambiguous.
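The journal disambiguation idea mentioned at the end of the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes, purely for illustration, that each candidate journal publishes a fixed number of volumes per year, so that the publishing year can be predicted from the volume number. The class `JournalCandidate`, the function `disambiguate`, the `tolerance` parameter, and the journal entries are all hypothetical.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class JournalCandidate:
    """One possible expansion of an ambiguous journal abbreviation (hypothetical data)."""
    title: str
    first_year: int                 # year in which volume 1 appeared
    volumes_per_year: float = 1.0   # assumed constant publication rate


def predicted_year(candidate: JournalCandidate, volume: int) -> float:
    """Year in which `volume` would appear under the constant-rate assumption."""
    return candidate.first_year + (volume - 1) / candidate.volumes_per_year


def disambiguate(
    candidates: Sequence[JournalCandidate],
    volume: int,
    year: int,
    tolerance: float = 1.0,
) -> Optional[JournalCandidate]:
    """Return the single candidate whose volume/year pattern fits the reference.

    Returns None when no candidate fits within `tolerance` years, or when
    the best two candidates fit equally well (the reference stays ambiguous).
    """
    scored = [(abs(predicted_year(c, volume) - year), c) for c in candidates]
    plausible = sorted(
        (pair for pair in scored if pair[0] <= tolerance),
        key=lambda pair: pair[0],
    )
    if not plausible:
        return None
    if len(plausible) > 1 and plausible[0][0] == plausible[1][0]:
        return None
    return plausible[0][1]


# Two invented journals that share the same abbreviation.
candidates = [
    JournalCandidate("Hypothetical Journal A", first_year=1950),
    JournalCandidate("Hypothetical Journal B", first_year=1990),
]
match = disambiguate(candidates, volume=30, year=1979)
print(match.title if match else "ambiguous")  # prints "Hypothetical Journal A"
```

In practice the relation between volume and year would more plausibly be estimated from already resolved references than from a fixed rate; the constant rate here only keeps the sketch short.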

CC BY-NC-ND 4.0

Paper citation in several formats:
Lindner, S. and Höhn, W. (2012). Parsing and Maintaining Bibliographic References - Semi-supervised Learning of Conditional Random Fields with Constraints. In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2012) ISBN 978-989-8565-29-7, pages 233-238. DOI: 10.5220/0004138602330238

@conference{kdir12,
author={Sebastian Lindner and Winfried Höhn},
title={Parsing and Maintaining Bibliographic References - Semi-supervised Learning of Conditional Random Fields with Constraints},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2012)},
year={2012},
pages={233-238},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004138602330238},
isbn={978-989-8565-29-7},
}

TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2012)
TI - Parsing and Maintaining Bibliographic References - Semi-supervised Learning of Conditional Random Fields with Constraints
SN - 978-989-8565-29-7
AU - Lindner, S.
AU - Höhn, W.
PY - 2012
SP - 233
EP - 238
DO - 10.5220/0004138602330238
