loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock
Using Conditional Random Fields with Constraints to Train Support Vector Machines - Locating and Parsing Bibliographic References

Topics: Clustering and Classification Methods; Information Extraction; Machine Learning; Structured Data Analysis and Statistical Methods; Web Mining

Author: Sebastian Lindner

Affiliation: University of Würzburg, Germany

Keyword(s): Bibliography, Classification, Conditional Random Fields (CRFs), Constraint-based Learning, Information Extraction, Information Retrieval, Machine Learning, References Parsing, Semi-supervised Learning, Support Vector Machines (SVMs).

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Clustering and Classification Methods ; Computational Intelligence ; Evolutionary Computing ; Information Extraction ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Machine Learning ; Soft Computing ; Structured Data Analysis and Statistical Methods ; Symbolic Systems ; Web Mining

Abstract: This paper shows how bibliographic references can be located in HTML and then be separated into fields. First it is demonstrated, how Conditional Random Fields (CRFs) with constraints and prior knowledge about the bibliographic domain can be used to split bibliographic references into fields e.g. authors and title, when only a few labeled training instances are available. For this purpose an algorithm for automatic keyword extraction and a unique set of features and constraints is introduced. Features and the output of this Conditional Random Field (CRF) for tagging bibliographic references, Part Of Speech (POS) analysis and Named Entity Recognition (NER) are then used to find the bibliographic reference section in an article. First, a separation of the HTML document into blocks of consecutive inline elements is done. Then we compare one machine learning approach using a Support Vector Machines (SVM) with another one using a CRF for the reference locating process. In contrast to othe r reference locating approches, our method can even cope with single reference entries in a document or with multiple reference sections. We show that our reference location process achieves very good results, while the reference tagging approach is able to compete with other state-of-the-art approaches and sometimes even outperforms them. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 216.73.216.141

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Lindner, S. (2013). Using Conditional Random Fields with Constraints to Train Support Vector Machines - Locating and Parsing Bibliographic References. In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval and the International Conference on Knowledge Management and Information Sharing (IC3K 2013) - KDIR; ISBN 978-989-8565-75-4; ISSN 2184-3228, SciTePress, pages 28-36. DOI: 10.5220/0004546100280036

@conference{kdir13,
author={Sebastian Lindner},
title={Using Conditional Random Fields with Constraints to Train Support Vector Machines - Locating and Parsing Bibliographic References},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval and the International Conference on Knowledge Management and Information Sharing (IC3K 2013) - KDIR},
year={2013},
pages={28-36},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004546100280036},
isbn={978-989-8565-75-4},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval and the International Conference on Knowledge Management and Information Sharing (IC3K 2013) - KDIR
TI - Using Conditional Random Fields with Constraints to Train Support Vector Machines - Locating and Parsing Bibliographic References
SN - 978-989-8565-75-4
IS - 2184-3228
AU - Lindner, S.
PY - 2013
SP - 28
EP - 36
DO - 10.5220/0004546100280036
PB - SciTePress