FactRunner: Fact Extraction over Wikipedia

Rhio Sutoyo; Christoph Quix; Fisnik Kastrati

Topics: Meta-Knowledge Discovery and Representation; Web Information Filtering and Retrieval

In Proceedings of the 9th International Conference on Web Information Systems and Technologies WEBIST - Volume 1, 423-432, 2013, Aachen, Germany

Authors: Rhio Sutoyo ¹; Christoph Quix ² and Fisnik Kastrati ³

Affiliations: ¹ King Mongkut’s University of Technology North Bangkok, Thailand; ² RWTH Aachen University and Fraunhofer Institute for Applied Information Technology FIT, Germany; ³ RWTH Aachen University, Germany

Keyword(s): Information Extraction, Semantic Search.

Abstract: The increasing role of Wikipedia as a source of human-readable knowledge is evident, as it contains an enormous amount of high-quality information written in natural language by human authors. However, querying this information with traditional keyword-based approaches often requires a time-consuming, iterative process of exploring the document collection to find the information of interest. A structured representation of information and queries would therefore make it possible to query directly for the relevant information. An important challenge in this context is the extraction of structured information from unstructured knowledge bases, which is addressed by Information Extraction (IE) systems. However, these systems struggle with the complexity of natural language and frequently produce unsatisfying results. In addition to plain natural language text, Wikipedia contains links that connect a term in one document directly to another document. In our approach for fact extraction from Wikipedia, we consider these links an important indicator of the relevance of the linked information. Thus, our proposed system FactRunner focuses on extracting structured information from sentences containing such links. We show that a natural language parser combined with Wikipedia markup can be exploited to extract facts in the form of triple statements with high accuracy.
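
As an illustration of the idea in the abstract, the following minimal sketch selects only those sentences of a wikitext fragment that contain Wikipedia links and records the linked article titles. It is not the FactRunner implementation: the names `linked_sentences`, `LinkedSentence`, and `Triple` are hypothetical, the sentence splitter and the link pattern are deliberately naive, and the parser-based triple extraction described in the paper is not reproduced here.

```python
import re
from typing import List, NamedTuple

# Simplified pattern for [[Target]] and [[Target|label]] wiki links.
# Assumption: no nested markup, templates, or file links in the input.
WIKI_LINK = re.compile(r"\[\[([^\[\]|]+)(?:\|([^\[\]]+))?\]\]")


class LinkedSentence(NamedTuple):
    text: str           # sentence with link markup replaced by its visible label
    targets: List[str]  # titles of the linked articles


class Triple(NamedTuple):
    # Hypothetical container for a fact in (subject, predicate, object) form.
    subject: str
    predicate: str
    obj: str


def split_sentences(wikitext: str) -> List[str]:
    # Naive splitter: break after '.', '!' or '?' followed by whitespace.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", wikitext) if s.strip()]


def linked_sentences(wikitext: str) -> List[LinkedSentence]:
    """Keep only sentences that contain at least one wiki link."""
    result = []
    for sentence in split_sentences(wikitext):
        targets = [m.group(1).strip() for m in WIKI_LINK.finditer(sentence)]
        if not targets:
            continue  # sentences without links are skipped in this sketch
        # Replace each link with its label if present, otherwise its target.
        plain = WIKI_LINK.sub(lambda m: (m.group(2) or m.group(1)).strip(), sentence)
        result.append(LinkedSentence(plain, targets))
    return result


if __name__ == "__main__":
    sample = (
        "Aachen is a city in [[Germany]]. It is old. "
        "[[RWTH Aachen University|RWTH]] is located in Aachen."
    )
    for item in linked_sentences(sample):
        print(item.targets, "|", item.text)
    # A hand-written example of the target representation, not extractor output:
    print(Triple("Aachen", "is a city in", "Germany"))
```

In a full pipeline, each retained sentence would then be handed to a natural language parser to derive the triple; that step is left out here because the page does not describe it in detail.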

CC BY-NC-ND 4.0

Paper citation in several formats:

Sutoyo, R.; Quix, C. and Kastrati, F. (2013). FactRunner: Fact Extraction over Wikipedia. In Proceedings of the 9th International Conference on Web Information Systems and Technologies - WEBIST; ISBN 978-989-8565-54-9; ISSN 2184-3252, SciTePress, pages 423-432. DOI: 10.5220/0004375604230432

@conference{webist13,
author={Rhio Sutoyo and Christoph Quix and Fisnik Kastrati},
title={FactRunner: Fact Extraction over Wikipedia},
booktitle={Proceedings of the 9th International Conference on Web Information Systems and Technologies - WEBIST},
year={2013},
pages={423-432},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004375604230432},
isbn={978-989-8565-54-9},
issn={2184-3252},
}

TY - CONF
JO - Proceedings of the 9th International Conference on Web Information Systems and Technologies - WEBIST
TI - FactRunner: Fact Extraction over Wikipedia
SN - 978-989-8565-54-9
IS - 2184-3252
AU - Sutoyo, R.
AU - Quix, C.
AU - Kastrati, F.
PY - 2013
SP - 423
EP - 432
DO - 10.5220/0004375604230432
PB - SciTePress