Authors:
Elias Bassani 1 and Marco Viviani 2

Affiliations:
1 University of Milano-Bicocca, Department of Informatics, Systems, and Communication, Edificio U14 - Viale Sarca, 336, 20126 Milan, Italy; Consorzio per il Trasferimento Tecnologico (C2T), Milan, Italy
2 University of Milano-Bicocca, Department of Informatics, Systems, and Communication, Edificio U14 - Viale Sarca, 336, 20126 Milan, Italy
Keyword(s):
Data Quality, Wikipedia, Supervised Classification, Feature Analysis, Ground Truth Building.
Related Ontology Subjects/Areas/Topics:
Artificial Intelligence; Business Analytics; Computational Intelligence; Data Analytics; Data Engineering; Data Reduction and Quality Assessment; Evolutionary Computing; Knowledge Discovery and Information Retrieval; Knowledge-Based Systems; Machine Learning; Mining Text and Semi-Structured Data; Soft Computing; Symbolic Systems; Web Mining
Abstract:
Wikipedia is nowadays one of the largest online resources on which users rely as a source of information. The amount of collaboratively generated content submitted to the online encyclopedia every day can lead to the creation of low-quality articles (and, consequently, to the spread of misinformation) if it is not properly monitored and revised. For this reason, this paper considers the problem of automatically assessing the quality of Wikipedia articles. In particular, the focus is (i) on the analysis of groups of hand-crafted features that supervised machine learning techniques can employ to classify Wikipedia articles on a qualitative basis, and (ii) on the analysis of some issues behind the construction of a suitable ground truth. Evaluations are performed on the analyzed features and on a specifically built labeled dataset by implementing different supervised classifiers based on distinct machine learning algorithms, which produced promising results.
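The supervised setup outlined in the abstract can be sketched as follows. This is a minimal illustration only: the hand-crafted features (e.g., article length, number of references, number of editors), the binary high/low quality labels, and the random-forest classifier are assumptions for demonstration, not the paper's actual feature groups, label scheme, or algorithms.

```python
# Illustrative sketch: classifying articles from hand-crafted features.
# The feature semantics and labels below are synthetic placeholders.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

rng = np.random.default_rng(42)

# Toy feature matrix: each row is one article described by hand-crafted
# features, e.g. [article length, number of references, number of editors].
n = 200
X = rng.random((n, 3))
# Toy ground-truth labels: 1 = high quality, 0 = low quality
# (here derived from one feature purely to make the example learnable).
y = (X[:, 1] > 0.5).astype(int)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0)

clf = RandomForestClassifier(n_estimators=100, random_state=0)
clf.fit(X_train, y_train)
print("accuracy:", accuracy_score(y_test, clf.predict(X_test)))
```

In practice, any supervised learner (e.g., gradient boosting, SVMs) can be substituted for the random forest; the paper's contribution lies in the choice of feature groups and in how the labeled ground truth is built, not in a specific classifier.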