loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Daniel Braun and Florian Matthes

Affiliation: Department of Informatics, Technical University of Munich, Boltzmannstrasse 3, Garching, Germany

Keyword(s): Natural Language Processing, Document Classification, Terms and Conditions, Web Crawling.

Abstract: Terms and Conditions in online shops are arguably among the most important (or at least the most widely used) forms of consumer contracts. At the same time, they are probably among the least read documents. Thus, their automated analysis is of great interest, not just for research, but also from a consumer protection perspective. To be able to automatically process large amounts of Terms and Conditions and build the corpora which are necessary to train data-driven systems, we need means to identify Terms and Conditions automatically. In this paper, we present and evaluate four different approaches to the automatic detection of Terms and Conditions pages in German and English online shops. We treat the problem as a binary document classification problem for web-pages and report an approach which achieves precision, recall, and F1-score above 0.9 in German and close to 0.9 in English, by analysing the URL of the page.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.128.198.21

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Braun, D. and Matthes, F. (2020). Automatic Detection of Terms and Conditions in German and English Online Shops. In Proceedings of the 16th International Conference on Web Information Systems and Technologies - WEBIST; ISBN 978-989-758-478-7; ISSN 2184-3252, SciTePress, pages 233-237. DOI: 10.5220/0010154302330237

@conference{webist20,
author={Daniel Braun. and Florian Matthes.},
title={Automatic Detection of Terms and Conditions in German and English Online Shops},
booktitle={Proceedings of the 16th International Conference on Web Information Systems and Technologies - WEBIST},
year={2020},
pages={233-237},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010154302330237},
isbn={978-989-758-478-7},
issn={2184-3252},
}

TY - CONF

JO - Proceedings of the 16th International Conference on Web Information Systems and Technologies - WEBIST
TI - Automatic Detection of Terms and Conditions in German and English Online Shops
SN - 978-989-758-478-7
IS - 2184-3252
AU - Braun, D.
AU - Matthes, F.
PY - 2020
SP - 233
EP - 237
DO - 10.5220/0010154302330237
PB - SciTePress