loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Yao Jean Marc Pokou ; Philippe Fournier-Viger and Chadia Moghrabi

Affiliation: Université de Moncton, Canada

Keyword(s): Authorship Attribution, Stylometry, Part-of-Speech Tags, Variable Length Sequential Patterns.

Related Ontology Subjects/Areas/Topics: Agents ; Artificial Intelligence ; Data Mining ; Databases and Information Systems Integration ; Enterprise Information Systems ; Privacy, Safety and Security ; Sensor Networks ; Signal Processing ; Soft Computing

Abstract: Identifying the author of a book or document is an interesting research topic having numerous real-life applications. A number of algorithms have been proposed for the automatic authorship attribution of texts. However, it remains an important challenge to find distinct and quantifiable features for accurately identifying or narrowing the range of likely authors of a text. In this paper we propose a novel approach for authorship attribution, which relies on the discovery of variable-length sequential patterns of parts of speech to build signatures representing each author’s writing style. An experimental evaluation using 10 authors and 30 books, consisting of 2,615,856 words, from Project Gutenberg was carried. Results show that the proposed approach can accurately classify texts most of the time using a very small number of variable-length patterns. The proposed approach is also shown to perform better using variable-length patterns than with fixed-length patterns (bigrams or trigra ms). (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.88.185.100

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Pokou, Y.; Fournier-Viger, P. and Moghrabi, C. (2016). Authorship Attribution using Variable Length Part-of-Speech Patterns. In Proceedings of the 8th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART; ISBN 978-989-758-172-4; ISSN 2184-433X, SciTePress, pages 354-361. DOI: 10.5220/0005710103540361

@conference{icaart16,
author={Yao Jean Marc Pokou. and Philippe Fournier{-}Viger. and Chadia Moghrabi.},
title={Authorship Attribution using Variable Length Part-of-Speech Patterns},
booktitle={Proceedings of the 8th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART},
year={2016},
pages={354-361},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005710103540361},
isbn={978-989-758-172-4},
issn={2184-433X},
}

TY - CONF

JO - Proceedings of the 8th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART
TI - Authorship Attribution using Variable Length Part-of-Speech Patterns
SN - 978-989-758-172-4
IS - 2184-433X
AU - Pokou, Y.
AU - Fournier-Viger, P.
AU - Moghrabi, C.
PY - 2016
SP - 354
EP - 361
DO - 10.5220/0005710103540361
PB - SciTePress