File Name Classification Approach to Identify Child Sexual Abuse

Mhd Wesam Al-Nabki; Mhd Wesam Al-Nabki; Eduardo Fidalgo; Eduardo Fidalgo; Enrique Alegre; Enrique Alegre; Rocío Aláiz-Rodríguez; Rocío Aláiz-Rodríguez

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

File Name Classification Approach to Identify Child Sexual Abuse

Topics: Classification and Clustering; Deep Learning and Neural Networks; Feature Selection and Extraction; Natural Language Processing

In Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods ICPRAM - Volume 1, 228-234, 2020 , Valletta, Malta

Authors: Mhd Wesam Al-Nabki ^{1

;

2} ; Eduardo Fidalgo ^{1

;

2} ; Enrique Alegre ^{1

;

2} and Rocío Aláiz-Rodríguez ^{1

;

2}

Affiliations: ¹ Department of Electrical, Systems and Automation, Universidad de León, Spain ; ² Researcher at INCIBE (Spanish National Cybersecurity Institute), León, Spain

Keyword(s): Short Text Classification, File Name Classification, Active Learning, Character-level Convolutional Networks, Child Sexual Abuse.

Abstract: When Law Enforcement Agencies seize a computer machine from a potential producer or consumer of Child Sexual Exploitation Material (CSEM), they need accurate and time-efficient tools to analyze its files. However, classifying and detecting CSEM by manual inspection is a high time-consuming task, and most of the time, it is unfeasible in the amount of time available for Spanish police using a search warrant. An option for identifying CSEM is to analyze the names of the files stored in the hard disk of the suspect person, looking in the text for patterns related to CSEM. However, due to the particularity of this file names, mainly its length and the use of obfuscated words, current file name classification methods suffer from a low recall rate, which is essential in the context of this problem. This paper presents our ongoing research to identify CSEM through their file names. We evaluate two approaches of short text classification: a proposal based on machine learning classifiers expl oring the use of Logistic Regression and Support Vector Machine and an approach using deep learning by adapting two popular Convolutional Neural Network (CNN) models that work on character-level. The presented CNN achieved an average class recall of 0.86 and a recall rate of 0.78 for the CSEM class. The CNN based classifier could be integrated into forensic tools and services that might support Law Enforcement Agencies to identify CSEM without the need to access systematically to the visual content of every file. (More)

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 18.226.187.101

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Al-Nabki, M.; Fidalgo, E.; Alegre, E. and Aláiz-Rodríguez, R. (2020). File Name Classification Approach to Identify Child Sexual Abuse. In Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods - ICPRAM; ISBN 978-989-758-397-1; ISSN 2184-4313, SciTePress, pages 228-234. DOI: 10.5220/0009154802280234

@conference{icpram20,
author={Mhd Wesam Al{-}Nabki. and Eduardo Fidalgo. and Enrique Alegre. and Rocío Aláiz{-}Rodríguez.},
title={File Name Classification Approach to Identify Child Sexual Abuse},
booktitle={Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods - ICPRAM},
year={2020},
pages={228-234},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0009154802280234},
isbn={978-989-758-397-1},
issn={2184-4313},
}

TY - CONF

JO - Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods - ICPRAM
TI - File Name Classification Approach to Identify Child Sexual Abuse
SN - 978-989-758-397-1
IS - 2184-4313
AU - Al-Nabki, M.
AU - Fidalgo, E.
AU - Alegre, E.
AU - Aláiz-Rodríguez, R.
PY - 2020
SP - 228
EP - 234
DO - 10.5220/0009154802280234
PB - SciTePress