loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Juris Rāts 1 ; Inguna Pede 1 ; Tatjana Rubina 2 and Gatis Vītols 2

Affiliations: 1 RIX Technologies, Blaumana 5a-3, Riga, LV-1011, Latvia ; 2 Faculty of Information Technologies, Latvia University of Life Sciences and Technologies, 2 Liela str., Jelgava, LV-3001, Latvia

Keyword(s): Machine Learning, Document Classification, Enterprise Content Management, Python, Elasticsearch.

Abstract: The aim of the research is to create and evaluate a flexible model for document capturing that would employ machine learning to classify documents feeding them with values for one or more metadata items. Documents and classification metadata fields typical for Enterprise Content Management (ECM) systems are used in the research. The model comprises selection of classification methods, configuration of the methods hyperparameters and configuration of a number of other learning related parameters. The model provides user with visual means to analyse the classification outcomes and those to tune the further steps of the learning. A couple of challenges are addressed along the way – as informal and eventually changing criteria for document classification, and imbalanced data sets.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.207.255.67

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Rāts, J.; Pede, I.; Rubina, T. and Vītols, G. (2020). A Flexible Model for Enterprise Document Capturing Automation. In Proceedings of the 22nd International Conference on Enterprise Information Systems - Volume 1: ICEIS; ISBN 978-989-758-423-7; ISSN 2184-4992, SciTePress, pages 297-304. DOI: 10.5220/0009034802970304

@conference{iceis20,
author={Juris Rāts. and Inguna Pede. and Tatjana Rubina. and Gatis Vītols.},
title={A Flexible Model for Enterprise Document Capturing Automation},
booktitle={Proceedings of the 22nd International Conference on Enterprise Information Systems - Volume 1: ICEIS},
year={2020},
pages={297-304},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0009034802970304},
isbn={978-989-758-423-7},
issn={2184-4992},
}

TY - CONF

JO - Proceedings of the 22nd International Conference on Enterprise Information Systems - Volume 1: ICEIS
TI - A Flexible Model for Enterprise Document Capturing Automation
SN - 978-989-758-423-7
IS - 2184-4992
AU - Rāts, J.
AU - Pede, I.
AU - Rubina, T.
AU - Vītols, G.
PY - 2020
SP - 297
EP - 304
DO - 10.5220/0009034802970304
PB - SciTePress