loading
Documents

Research.Publish.Connect.

Paper

Authors: Luís Ferreira 1 ; André Pilastri 2 ; Carlos Martins 3 ; Pedro Santos 3 and Paulo Cortez 4

Affiliations: 1 EPMQ - IT Engineering Maturity and Quality Lab, CCG ZGDV Institute, Guimarães, Portugal, ALGORITMI Centre, Dep. Information Systems, University of Minho, Guimarães, Portugal ; 2 EPMQ - IT Engineering Maturity and Quality Lab, CCG ZGDV Institute, Guimarães, Portugal ; 3 WeDo Technologies, Braga, Portugal ; 4 ALGORITMI Centre, Dep. Information Systems, University of Minho, Guimarães, Portugal

ISBN: 978-989-758-395-7

Keyword(s): Automated Machine Learning, Distributed Machine Learning, Supervised Learning, Risk Management.

Abstract: Automation and scalability are currently two of the main challenges of Machine Learning. This paper proposes an automated and distributed ML framework that automatically trains a supervised learning model and produces predictions independently of the dataset and with minimum human input. The framework was designed for the domain of telecommunications risk management, which often requires supervised learning models that need to be quickly updated by non-ML-experts and trained on vast amounts of data. Thus, the architecture assumes a distributed environment, in order to deal with big data, and Automated Machine Learning (AutoML), to select and tune the ML models. The framework includes several modules: task detection (to detect if classification or regression), data preprocessing, feature selection, model training, and deployment. In this paper, we detail the model training module. In order to select the computational technologies to be used in this module, we first analyzed the capabil ities of an initial set of five modern AutoML tools: Auto-Keras, Auto-Sklearn, Auto-Weka, H2O AutoML, and TransmogrifAI. Then, we performed a benchmarking of the only two tools that address distributed ML (H2O AutoML and TransmogrifAI). Several comparison experiments were held using three real-world datasets from the telecommunications domain (churn, event forecasting, and fraud detection), allowing us to measure the computational effort and predictive capability of the AutoML tools. (More)

PDF ImageFull Text

Download
CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 100.24.113.182

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Ferreira, L.; Pilastri, A.; Martins, C.; Santos, P. and Cortez, P. (2020). An Automated and Distributed Machine Learning Framework for Telecommunications Risk Management.In Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART, ISBN 978-989-758-395-7, pages 99-107. DOI: 10.5220/0008952800990107

@conference{icaart20,
author={Luís Ferreira. and André Pilastri. and Carlos Martins. and Pedro Santos. and Paulo Cortez.},
title={An Automated and Distributed Machine Learning Framework for Telecommunications Risk Management},
booktitle={Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART,},
year={2020},
pages={99-107},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0008952800990107},
isbn={978-989-758-395-7},
}

TY - CONF

JO - Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART,
TI - An Automated and Distributed Machine Learning Framework for Telecommunications Risk Management
SN - 978-989-758-395-7
AU - Ferreira, L.
AU - Pilastri, A.
AU - Martins, C.
AU - Santos, P.
AU - Cortez, P.
PY - 2020
SP - 99
EP - 107
DO - 10.5220/0008952800990107

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.