loading
Documents

Research.Publish.Connect.

Paper

Authors: Önder Babur 1 ; Loek Cleophas 2 and Mark van den Brand 1

Affiliations: 1 Eindhoven University of Technology, Netherlands ; 2 Eindhoven University of Technology and Stellenbosch University, Netherlands

ISBN: 978-989-758-283-7

Keyword(s): Model-Driven Engineering, Model Analytics, Scalability, Distributed Computing, Apache Spark, Big Data.

Abstract: The growing number of models and other related artefacts in model-driven engineering has recently led to the emergence of approaches and tools for analyzing and managing them on a large scale. The framework SAMOS applies techniques inspired by information retrieval and data mining to analyze large sets of models. As the data size and analysis complexity goes up, however, further scalability is needed. In this paper we extend SAMOS to operate on Apache Spark, a popular engine for distributed Big Data processing, by partitioning the data and parallelizing the comparison and analysis phase. We present preliminary studies using a cluster infrastructure and report the results for two datasets: one with 250 Ecore metamodels where we detail the performance gain with various settings, and a larger one of 7.3k metamodels with nearly one million model elements for further demonstrating scalability.

PDF ImageFull Text

Download
Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 54.198.23.251

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Babur Ö., Cleophas L. and van den Brand M. (2018). Towards Distributed Model Analytics with Apache Spark.In Proceedings of the 6th International Conference on Model-Driven Engineering and Software Development - Volume 1: MOMA3N, ISBN 978-989-758-283-7, pages 767-772. DOI: 10.5220/0006735407670772

@conference{moma3n18,
author={Önder Babur and Loek Cleophas and Mark van den Brand},
title={Towards Distributed Model Analytics with Apache Spark},
booktitle={Proceedings of the 6th International Conference on Model-Driven Engineering and Software Development - Volume 1: MOMA3N,},
year={2018},
pages={767-772},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006735407670772},
isbn={978-989-758-283-7},
}

TY - CONF

JO - Proceedings of the 6th International Conference on Model-Driven Engineering and Software Development - Volume 1: MOMA3N,
TI - Towards Distributed Model Analytics with Apache Spark
SN - 978-989-758-283-7
AU - Babur Ö.
AU - Cleophas L.
AU - van den Brand M.
PY - 2018
SP - 767
EP - 772
DO - 10.5220/0006735407670772

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.