Authors:
Andrea Murari
1
;
Emmanuele Peluso
2
;
Saeed Talebzadeh
2
;
Pasqualino Gaudio
2
;
Michele Lungaroni
2
;
Ondrej Mikulin
3
;
Jesus Vega
4
and
Michela Gelfusa
2
Affiliations:
1
Consorzio RFX (CNR, ENEA, INFN, Università di Padova and Acciaierie Venete SpA), Italy
;
2
University of Rome “Tor Vergata”, Italy
;
3
Institute of Plasma Physics AS CR, Czech Republic
;
4
Asociación EURATOM/CIEMAT para Fusión, Spain
Keyword(s):
Machine Learning Tools, Support Vector Machines, Symbolic Regression, Genetic Programming.
Related
Ontology
Subjects/Areas/Topics:
Artificial Intelligence
;
Clustering and Classification Methods
;
Computational Intelligence
;
Evolutionary Computing
;
Knowledge Discovery and Information Retrieval
;
Knowledge-Based Systems
;
Machine Learning
;
Soft Computing
;
Structured Data Analysis and Statistical Methods
;
Symbolic Systems
Abstract:
In many scientific applications, it is necessary to perform classification, which means discrimination between examples belonging to different classes. Machine Learning Tools have proved to be very performing in this task and can achieve very high success rates. On the other hand, the “realism” and interpretability of their results are very low, limiting their applicability. In this paper, a method to derive manageable equations for the hypersurface between classes is presented. The main objective consists of formulating the results of machine learning tools in a way representing the actual “physics” behind the phenomena under investigation. The proposed approach is based on a suitable combination of Support vector Machines and Symbolic Regression via Genetic Programming; it has been investigated with a series of systematic numerical tests, for different types of equations and classification problems, and tested with various experimental databases. The obtained results indicate that
the proposed method permits to find a good trade-off between accuracy of the classification and complexity of the derived mathematical equations. Moreover, the derived models can be tuned to reflect the actual phenomena, providing a very useful tool to bridge the gap between data, machine learning tools and scientific theories.
(More)