Variable Importance Analysis in Default Prediction using Machine Learning Techniques

Başak Gültekin; Betül Erdoğdu Şakar

Topics: Business Intelligence; Data Analytics; Data Science; Datamining; Decision Support Systems; Feature Selection; Pattern Recognition; Statistics Exploratory Data Analysis

In Proceedings of the 7th International Conference on Data Science, Technology and Applications DATA - Volume 1, pages 56-62, 2018, Porto, Portugal

Authors: Başak Gültekin and Betül Erdoğdu Şakar

Affiliation: Faculty of Engineering and Natural Sciences, Bahçeşehir University, Beşiktaş, Turkey

Keyword(s): Credit Scoring, Default Prediction, Feature Selection, Classification, Boruta, Logistic Regression, Random Forest, Artificial Neural Network.

Related Ontology Subjects/Areas/Topics: Applications ; Artificial Intelligence ; Biomedical Engineering ; Biomedical Signal Processing ; Business Analytics ; Business Intelligence ; Cardiovascular Technologies ; Computing and Telecommunications in Cardiology ; Data Analytics ; Data Engineering ; Data Manipulation ; Data Mining ; Databases and Information Systems Integration ; Datamining ; Decision Support Systems ; Decision Support Systems, Remote Data Analysis ; Enterprise Information Systems ; Health Engineering and Technology Applications ; Health Information Systems ; Human-Computer Interaction ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Methodologies and Methods ; Neurocomputing ; Neurotechnology, Electronics and Informatics ; Pattern Recognition ; Physiological Computing Systems ; Sensor Networks ; Signal Processing ; Soft Computing ; Software Engineering ; Statistics Exploratory Data Analysis ; Symbolic Systems

Abstract: In this study, different data mining techniques were applied to a real credit data set from a public bank to provide automated and objective credit scoring. A two-step methodology was used: first determining the variables to be included in the model, then deciding on the model that classifies a potential credit application as “bad credit (default)” or “good credit (not default)”. The phrases “bad credit” and “good credit” are used as class labels because this is how they are used in banking jargon in Turkey. For this two-step procedure, variable selection algorithms such as Random Forest and Boruta and machine learning algorithms such as Logistic Regression, Random Forest and Artificial Neural Network were tried. At the end of the feature selection phase, the CRA_Score and III_Score variables were determined to be the most important variables. Occupation and bank product number were also predictor variables. In the classification phase, the Neural Network model was the best model, with higher accuracy and low average squared error; the Random Forest model also performed better than the Logistic Regression model.
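The two ideas named in the abstract, shadow-feature selection in the style of Boruta and model comparison by accuracy and average squared error, can be illustrated with a minimal sketch. This is not the authors' implementation: the data is synthetic, the column names `score` and `noise` are hypothetical, absolute Pearson correlation is swapped in for random forest importance, and a simple majority vote replaces Boruta's statistical test. It uses only the Python standard library.

```python
import random


def abs_corr(xs, ys):
    """Absolute Pearson correlation, used here as a stand-in importance score."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return abs(cov / (sx * sy)) if sx > 0 and sy > 0 else 0.0


def shadow_select(features, labels, n_rounds=30, seed=0):
    """Keep features that beat the best shuffled (shadow) copy in most rounds."""
    rng = random.Random(seed)
    hits = {name: 0 for name in features}
    for _ in range(n_rounds):
        # Shadow features keep each column's distribution but break any
        # relationship with the label, so they set a chance-level baseline.
        best_shadow = 0.0
        for values in features.values():
            shadow = values[:]
            rng.shuffle(shadow)
            best_shadow = max(best_shadow, abs_corr(shadow, labels))
        for name, values in features.items():
            if abs_corr(values, labels) > best_shadow:
                hits[name] += 1
    # Simplification: majority vote instead of Boruta's binomial test.
    selected = [name for name, h in hits.items() if h > n_rounds / 2]
    return selected, hits


def accuracy_and_ase(probs, labels, threshold=0.5):
    """Accuracy at a cut-off, and the mean squared gap between probability and label."""
    n = len(labels)
    acc = sum((p >= threshold) == (y == 1) for p, y in zip(probs, labels)) / n
    ase = sum((p - y) ** 2 for p, y in zip(probs, labels)) / n
    return acc, ase


# Synthetic data: label 1 stands for "bad credit (default)", 0 for "good credit".
rng = random.Random(42)
n = 400
labels = [rng.randint(0, 1) for _ in range(n)]
features = {
    "score": [y + rng.gauss(0, 0.5) for y in labels],  # depends on the label
    "noise": [rng.gauss(0, 1) for _ in range(n)],      # unrelated to the label
}
selected, hits = shadow_select(features, labels)
print("selected:", selected, "hits:", hits)

# Hypothetical predicted default probabilities for four applications.
acc, ase = accuracy_and_ase([0.9, 0.2, 0.6, 0.4], [1, 0, 0, 1])
print("accuracy:", acc, "average squared error:", ase)
```

In a setting like the one the abstract describes, the importance score would come from a fitted random forest and the probabilities from each candidate classifier; the comparison logic stays the same.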

CC BY-NC-ND 4.0

Paper citation in several formats:

Gültekin, B. and Erdoğdu Şakar, B. (2018). Variable Importance Analysis in Default Prediction using Machine Learning Techniques. In Proceedings of the 7th International Conference on Data Science, Technology and Applications - DATA; ISBN 978-989-758-318-6; ISSN 2184-285X, SciTePress, pages 56-62. DOI: 10.5220/0006872400560062

@conference{data18,
author={Başak Gültekin and Betül {Erdoğdu Şakar}},
title={Variable Importance Analysis in Default Prediction using Machine Learning Techniques},
booktitle={Proceedings of the 7th International Conference on Data Science, Technology and Applications - DATA},
year={2018},
pages={56-62},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006872400560062},
isbn={978-989-758-318-6},
issn={2184-285X},
}

TY - CONF

JO - Proceedings of the 7th International Conference on Data Science, Technology and Applications - DATA
TI - Variable Importance Analysis in Default Prediction using Machine Learning Techniques
SN - 978-989-758-318-6
IS - 2184-285X
AU - Gültekin, B.
AU - Erdoğdu Şakar, B.
PY - 2018
SP - 56
EP - 62
DO - 10.5220/0006872400560062
PB - SciTePress