Authors: Moheb M. R. Henein, Doaa M. Shawky and Salwa K. Abd-El-Hafiz

Affiliation: Engineering Mathematics and Physics Department, Faculty of Engineering, Cairo University, Giza 12613, Egypt

Keyword(s): Software Defect Prediction, Under-sampling, Clustering, K-means, Artificial Neural Network.

Abstract: Detecting defective software modules is important for reducing the time and resources consumed by software testing. Software defect data sets usually suffer from imbalance, with fewer defective modules than defect-free modules. Imbalanced data sets bias machine learning algorithms toward the majority class. Clustering-based under-sampling has shown its ability to find good representatives of the majority data in different applications. This paper presents an approach for software defect prediction based on clustering-based under-sampling and an Artificial Neural Network (ANN). First, clustering-based under-sampling is used to select a subset of the majority samples, which is then combined with the minority samples to produce a balanced data set. Second, an ANN model is built and trained on the resulting balanced data set to classify software modules as defective or defect-free. In addition, a sensitivity analysis is conducted to choose the number of majority samples that yields the best performance measures. Results show a high prediction capability for detecting defective modules while maintaining the ability to detect defect-free modules.
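
The under-sampling step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how representatives are chosen from each cluster, so the sketch assumes that the majority sample nearest to each k-means centroid is kept. The function names (`kmeans_centroids`, `cluster_undersample`), the `n_keep` parameter (standing in for the number of majority samples varied in the sensitivity analysis), and the toy data are all hypothetical. ANN training on the balanced set is omitted.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans_centroids(points, k, iters=50, seed=0):
    """Plain Lloyd's k-means on a list of feature vectors; returns k centroids."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda c: sq_dist(p, centroids[c]))
            clusters[nearest].append(p)
        # Recompute each centroid as its cluster mean; keep it if the cluster is empty.
        updated = [
            [sum(col) / len(members) for col in zip(*members)] if members else centroids[j]
            for j, members in enumerate(clusters)
        ]
        if updated == centroids:
            break
        centroids = updated
    return centroids


def cluster_undersample(majority, minority, n_keep=None, seed=0):
    """Return (X, y) with labels 0 = defect-free (majority), 1 = defective (minority).

    Assumption: one representative per cluster, namely the majority sample
    closest to the centroid. By default as many majority samples are kept
    as there are minority samples, giving a balanced set.
    """
    k = len(minority) if n_keep is None else n_keep
    k = max(1, min(k, len(majority)))
    centroids = kmeans_centroids(majority, k, seed=seed)
    remaining = list(range(len(majority)))
    kept = []
    for c in centroids:
        i = min(remaining, key=lambda idx: sq_dist(majority[idx], c))
        remaining.remove(i)  # each majority sample is selected at most once
        kept.append(majority[i])
    X = kept + list(minority)
    y = [0] * len(kept) + [1] * len(minority)
    return X, y


# Toy data (hypothetical): six defect-free modules, two defective ones.
majority = [[0.0, 0.1], [0.1, 0.0], [0.2, 0.1], [5.0, 5.1], [5.1, 4.9], [4.9, 5.0]]
minority = [[2.0, 2.1], [2.2, 1.9]]
X, y = cluster_undersample(majority, minority)
```

Here `X` holds two selected majority samples followed by the two minority samples, and `y` holds the matching class labels. Passing a different `n_keep` changes how many majority samples are retained, which is the quantity the abstract says is tuned by sensitivity analysis.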

CC BY-NC-ND 4.0

Paper citation in several formats:
Henein, M.; Shawky, D. and Abd-El-Hafiz, S. (2018). Clustering-based Under-sampling for Software Defect Prediction. In Proceedings of the 13th International Conference on Software Technologies - ICSOFT; ISBN 978-989-758-320-9; ISSN 2184-2833, SciTePress, pages 185-193. DOI: 10.5220/0006911402190227

@conference{icsoft18,
author={Moheb M. R. Henein and Doaa M. Shawky and Salwa K. Abd{-}El{-}Hafiz},
title={Clustering-based Under-sampling for Software Defect Prediction},
booktitle={Proceedings of the 13th International Conference on Software Technologies - ICSOFT},
year={2018},
pages={185-193},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006911402190227},
isbn={978-989-758-320-9},
issn={2184-2833},
}

TY - CONF

JO - Proceedings of the 13th International Conference on Software Technologies - ICSOFT
TI - Clustering-based Under-sampling for Software Defect Prediction
SN - 978-989-758-320-9
IS - 2184-2833
AU - Henein, M.
AU - Shawky, D.
AU - Abd-El-Hafiz, S.
PY - 2018
SP - 185
EP - 193
DO - 10.5220/0006911402190227
PB - SciTePress