real-time or frequent updates to the cost function may
become logistically complex, requiring additional
processes for data ingestion and threshold
recalibration.
4.4 Potential Extensions
One extension might involve incorporating hybrid
models that blend XGBoost with specialized anomaly
detection techniques, particularly for unusual
behavior in credit usage. Another possibility involves
deeper exploration of feature contributions, moving
beyond threshold scans to examine which attributes
drive cost outcomes most strongly. In some cases, a
time-series or sequence-based approach might be
pursued if payment patterns over multiple months can
be formulated as a sequence, thus applying RNN or
transformer architectures. Finally, methods such as
SHAP or LIME can clarify which features
consistently trigger a default label, thereby satisfying
stakeholders who need transparency (Zhang & Li,
2020).
4.5 Limitations Revisited
Although this cost-sensitive approach unifies data
preprocessing, multi-model comparison, and
threshold optimization in XGBoost, several
constraints remain. First, the 1000:100 ratio is
heuristic and may not reflect actual institutional
practices. Different banks could adopt distinct scales
based on average credit limits or interest structures,
indicating that a tiered or dynamic cost matrix might
better capture genuine lending risks. Second, only a
basic version of XGBoost was tested here, and
advanced hyperparameter tuning or specialized cost-
sensitive objectives could further refine outcomes.
Third, the study hinges on one UCI dataset,
suggesting a need for external validation on
proprietary data to verify real-world performance.
Finally, threshold scanning occurs offline, whereas
continuous risk environments require pipelines for
frequent data ingestion, model retraining, and
threshold readjustment. Nonetheless, emphasizing
penalties for missed defaulters reduces estimated bad
debt from roughly 877,900 to 432,400, underscoring
how accuracy (or AUC) alone may conceal the severe
costs of high-risk borrowers. Future directions
include multitier cost ratios guided by credit lines,
deeper tuning of XGBoost, and integration of
interpretability tools such as SHAP or LIME for
regulatory approval. Online or dynamic thresholds
could also adapt to evolving economic signals,
ensuring that modeling keeps pace with real-world
lending operations (Zhou, 2012).
5 CONCLUSION
This study proposed a cost-sensitive framework for
credit card default prediction, which integrates
comprehensive data preprocessing, multi-model
comparison, and threshold optimization in the
XGBoost. Highlighting the imbalanced cost of
missing defaulters brought potential bad debt down
from around 877,900 to 432,400, pointing to the fact
that some metrics like accuracy or AUC can miss the
point in high-stakes finance contexts.
Discussion of these results emphasized how a
decision threshold adjustment, instead of just
maximizing standard measures, can better suit
classification to real-world lending goals. This insight
advances practice through cost-based threshold
tuning and overcoming missing sentiment, which will
improve risk management and profits.
Still, some limitations do exist. Decisions made
using the single-cost ratio present in this paper may
not generalize to all lenders, nor is it based on any
numerous datasets. Refining this approach with
multi-tiered cost frameworks or more sophisticated
hyperparameter searches would enhance its
robustness and adaptability. Moreover, the
interpretability issue is not solved in complex
ensemble approaches such as gradient boosting,
highlighting the need for explainable tools that can
help fulfill the requirements of transparency in
financial systems.
Therefore, in practice, this cost-sensitive strategy
can be implemented as part of the day-to-day decision
processes of banks and credit issuers that can update
their thresholds as market conditions change. Next
steps may be for models to become adaptive, updating
cost ratios in real time to further improve the risk
capture while maintaining a good balance with
overall portfolio performance.
REFERENCES
Bahnsen, A. C., Aouada, D., & Ottersten, B. 2015.
Example-dependent cost-sensitive logistic regression
for credit scoring. Expert Systems with Applications,
42(6), 2476–2486.
Brown, I., & Mues, C. 2012. An experimental comparison
of classification algorithms for imbalanced credit
scoring data sets. Expert Systems with Applications,
39(3), 3446–3453.