new fraud strategies when maintaining
competitiveness in terms of accuracy rate and low
latency in identifying decisions.
3 LITERATURE SURVEY
Machine learning has brought great strides in the
development of the credit card fraud detection
systems. Kandi and García-Dopico (2025) have
Discussed the fact that the combination of LSTM
with XGBoost for fraud detection can be beneficial,
but it is challehing in terms of computation. Tayebi
and El Kafhali, 2025) is an autoencoder based model
using pseudorandom patterns, but they observed that
their proposed models fail to detect mimicry attacks
and provide room for hybrid solutions. Mim et al.
(2024) proposed a soft voting ensemble, which
improved classification but decreased the
interpretability of the decision. Chu et al. (2023) used
ensemble models on the original European cardholder
data and found real-life evidence that limitations of
the dataset impede its validity.
Li et al. (2023) introduced the fraud policy by
reinforcement learning, stressing flexibility yet
mentioning its training cost. Li, Xu, Wu, and Zhang
(2023) examined collaborative schemes which
enhanced learning, but made privacy issues possible.
Zhang et al. (2023) considered interpretable deep
learning approaches to avoid the explainability
problem, echoing the medical needs. Liu et al. (2023)
presented federated learning to enhanced data
privacy across institutes, struggled with varied model
performance.
Wu, Li and Zhou (2022) used VAE for anomaly
detection, which suffered from the overfitting on the
unbalanced data. Li et al. (2022) applied GNNs for
transaction relationship mapping, but encountered
processing inefficiency. Zhang et al. (2022) added
blockchain to better secure and transact the
dissemination, however, its real-time applicability
was unclear. Xu et al. (2022) also introduced human
expert prior into machine learning for providing
baseline knowledge, but manual interventions made
it very difficult to adapt according to the case.
Zhang et al. (2021) studied complex deep
reinforcement learning models that were found to be
promising yet computationally expensive. Chen et al.
(2021) proposed transfer learning approaches that
helped generalize more expressions but with the
necessity of source-target data alignment. Xu et al.
(2021) proposed an explainable AI framework
specific to finance, and Wang et al. (2021) focused on
dynamic model updates and suffered from
distributed synchronization challenges.
Chen et al. (2020) applied hybrid evolutionary
algorithms to the selection process, which provides
optimization of hyperparameter with the price of
convergence rate for training. Bhattacharya et al.
(2020): balanced the classes with synthesized
sampling, however the introduction of noise proved
to be difficult. Zhang et al. (2019) developed a hybrid
rule and machine learning-based system, but its
precision was high and adaptation was low. Lastly,
Smith et al. (2010) described legacy knowledge-
based systems that provided fundamental
benchmarking but seemingly could not adapt to
current fraud threats.
This aggregate research indicates the need for a
coherent model that considers accuracy,
interpretability, real-time response and adaptability
variables which the proposed XGBoost-based system
looks to incorporate and improve.
4 METHODOLOGY
An Intelligent On-Line Fraud Detection Based on the
XGBoost Algorithm for Dynamic Credit Card
Transactions Proposed approach is an intelligent
real-time fraud detection concept using the XGBoost
algorithm for dynamic credit card transactions. This
approach is organized such that prompt
identification, precision and progressive learning are
its integral elements to keep the pace with ever-
changing fraudulent tactics. In general, the entire
system consists of a set of housed phases including:
Data-preprocessing, Feature-engineering, Imbalance-
handling, Model-training, Hyperparameter-
optimization, Real-time-deployment with continuous
learning.
First, transactional datasets are obtained from
trusted, actual sources of the real world, which
include real and fake records. These data are
frequently characterized by a very heavy class
imbalance, counterfeit transactions are, in fact, a
small percentage. Preprocessing: The data is
subjected to preprocessing and unnecessary fields are
removed, null values are being managed, categorical
variables are being encoded and continuous
variables are being normalized. Features in the time
dimension also engineered, such as transaction
volume per user, merchant risk scores, and
transaction velocity are used as contextually relevant
inputs to the model.
Figure 1 shows the Real-Time
Credit Card Fraud Detection Workflow using
XGBoost.