
expert users, by automating the selection of models,
hyper-parameter tuning, and evaluation processes. A
large part of the business analysis is credit card fraud
detection. Hasan et al. emphasizes transparency and
interpretability in AI models, particularly in regulated
industries like finance. The study assesses four
machine learning models (ANN, SVM, Random
Forest, and Logistic Regression) using a dataset that
is extremely skewed. When it came to identifying
fraudulent transactions, ANN had the highest
precision and SVM the highest recall. In the study of
Alenzi and Aljehane (Alenzi et al., 2020), the
research also emphasizes the financial advantages of
employing explainable AI models for adaptive
learning and real-time fraud detection. It is creating a
logistic regression-based artificial intelligence system
to identify credit card fraud. It deals with the rising
problem of credit card fraud brought on by an
increase in internet sales. It draws attention to the
difficulties in handling erratic, unbalanced data and
dataset outliers. Additionally, there are numerous
connections between federated learning and credit
card fraud detection. The paper that Abdul Salam et
al (Abdul Salam et al., 2024) proposes a FL approach
for prevent credit card fraud, addressing critical
issues such as data privacy and class imbalance in
financial organizations. Traditional centralized
models have data sharing limits, therefore federated
learning is a feasible alternative since it allows
institutions to exercise a shared model without
disclosing sensitive data. Addressing class imbalance,
the research employs a variety of resampling
strategies, including oversampling (Smote, AdaSyn)
and undersampling (RUS), and analyses their efficacy
across multiple machine learning algorithms. The
Random Forest (RF) classifier has the best accuracy
(99.99%), beating approaches such as Logistic
Regression and K-Nearest Neighbors. The study
demonstrates how hybrid resampling methods
combined with federated learning significantly
improve fraud detection while maintaining data
privacy.
For the rest part of paper, first, how the other
researcher uses Federated Learning in credit fraud or
different ways that the researcher will commonly use
AI to prevent or protect against credit card fraud in
Section 2, compare these methods to each other, and
emphasize the challenge facing in this area in Section
3. In Section 4, summarize the paper and get the
conclusion according to the paper that this paper
discussed here.
2 METHOD
2.1 Preliminaries of Federated Learning
Federated Learning (FL) is a decentralized machine
learning technique that accept numerous devices or
institutions to coach a model without giving raw data.
Unlike traditional approaches, where data is
centralized, FL allows for model training at the data
source, ensuring privacy and security. After local
training, clients send only model modifications
(weights, gradients, etc.) to a central server, which
aggregates these updates to enhance the model as a
whole. The clients receive the updated model back,
and the process keeps on until the model is optimized.
FL is especially effective in sensitive industries such
as business, hospitality, and mobile applications,
where data privacy is a top priority. The pipeline
includes model initialization, local training,
aggregation, and global model updates. FL complies
with GDPR laws by guaranteeing that data remains
with the owner while benefiting from collective
learning across many sources.
2.2 Vertical Federated Learning
The Boliang Lv et al.'s (Lv et al., 2021) paper
explores the application of Vertical Federated
Learning (VFL) for identifying e-banking fraud
accounts by combining financial and social data while
preserving privacy. Vertical federated learning allows
financial institutions and social media companies to
collaborate without sharing sensitive data, using
encrypted sample alignment and model training
techniques. The system was tested using 60,000 data
samples, demonstrating that federated learning
improves accuracy, precision, and recall compared to
local models. This method is beneficial in detecting
fraudulent accounts before transactions occur,
safeguarding users, and enhancing fraud prevention.
The study carried out by Fan et al. (Fan et al., 2024)
highlights federated learning's effectiveness in cross-
industry data collaboration. The study introduces a
unique defence method termed Federated Learning
Similar Gradients (FLSG), intended to prevent
inference assaults in VFL. When various institutions
want to collaborate on model training without sharing
raw data yet have different feature sets for the same
users, they employ VFL. In this strategy, the study
proposes replacing actual gradients with identical
gradients estimated using cosine distance, making
label inference attacks more difficult. This strategy
improves VFL security by protecting privacy while
DAML 2024 - International Conference on Data Analysis and Machine Learning
512