why deceptive Reviews is significant and how
machine learning models can help. The objectives
section shows the list of research objectives. The
description of data set and models used in the study
are explained in the methodology section. The
performance of the different models are presented in
the result and interpretation section. Finally, The
conclusion, which and interpretation section. Finally,
the conclusion, which follows a references section
containing a list of all the sources consulted,
concludes the study and makes recommendations for
further research.
1.2 Objectives
• To examine the machine learning models used
for deceptive Reviews system.
• To assess how well the machine learning
models, detect misleading reviews.
• To train and test a machine learning model to
identify fake and genuine reviews.
• To evaluate the model’s performance using
accuracy, precision, recall, and confusion
matrix.
2 RELATED WORKS
Several machine learning (ML) and deep learning
(DL) techniques have been used in recent advances in
fraudulent reviews, greatly increasing the precision
and effectiveness of detecting false information in
online reviews.
In order to improve the accuracy of deceptive
review identification across various platforms, a
number of researches have investigated sophisticated
machine learning and deep learning techniques. Sree
and Tripathi (2023) utilized Evidential Classifiers to
improve classification accuracy by leveraging
probabilistic reasoning in identifying deceptive
reviews. Similarly, Abdulqader et al. (2022)
developed a Unified Detection Model that integrates
deception theories with behavioral science to analyze
online review patterns, enhancing the detection of
fraudulent content. Chauhan et al. (2022) provided a
comprehensive review of techniques for detecting
fake images and videos, which can be extended to
identifying manipulated reviews through neural
networks and GAN-based models. Catelli et al.
(2023) proposed a method leveraging BERT and
ELECTRA for sentiment analysis to detect deceptive
reviews in datasets related to Italian cultural heritage,
demonstrating the effectiveness of deep learning
models in distinguishing deceptive content. Liu et al.
(2021) explored a multidimensional representation
approach with fine-grained aspect analysis to identify
deceptive reviews by modeling semantic
relationships and contextual information.
Furthermore, Tufail et al. (2022) investigated the
impact of fake reviews on e-commerce platforms
during and after the COVID-19 pandemic and
introduced SKL-based models using K-Nearest
Neighbor (KNN) and Support Vector Machine
(SVM) to classify reviews as genuine or deceptive
(
Pandit, Anala 2018). Deep learning models, especially
convolutional neural networks (CNNs), have proven
to be effective in establishing robust classification
baselines by capturing subtle patterns in review data
(
Rathore et al., 2023). These models demonstrate
superior performance in analyzing contextual
information, sentiment polarity, and behavioral
patterns that distinguish genuine reviews from fake
ones.
3 METHODOLOGY
In this study focuses on deceptive Reviews by first
pre-processing the text data through steps like
removing punctuation, converting to lowercase,
eliminating stop words, and ap- plying stemming.
The dataset is split into training, validation, and
testing subsets, where the model undergoes training,
fine- tuning, and performance evaluation,
respectively. This process involves preprocessing the
textual data by removing irrelevant terms, applying
stemming techniques, and converting the text into
numerical form using TF-IDF. The figure 1 shows the
Flow of the work. The model’s performance is
assessed through evaluation metrics such as accuracy,
precision, recall, F1-score, and ROC-AUC.
Furthermore, a confusion matrix is employed to
analyze and understand the nature of prediction
errors. The models are then tested on unseen data to
ensure they generalize well to new inputs. This
methodology allows us to identify the most effective
model for accurately detecting fake reviews.
3.1 Stemming
In deceptive Reviews, writers may use different
forms of words, such as” buying”,” bought”, and”
buys”, which all con- vey similar meaning. Stemming
normalizes these variations to a single root word like”
buy”, reducing the vocabulary size, and improving
model efficiency. This process helps the model
generalize better by focusing on the core meaning of
the text. In deceptive review detection, stemming