
rizing brain tumors into four classes: glioma, menin-
gioma, pituitary tumor, and no tumor. To achieve this,
the research focuses on:
• Applying Advanced Deep Learning Models: Uti-
lizing Convolutional Neural Networks (CNNs),
Vision Transformers (ViTs), and transfer learning
with fine-tuned pre-trained models (InceptionV3,
VGG16, and VGG19) to improve classification
accuracy and performance in brain tumor detec-
tion. The fine-tuned models are referred to as FT-
InceptionV3, FT-VGG16, and FT-VGG19.
• Exploring Hybrid Architectures and Ensemble
Learning: Developing and evaluating innovative
hybrid architectures by combining multiple mod-
els, such as CNN and ViT (Ens-CNN-VIT), ViT
and FT-InceptionV3 (Ens-VIT-FT-InceptionV3),
and FT-VGG16 and FT-InceptionV3 (Ens-FT-
VGG16-FT-InceptionV3). Ensemble learning en-
hances diagnostic precision by leveraging the
complementary strengths of different models,
mitigating their weaknesses, and improving over-
all robustness and generalization.
Through these objectives, the study aims to con-
tribute to the growing field of AI-driven medical di-
agnostics, ultimately enabling early, accurate detec-
tion and improved treatment of brain tumors, while
reducing the global burden of these life-threatening
conditions.
The remainder of this paper is organized as fol-
lows: Section 2 reviews related work on brain tumor
classification and the application of deep learning in
medical imaging. Section 3 outlines the proposed hy-
brid models. Section 4 presents the experimental re-
sults, followed by a comparative evaluation with pre-
vious studies in Section 5. Finally, Section 6 con-
cludes the paper and outlines future research direc-
tions.
2 RELATED WORK
In this section, we review the literature and findings
from research studies that focused on brain tumor de-
tection.
G
´
omez-Guzm
´
an et al.,(G
´
omez-Guzm
´
an et al.,
2023) evaluated seven deep CNN models for brain
tumor classification using MRI scans. Their study
compared a generic CNN with six pre-trained mod-
els, including InceptionV3, ResNet50, and Xception.
Among these, InceptionV3 achieved the highest ac-
curacy of 97.12%, demonstrating its ability to ex-
tract robust and distinctive features for classification.
Their findings emphasized the effectiveness of CNNs
in identifying brain tumors by capturing hierarchical
patterns in MRI images. This research contributed to
the advancement of automated brain tumor classifica-
tion, reinforcing the suitability of CNN-based archi-
tectures for medical imaging applications.
Khaliki and Basarslan(Khaliki and Bas¸arslan,
2024) advanced this research by incorporating trans-
fer learning techniques, leveraging pre-trained mod-
els like VGG16, VGG19, EfficientNetB4, and In-
ceptionV3. These architectures, initially trained on
large-scale datasets, were fine-tuned for brain tu-
mor classification tasks, achieving impressive accu-
racies of 98% and 97% with VGG16 and Efficient-
NetB4, respectively. The use of transfer learning sig-
nificantly reduced training time while enhancing the
models generalization capabilities on smaller medical
datasets. Their hybrid classification system demon-
strated that leveraging pre-trained networks could
bridge the gap between limited medical data availabil-
ity and high-precision diagnostics.
Balamurugan and Gnanamanoharan (Balamuru-
gan and Gnanamanoharan, 2023) and Fki et al., (Fki
et al., 2024)introduced a hybrid approach that com-
bined a Deep Convolutional Neural Network (DCNN)
with the enhanced LuNet algorithm for both segmen-
tation and classification of brain tumors. Their model
achieved unprecedented accuracies of 99.4% in seg-
mentation and 99.5% in classification, emphasizing
the efficacy of integrating advanced feature extraction
techniques with deep learning models.
Dhakshnamurthy et al., (Dhakshnamurthy et al.,
2024) proposed a hybrid model combining VGG16
and ResNet-50 architectures, achieving a ground-
breaking accuracy of 99.98%. This model outper-
formed standalone architectures such as AlexNet,
VGG16, and ResNet-50, emphasizing the effective-
ness of combining complementary strengths to en-
hance diagnostic precision. The integration of ResNet
skip connections with VGG16 deep feature extraction
layers facilitated improved gradient flow during train-
ing, addressing challenges associated with vanishing
gradients in deep networks.
Mahmud and Muntasir et al.,(Mahmud et al.,
2023) further contributed to this field by designing a
CNN-based approach for early detection of brain tu-
mors, achieving an accuracy of 93.3%. Their model
incorporated extensive preprocessing and data aug-
mentation techniques, including rotation, flipping,
and zooming, to enhance model robustness. By com-
paring their CNN model with architectures such as
ResNet-50 and InceptionV3, they demonstrated that
carefully designed CNN architectures could outper-
form more complex models in tasks requiring preci-
sion and computational efficiency.
DATA 2025 - 14th International Conference on Data Science, Technology and Applications
456