0.5, the F1-score can not fully reference the priorities
of Recall or Precision in a certain realistic application.
Notably in an ischaemic stroke forecast, Recall is far
more important to declare the model’s performance
than Precision. TranSOP just right gains a relatively
high score in Recall yet a rather low value of
Precision. Hence, the F1-score can not play a
dominant part in indicators to conjecture which model
performs best. From a comprehensive perspective,
TranSOP indisputably becomes the most appropriate
model to predict ischaemic stroke.
Table 2: Comparison between three models.
Model F1-score AUC ACC
XGBoost&xDeepFM 68.69 71.37 73.91
SSMMSRP 65.96 85.65 75.73
TRANSOP 59.72 88.41 80.02
The next experiment is devised to discriminate
whether unimodal or multimodal input can train
TranSOP to forecast a more fastidious upshot. In the
pre-training process, two forms of unimodal, image
and text solely, were pitched on to input TranSOP.
Based on a primitive version of TRANSOP, image or
textual modal has been previously processed into ViT
(Chen et al., 2021), DeiT (Li et al., 2021), SwinT
(Touvron et al., 2022) or BERT, ALBERT (Liu et al.,
2021), and RoBERTa (Liu et al., 2019). Finally, as is
exposed (Table 3), almost all multimodal inputs
outmaneuver unimodal inputs, which suggests that
multimodal data are dedicated to better performance
for TranSOP and to spur TranSOP to be increasingly
precise.
Table 3: Comparison between unimodal and
multimodal.
Model Image
modal
Text
modal
F1-
score
AUC ACC
Unimodal None BERT 0.87 0.74 0.82
Unimodal None RoBERTa 0.85 0.72 0.82
Unimodal None ALBERT 0.87 0.76 0.87
Unimodal ViT None 0.56 0.36 0.65
Unimodal DeiT None 0.55 0.40 0.65
Unimodal SwinT None 0.55 0.39 0.60
Multimodal DeiT BERT 0.89 0.75 0.84
Multimodal SwinT BERT 0.89 0.75 0.84
Multimodal SwinT RoBERTa 0.89 0.78 0.84
In the meantime, the discourse (Lan et al., 2019)
also suggested that SSMMSRP outperforms single-
modality methods by 2.6% for both image and tabular
input in ROC-AUC, ameliorating by 3.3%, 5.6% for
image and tabular single-modality respectively in
balanced accuracy terms. Meanwhile, the
experimental outcome is nearly similar to (Delgrange
et al., 2024), which vindicates the legitimacy of this
experiment.
4 CONCLUSIONS
In this paper, three disparate models
XGBoost&xDeepFM, SSMMSRP and TranSOP are
selected to participate in the ischaemic stroke
prediction performance test and detect diverse
Transformer architecture-based Large Language
models, based on MR CLEAN dataset gleaned and
collated by an professional American disease
institution center. The upshot has disclosed that
TranSOP possessed the best comprehensive
performance in portending ischaemic stroke among
all the modules selected, and combined multimodality
data came to more remarkable fruition than any single
modality. Despite rigmarole in processing billions of
parameters, high complexity in the hierarchical
structure and deficiency in distinguishing exclusive
textual single-modality, a Transformer structure-
based Large Language model still takes on a role as
an augur for predicting ischaemic stroke. In future
work, such Transformer-based Multimodal Large
Language models like TranSOP are hopeful to be
prophets for a wide range of cardiovascular and
cranial diseases.
REFERENCES
Chadha A. Stroke Prediction using Clinical and Social
Features in Machine Learning. arXiv. 2025.
Chen, J., Lu, Y., Yu, Q., Luo, X., Adeli, E., Wang, Y., ... &
Zhou, Y. TransUNet: Transformers make strong
encoders for medical image segmentation. arXiv
preprint arXiv:2102.04306. 2021.
Dai W., Jiang Y., Mou C., Zhang C. An Integrative
Paradigm for Enhanced Stroke Prediction: Synergizing
XGBoost and xDeepFM Algorithms. ICBDT 2023.
2023.
Delgrange C., Demler O., Mora S., Menze B., de la Rosa
E., Davoudi N. A Self-Supervised Model for Multi-
modal Stroke Risk Prediction. arXiv. 2024.
Du S., Zheng S., Wang Y., Bai W., O’Regan D. P., Qin C.
TIP: Tabular-Image Pre-training for Multimodal
Classification with Incomplete Data. arXiv. July 2024.
Fernandez Lozano C., Hervella P., Mato Abad V., et al.
Random forest based prediction of stroke outcome.
Scientific Reports, 11:10071. doi:10.1038/s41598-021-
89434-7. 2021.
hosla A., Cao Y., Lin C. C. Y., Chiu H. K., Hu J., & Lee H.
An integrated machine learning approach to stroke
prediction. Proceedings of the 16th ACM SIGKDD