dB 43.00
SSIM 0.93
Precision 0.965
Recall 0.913
F1-Score 0.937
Table 2 indicates that SRGAN performed
exceptionally well in APTOS-2019, because of the
outstanding PSNR_dB, SSIM and Precision Recall
F1-score.
4 CONCLUSION
APTOS-2019 dataset includes 3662 diabetic
retinopathy fundus images, which are used to
downsample the LR images for model training. This
method simulates the clinical image-generating
challenges. The generator was first trained for 50
epochs to prevent the model from collapsing, after
that, both G and D performed the adversarial training
with a learning rate of 0.0001 and 500 epochs.
Aiming to increase the model robustness, data
augmentation is utilized at the beginning of training,
such as random cropping, horizontal and vertical
flipping, and rotation (every image is coped with the
normalization process)
SRGAN is specially designed for generating SR
retinopathy fundus images, because of the VGG-19-
based discriminator. It greatly enhances the resolution
of images and recreates the details. From the
experimental results, SRGAN has a potential clinical
image analysis application value, especially in
segment.
Although SRGAN performs well in this case,
there are still some challenges: certain areas in other
pictures may appear blurred, and low-level detail
reconstruction; Significant computational resources
are used in training that obstacles to wider application;
This dataset has limited variability of diabetic
retinopathy images, which may limit the
generalization ability of the model.
That is the reason why future research will focus
on improving the model generalization ability and
making it easier to be trained. In the end, integrating
SRGAN will aid in early detection and intervention,
aiming to develop medical technology.
REFERENCES
Abbas, R., Gu, N., 2023. Improving deep learning-based
image super-resolution with residual learning and
perceptual loss using SRGAN model. Soft Computing,
27(21), 16041-16057.
Deng, Z., Zhang, H., Liang, X., Yang, L., Xu, S., Zhu, J., &
Xing, E. P., 2017. Structured generative adversarial
networks. In Advances in neural information
processing systems, 30.
Elanwar, R., Betke, M., 2024. Generative adversarial
networks for handwriting image generation: a review.
The Visual Computer, 1-24.
Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B.,
Warde-Farley, D., Ozair, S., & Bengio, Y., 2020.
Generative adversarial networks. Communications of
the ACM, 63(11), 139-144.
Gonog, L., & Zhou, Y., 2019. A review: generative
adversarial networks. In 2019 14th IEEE Conference on
Industrial Electronics and Applications (ICTEA) (pp.
505-510). IEEE.
Jiang, X., Xu, Y., Wei, P., & Zhou, Z., 2020. CT image
super resolution based on improved SRGAN. In 2020
5th International Conference on Computer and
Communication Systems (ICCCS) (pp. 363-367). IEEE.
Ledig, C., Theis, L., Huszar, F., Caballero, J., Cunningham,
A., Acosta, A., & Shi, W., 2017. Photorealistic single
image super-resolution using a generative adversarial
network. In Proceedings of the IEEE conference on
computer vision and pattern recognition (pp. 4681-
4690).
Liu, B., & Chen, J., 2021. A super resolution algorithm
based on attention mechanism and SRGAN network.
IEEE Access, 9, 139138-139145.
Sajeeda, A., & Hossain, B. M., 2022. Exploring generative
adversarial networks and adversarial training.
International Journal of Cognitive Computing in
Engineering, 3, 78-89.
Wang, K., Gou, C., Duan, Y., Lin, Y., Zheng, X., & Wang,
F. Y., 2017. Generative adversarial networks:
introduction and outlook. IEEE/CAA Journal of
Automatica Sinica, 4(4), 588-598.