
the tests carried out on a very broad range of clas-
sifiers that the same pattern has been replicated-
regarding robustness and generalizability of the syn-
thetic datasets. The adoption of CTGAN in generat-
ing tabular datasets and DCGAN for chest X-ray im-
ages has shown promising results. First, with the tab-
ular data, this research has been able to synthesize the
data set quite realistically, while keeping the statisti-
cal patterns that were inherent in the original data set.
Second, the DCGAN managed to generate such chest
X-rays that maintained diagnostic relevance, along
with a high degree of visual fidelity.
It has also been observed that different machine
learning models yield similar performance, hence, un-
derlines the versatility of the synthetic datasets. The
consistency of results across various classifiers sug-
gests that synthetic datasets can easily be integrated
into a wide variety of machine learning tasks and
may become one of the most promising avenues for
privacy-preserving applications in both medical imag-
ing and tabular data analysis. It can be inferred that
when GANs are developed on specific data types, they
can generate synthetic datasets which can perform
just as well as real data in training a model without
revealing sensitive information. Therefore, there is a
potential in applying the proposed approach in other
medical domains. Its validation against real-world di-
verse datasets, and the integration of other measures
that may enhance privacy could form new avenues of
interest in future studies. The proposed approach con-
tributes to enhance machine learning model develop-
ment with a view to privacy by providing a reliable
and consistent framework for synthetic data genera-
tion in different modalities.
ACKNOWLEDGEMENTS
The authors would like to express their sincere grat-
itude to Aarya Gard and Harshal Jain for their valu-
able assistance with simulation and conducting exper-
iments.
REFERENCES
Beriwal, S., Thirunavukkarasu, K., Khan, S., and Abiman-
nan, S. (2022). Detection of infectious diseases in hu-
man bodies by using machine learning algorithms. In
Handbook of Research on Machine Learning: Foun-
dations and Applications, page 229.
Cai, Z., Xiong, Z., Xu, H., Wang, P., Li, W., and Pan, Y.
(2021). Generative adversarial networks: A survey
toward private and secure applications. ACM Com-
puting Surveys (CSUR), 54(6):1–38.
Chen, Y. and Esmaeilzadeh, P. (2024). Generative ai in
medical practice: in-depth exploration of privacy and
security challenges. Journal of Medical Internet Re-
search, 26:e53008.
Grassucci, E., Cicero, E., and Comminiello, D. (2022).
Quaternion generative adversarial networks. In Gen-
erative Adversarial Learning: Architectures and Ap-
plications, pages 57–86. Springer International Pub-
lishing.
Janosi, A., Steinbrunn, W., Pfisterer, M., and Detrano, R.
(1989). Heart disease [dataset]. UCI Machine Learn-
ing Repository.
Kermany, D., Zhang, K., and Goldbaum, M. (2018). La-
beled optical coherence tomography (oct) and chest
x-ray images for classification. Mendeley Data, V2.
Khan, S., Thirunavukkarasu, K., Hammad, R., Bali, V., and
Qader, M. R. (2021). Convolutional neural network
based sars-cov-2 patients detection model using ct im-
ages. International Journal of Intelligent Engineering
Informatics, 9(2):211–228.
Prowal, P., Singh, A. S., Thirunavukkarasu, K., Khan, S.,
and Qader, M. R. (2021). Machine learning algo-
rithms based on feature selection method used for the
prediction of breast cancer. In 2021 IEEE Interna-
tional Conference on Data Analytics for Business and
Industry (ICDABI), pages 100–106.
Shafik, W. (2025). Generative adversarial networks: Secu-
rity, privacy, and ethical considerations. In Generative
Artificial Intelligence (AI) Approaches for Industrial
Applications, pages 93–117. Springer Nature Switzer-
land.
Venugopal, R., Shafqat, N., Venugopal, I., Tillbury, B.,
Stafford, H., and Bourazeri, A. (2022). Privacy pre-
serving generative adversarial networks to model elec-
tronic health records. Neural Networks, 153:339–348.
Utilizing Generative Adversarial Networks for Preserving Privacy in Developing Machine Learning Models for the Healthcare Industry
551