loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Scott McLachlan 1 ; Kudakwashe Dube 2 ; Thomas Gallagher 3 ; Bridget Daley 4 and Jason Walonoski 5

Affiliations: 1 Queen Mary University of London, United Kingdom ; 2 Massey University, New Zealand ; 3 University of Montana, United States ; 4 Western Sydney Local Health District, Australia ; 5 The MITRE Corporation, United States

Keyword(s): Synthetic Data, Synthetic Healthcare Record, Knowledge Discovery, Data Mining, Electronic Health Records, Computer Simulation, ATEN Framework, Validation, RS-EHR.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Biomedical Engineering ; Confidentiality and Data Security ; Data Mining ; Databases and Information Systems Integration ; Enterprise Information Systems ; Health Information Systems ; Sensor Networks ; Signal Processing ; Soft Computing

Abstract: Realistic synthetic data are increasingly being recognized as solutions to lack of data or privacy concerns in healthcare and other domains, yet little effort has been expended in establishing a generic framework for characterizing, achieving and validating realism in Synthetic Data Generation (SDG). The objectives of this paper are to: (1) present a characterization of the concept of realism as it applies to synthetic data; and (2) present and demonstrate application of the generic ATEN Framework for achieving and validating realism for SDG. The characterization of realism is developed through insights obtained from analysis of the literature on SDG. The development of the generic methods for achieving and validating realism for synthetic data was achieved by using knowledge discovery in databases (KDD), data mining enhanced with concept analysis and identification of characteristic, and classification rules. Application of this framework is demonstrated by using the synthetic Elect ronic Healthcare Record (EHR) for the domain of midwifery. The knowledge discovery process improves and expedites the generation process; having a more complex and complete understanding of the knowledge required to create the synthetic data significantly reduce the number of generation iterations. The validation process shows similar efficiencies through using the knowledge discovered as the elements for assessing the generated synthetic data. Successful validation supports claims of success and resolves whether the synthetic data is a sufficient replacement for real data. The ATEN Framework supports the researcher in identifying the knowledge elements that need to be synthesized, as well as supporting claims of sufficient realism through the use of that knowledge in a structured approach to validation. When used for SDG, the ATEN Framework enables a complete analysis of source data for knowledge necessary for correct generation. The ATEN Framework ensures the researcher that the synthetic data being created is realistic enough for the replacement of real data for a given use-case. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.117.182.179

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
McLachlan, S.; Dube, K.; Gallagher, T.; Daley, B. and Walonoski, J. (2018). The ATEN Framework for Creating the Realistic Synthetic Electronic Health Record. In Proceedings of the 11th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2018) - HEALTHINF; ISBN 978-989-758-281-3; ISSN 2184-4305, SciTePress, pages 220-230. DOI: 10.5220/0006677602200230

@conference{healthinf18,
author={Scott McLachlan. and Kudakwashe Dube. and Thomas Gallagher. and Bridget Daley. and Jason Walonoski.},
title={The ATEN Framework for Creating the Realistic Synthetic Electronic Health Record},
booktitle={Proceedings of the 11th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2018) - HEALTHINF},
year={2018},
pages={220-230},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006677602200230},
isbn={978-989-758-281-3},
issn={2184-4305},
}

TY - CONF

JO - Proceedings of the 11th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2018) - HEALTHINF
TI - The ATEN Framework for Creating the Realistic Synthetic Electronic Health Record
SN - 978-989-758-281-3
IS - 2184-4305
AU - McLachlan, S.
AU - Dube, K.
AU - Gallagher, T.
AU - Daley, B.
AU - Walonoski, J.
PY - 2018
SP - 220
EP - 230
DO - 10.5220/0006677602200230
PB - SciTePress