after the presence of all mandatory attributes as per
their FD counterparts, and should be assigned FD
status instead. In the latter case, our templates will
tremendously benefit in increasing the number of FD
concepts in SNOMED-CT and thereby increasing its
rate of adoption (Schulz et al., 2009).
Furthermore, although the EA method has solved
the problem of the skewed distribution of FD and PD
templates, this issue can only be handled if the most
general sample-set exists in the FD template. For ex-
ample, PD DID-myopathy concepts could be audited
against DID template. However, if DID (the most
general template) were absent there would not be
any template to audit the skewed PD DID-myopathy
concepts. A possible solution to this problem would
be manually completing the definition of at least one
of the PD defined concepts to extract a FD template
and then using that template to audit the remaining
PD concepts belonging to the matching sample-set
(Burse et al., 2022b).
Finally, given the semi-automatic nature of the
method, the level of in-depth analysis (as available
in manually curated SNOMED-CT templates) may
not always be feasible. However, extraction of the
basic templates is currently the best option to stan-
dardize thousands of existing SNOMED-CT concepts
into predefined templates. The basic templates ex-
tracted automatically can then be refined into sophis-
ticated templates after manual inspection by domain
experts. Although the level of templates extracted by
this semi-automated method will not be as detailed as
the ones that were manually created by the domain
experts, this is a good start to identify repeating pat-
terns and ascertain if there are any inconsistencies in
the way these concepts are currently modeled. Com-
plete automation of QA of biomedical ontologies will
continue to be a challenge in the health-informatics
domain. We believe that our contribution will aid in
a complementary way to ease the manual efforts of
SNOMED-CT curators.
6 CONCLUSION & FUTURE
WORK
In this work, we presented an improved method to
extract mandatory attribute relationship templates for
SNOMED-CT concepts by considering FD concepts
as a source of ground truth. The method has ex-
tracted a multitude of new templates over the basic
implementation (Burse et al., 2022b). The auditing
results that identified inconsistent PD concepts have
shown promising potential to highlight inconsisten-
cies in the modeling styles of lexically similar con-
cepts. An interesting direction for future work would
be resolving the atomic annotator bottleneck (Burse
et al., 2022b; Burse et al., 2022a) in order to increase
the coverage of SNOMED-CT concepts being ana-
lyzed. Indeed the atomic annotator restricts the num-
ber of SNOMED-CT concepts being processed based
on the length of their FSNs and limits the number of
atomic dictionaries created due to its semi-automatic
nature. We believe the results would further improve
after resolving this bottleneck.
REFERENCES
Agrawal, A. (2018). Evaluating lexical similarity and
modeling discrepancies in the procedure hierarchy of
snomed ct. BMC Medical Informatics and Decision
Making, 18.
Amith, M. T., He, Z., Bian, J., Lossio-Ventura, J. A., and
Tao, C. (2018). Assessing the practice of biomedical
ontology evaluation: Gaps and opportunities. Journal
of biomedical informatics, 80:1–13.
Burse, R., Bertolotto, M., and Mcardle, G. (2022a). A novel
atomic annotator for quality assurance of biomedical
ontologies. In HEALTHINF.
Burse, R., Bertolotto, M., O’Sullivan, D. M., and Mcardle,
G. (2021). Semantic interoperability: the future of
healthcare.
Burse, R., Mcardle, G., and Bertolotto, M. (2022b). Tar-
geting stopwords for quality assurance of snomed-
ct. International journal of medical informatics,
167:104870.
Ceusters, W., Smith, B., Kumar, A., and Dhaen, C. (2004).
Mistakes in medical ontologies: where do they come
from and how can they be detected? Studies in health
technology and informatics, 102:145–63.
Duarte, J., Castro, S., Santos, M. F., Abelha, A., and
Machado, J. (2014). Improving quality of electronic
health records with snomed. In CENTERIS 2014.
IHTSDO (2002a). Ihtsdo snomed international conflu-
ence. https://confluence.ihtsdotools.org/. last accessed
26/10/2021.
IHTSDO (2002b). Sct modelling templates and description
patterns. https://confluence.ihtsdotools.org/display/
SCTEMPLATES/SCT+Modeling+Templates+and+
description+patterns. last accessed 25/09/2022.
IHTSDO (2002c). Sct template specification.
https://confluence.ihtsdotools.org/display/SCTEMPL
ATES/Template+specification. last accessed
25/09/2022.
IHTSDO (2002d). What does it mean if a concept is
fully-defined or primitive and how do i tell the
difference. https://ihtsdo.fresh desk.com/support/
solutions/articles/4000050378-what-does-it-mean-if-
a-concept-is-fully-defined-or-primitive-and-how-do-
i-t. last accessed 15/12/2021.
Facilitating SNOMED-CT Template Creation by Targeting Stopwords
285