How Different Elements of Audio Affect the Word Error Rate of Transcripts in Automated Medical Reporting

Emma Kwint; Anna Zoet; Katsiaryna Labunets; Sjaak Brinkkemper

doi:10.5220/0011794100003414

How Different Elements of Audio Affect the Word Error Rate of Transcripts in Automated Medical Reporting

Emma Kwint, Anna Zoet, Katsiaryna Labunets, Sjaak Brinkkemper

2023

Abstract

Automated Speech Recognition software is implemented in different fields. One of them is healthcare in which it can be used for automated medical reporting, the field of focus of this research. For the first step of automated medical reporting, audio files of consultations need to be transcribed. This research contributes to the investigation of the optimization of the generated transcriptions, focusing on categorizing audio files on specific characteristics before analyzing them. The literature research within this study shows that specific elements of speech signals and audio, such as accent, voice frequency and noise, can have influence on the quality of a transcription an Automated Speech Recognition system carries out. By analyzing existing medical audio data and conducting an pilot experiment, the influence of those elements is established. This is done by calculating the Word Error Rate of the transcriptions, a useful percentage that shows the accuracy. Results of the analysis of the existing data show that noise is an element that carries out significant differences. However the data of the experiment did not show significant differences. This was mainly due to having not enough participants to reason with significance. Further research into the effect of noise, language and different Automated Speech Recognition technologies should be done based on the outcomes of this research.

Download

Paper Citation

in Harvard Style

Kwint E., Zoet A., Labunets K. and Brinkkemper S. (2023). How Different Elements of Audio Affect the Word Error Rate of Transcripts in Automated Medical Reporting. In Proceedings of the 16th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2023) - Volume 5: HEALTHINF; ISBN 978-989-758-631-6, SciTePress, pages 179-187. DOI: 10.5220/0011794100003414

in Bibtex Style

@conference{healthinf23,
author={Emma Kwint and Anna Zoet and Katsiaryna Labunets and Sjaak Brinkkemper},
title={How Different Elements of Audio Affect the Word Error Rate of Transcripts in Automated Medical Reporting},
booktitle={Proceedings of the 16th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2023) - Volume 5: HEALTHINF},
year={2023},
pages={179-187},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011794100003414},
isbn={978-989-758-631-6},
}

in EndNote Style

TY - CONF

JO - Proceedings of the 16th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2023) - Volume 5: HEALTHINF
TI - How Different Elements of Audio Affect the Word Error Rate of Transcripts in Automated Medical Reporting
SN - 978-989-758-631-6
AU - Kwint E.
AU - Zoet A.
AU - Labunets K.
AU - Brinkkemper S.
PY - 2023
SP - 179
EP - 187
DO - 10.5220/0011794100003414
PB - SciTePress