Toward Designing a Reduced Phone Set Using Text Decoding Accuracy Estimates in Speech BCI
Shuji Komeiji, Koichi Shinoda, Toshihisa Tanaka
2025
Abstract
Reducing the phone set in speech recognition or speech brain-computer interface (BCI) tasks improves phone discrimination accuracy. This reduction may also degrade text decoding accuracy due to increased homonyms. To address this, we propose a novel estimator called the Generalized Pronunciation/Word Confusion Rate (GPWCR), which estimates text decoding accuracy by considering both phone discrimination performance and the number of homonyms. By minimizing the GPWCR, we designed the optimal reduced phone set. Experimental results from Japanese large vocabulary speech recognition demonstrate that the optimal phone set, reduced from 39 to 38 phones, lowered the word error rate from 14.1% to 13.8%.
DownloadPaper Citation
in Harvard Style
Komeiji S., Shinoda K. and Tanaka T. (2025). Toward Designing a Reduced Phone Set Using Text Decoding Accuracy Estimates in Speech BCI. In Proceedings of the 18th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 1: BIOSIGNALS; ISBN 978-989-758-731-3, SciTePress, pages 980-987. DOI: 10.5220/0013265800003911
in Bibtex Style
@conference{biosignals25,
author={Shuji Komeiji and Koichi Shinoda and Toshihisa Tanaka},
title={Toward Designing a Reduced Phone Set Using Text Decoding Accuracy Estimates in Speech BCI},
booktitle={Proceedings of the 18th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 1: BIOSIGNALS},
year={2025},
pages={980-987},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013265800003911},
isbn={978-989-758-731-3},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 18th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 1: BIOSIGNALS
TI - Toward Designing a Reduced Phone Set Using Text Decoding Accuracy Estimates in Speech BCI
SN - 978-989-758-731-3
AU - Komeiji S.
AU - Shinoda K.
AU - Tanaka T.
PY - 2025
SP - 980
EP - 987
DO - 10.5220/0013265800003911
PB - SciTePress