Toward Designing a Reduced Phone Set Using Text Decoding Accuracy Estimates in Speech BCI

Shuji Komeiji, Koichi Shinoda, Toshihisa Tanaka

2025

Abstract

Reducing the phone set in speech recognition or speech brain-computer interface (BCI) tasks improves phone discrimination accuracy. This reduction may also degrade text decoding accuracy due to increased homonyms. To address this, we propose a novel estimator called the Generalized Pronunciation/Word Confusion Rate (GPWCR), which estimates text decoding accuracy by considering both phone discrimination performance and the number of homonyms. By minimizing the GPWCR, we designed the optimal reduced phone set. Experimental results from Japanese large vocabulary speech recognition demonstrate that the optimal phone set, reduced from 39 to 38 phones, lowered the word error rate from 14.1% to 13.8%.

Download


Paper Citation


in Harvard Style

Komeiji S., Shinoda K. and Tanaka T. (2025). Toward Designing a Reduced Phone Set Using Text Decoding Accuracy Estimates in Speech BCI. In Proceedings of the 18th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 1: BIOSIGNALS; ISBN 978-989-758-731-3, SciTePress, pages 980-987. DOI: 10.5220/0013265800003911


in Bibtex Style

@conference{biosignals25,
author={Shuji Komeiji and Koichi Shinoda and Toshihisa Tanaka},
title={Toward Designing a Reduced Phone Set Using Text Decoding Accuracy Estimates in Speech BCI},
booktitle={Proceedings of the 18th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 1: BIOSIGNALS},
year={2025},
pages={980-987},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013265800003911},
isbn={978-989-758-731-3},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 18th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 1: BIOSIGNALS
TI - Toward Designing a Reduced Phone Set Using Text Decoding Accuracy Estimates in Speech BCI
SN - 978-989-758-731-3
AU - Komeiji S.
AU - Shinoda K.
AU - Tanaka T.
PY - 2025
SP - 980
EP - 987
DO - 10.5220/0013265800003911
PB - SciTePress