A Fuzzy Logic Approach to Improve Phone Segmentation - A Case Study of the Dutch Language

Victor Milewski, Aysenur Bilgin, Tufan Kumbasar

2017

Abstract

Phone segmentation is an essential task for Automatic Speech Recognition (ASR) systems, which still lack in performance when compared to the ability of humans' speech recognition. In this paper, we propose novel Fuzzy Logic (FL) based approaches for the prediction of phone durations using linguistic features. To the best of our knowledge, this is the first development and deployment of FL based approaches in the area of phone segmentation. In this study, we perform a case study on the Dutch IFA corpus, which consists of 50000 words. Different experiments are conducted on tuned FL Systems (FLSs) and Neural Networks (NNs). The experimental results show that FLSs are more efficient in phone duration prediction in comparison to their Neural Network counterparts. Furthermore, we observe that differentiating between the vowels and the consonants improves the performance of predictions, which can facilitate enhanced ASR systems. The FLS with the differentiation between vowels and consonants had an average Mean Average Precision Error of 43.3396\% on a k=3 fold. We believe that this first attempt of the employment of FL based approaches will be an important step for a wider deployment of FL in the area of ASR systems.

Download


Paper Citation


in Harvard Style

Milewski V., Bilgin A. and Kumbasar T. (2017). A Fuzzy Logic Approach to Improve Phone Segmentation - A Case Study of the Dutch Language.In Proceedings of the 9th International Joint Conference on Computational Intelligence - Volume 1: IJCCI, ISBN 978-989-758-274-5, pages 64-72. DOI: 10.5220/0006499800640072


in Bibtex Style

@conference{ijcci17,
author={Victor Milewski and Aysenur Bilgin and Tufan Kumbasar},
title={A Fuzzy Logic Approach to Improve Phone Segmentation - A Case Study of the Dutch Language},
booktitle={Proceedings of the 9th International Joint Conference on Computational Intelligence - Volume 1: IJCCI,},
year={2017},
pages={64-72},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006499800640072},
isbn={978-989-758-274-5},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 9th International Joint Conference on Computational Intelligence - Volume 1: IJCCI,
TI - A Fuzzy Logic Approach to Improve Phone Segmentation - A Case Study of the Dutch Language
SN - 978-989-758-274-5
AU - Milewski V.
AU - Bilgin A.
AU - Kumbasar T.
PY - 2017
SP - 64
EP - 72
DO - 10.5220/0006499800640072