Using Neural Network to Develop Speech Recognition
Yida Liu
2024
Abstract
The application of neural networks in the field of speech recognition has made remarkable progress in recent years, which greatly improves the accuracy and robustness of the system. This paper reviews the key technologies and recent progress of neural networks in speech recognition, with emphasis on different types of neural network architectures, such as multi-layer perceptrons (MLP), convolutional neural networks (CNN), and recurrent neural networks (RNN), and their specific applications in processing speech signals. This paper also discusses the differences between deep neural network synthesis models and end-to-end systems, analyzes the existing methods, evaluates the performance of these two current mainstream speech recognition systems from different perspectives such as the usefulness of methods, response speed and synthesis, and analyzes their performance in different languages, noisy environments and speaker variation. Finally, the purpose of this study is to analyze the future development trend of neural network speech recognition, such as more efficient model structure, cross-language transfer learning, and the ability to analyze and discriminate speech intonation. Through these discussions, this paper provides a valuable reference for the future development direction of speech recognition technology.
DownloadPaper Citation
in Harvard Style
Liu Y. (2024). Using Neural Network to Develop Speech Recognition. In Proceedings of the 1st International Conference on Modern Logistics and Supply Chain Management - Volume 1: MLSCM; ISBN 978-989-758-738-2, SciTePress, pages 139-145. DOI: 10.5220/0013235200004558
in Bibtex Style
@conference{mlscm24,
author={Yida Liu},
title={Using Neural Network to Develop Speech Recognition},
booktitle={Proceedings of the 1st International Conference on Modern Logistics and Supply Chain Management - Volume 1: MLSCM},
year={2024},
pages={139-145},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013235200004558},
isbn={978-989-758-738-2},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 1st International Conference on Modern Logistics and Supply Chain Management - Volume 1: MLSCM
TI - Using Neural Network to Develop Speech Recognition
SN - 978-989-758-738-2
AU - Liu Y.
PY - 2024
SP - 139
EP - 145
DO - 10.5220/0013235200004558
PB - SciTePress