• Advanced Noise Cancellation via Machin
e Learning: Adding Machine Learning
models for noise cancellation can
significantly improve the accuracy of the
system in the noisy environment. Methods
like neural networks trained to remove
background noise could help reduce
mistakes and would make the system more
robust across a range of use cases.
• Multilingual Support: By extending the
system to accommodate multiple
languages, its accessibility and applicability
would be improved, especially in
multilingual areas. This can be
accomplished by supplementing multiplexi
ng speech recognition engines or augmentin
g already selected models inside speech
engines such as Google’s API or CMU
Sphinx.
• Portability and Compact Design: The
Raspberry Pi is of small size which will be
usable for portable purposes but some
additional changes can be made to make it
usability. With battery power for mobility
and a more compact system perhaps a
smaller display or wireless connectivity in
the user experience category, you get one
more aspect of the technology's versatility,
particularly for use-cases on the move like
wearables or assistive technology for the
hearing impaired.
• Improved Speech Models: Developing
speech models better suited to working in
noisy environments or outdoors could also
boost the accuracy. Fine-tuning or training
models on particular background noise
profiles, accents, or use cases may result in
improved performance in those scenarios.
9 CONCLUSIONS
In this paper, we have described the process of
building a real time speech-to-text converter using
Raspberry Pi which demonstrates the potential of
developing low cost portable system that utilizes
open-source software and off-the-shelf hardware
components. The proposed system emerges as
promising in both clean and moderately noisy
prescriptions, which can be helpful in many work
domains, such as transcription, power accessibility,
and education. This enables flexibility in the
deployment environment: it can be through Google’s
Speech-to-Text API in online setups or with CMU
Sphinx for offline usage, depending on user
requirements concerning internet connectivity and
data privacy. The Raspberry Pi used as processing
unit also highlights the feasibility to integrate such
speech-to-text systems on cost-$ and energy-$
constrained embedded platforms. While its accuracy
dips in high-noise settings, incorporating noise-
cancellation methods and machine learning models
presents a straightforward solution for enhancement.
Also expect to see more features in the future, like
multilingual support, battery integration for
portability, and more advanced audio models for
richer environmental settings.
Overall, we believe that with some more
optimizations, particularly in its handling of noise and
its computational efficiency, this speech-to-text
system can serve as a strong backbone for other
applications happening in real-time on the phone, and
we hope that this work is a step forward towards
making this model widely usable in more and more
settings.
REFERENCES
Home automation using raspberry Pi through Siri enabled
mobile devices, December 2015 DOI:10.1109/HNICE
M.2015.7393270 Available at: https://www.researchg
ate.net/publication/30 4297304_Home_automation_us
ing_raspberr y_Pi_ through_ Siri_e nabled_ mobile_
devices
Ivan Froiz- Miguel, Paula Fraga-Lamas, Design,
Implementation, and Practical Evaluation of a Voice
Recognition Based IoT Home Automation System for
Low-Resource Languages. June 2023 DOI:10.1109
ACCESS.2023.3286391 Available at: https://ieeexplo
re.ieee.org/stamp/stamp.jsp? arnumber=10151879
K. Lakshmi, Mr. T. Chandra Sekhar Rao. Design and
Implementation of Text to Speech Conversion Using
Raspberry PI, Vol 4, No 6 (2016) Available at:
https://www.ijitr.com/index.php/ojs/article/v iew/1287
M. Gales and S. Young, The Application of Hidden Markov
Models in Speech Recognition. Foundations and
Trends R in Signal Processing Vol. 1, No. 3 (2007)
195–304 c 2008 DOI: 10.1561/2000000004
Availableat: https://mi.eng.cam.ac.uk/~mjfg/mjfg_NO
W. pdf
Prachi Khilari, Prof. Bhope V. P Implementation of Speech
to Text Conversion. Vol. 4, Issue 7, July 2015
Available at:https://www.ijirset.com/upload/2015/july/
16 7_Implementation.pdf
Surinder Kaur, Sanchit Sharma,Voice Command System
Using Raspberry PI. July 2016 DOI:10.5121/acii.2016
.3306 Availableat: https:// www.researchgate.net/publi
cation/30 5922778_Voice Command_System_ Using_
Raspberry_Pi