Audproc 2
Audproc 2
Audproc 2
PROCESSING
ARIHARASUDHAN
INTRODUCTION
Hello World! Recognizing speech is an
art, fascinating to learn. It’s Ari from
The South to dig deeper the concepts
of Speech recognition. It can be
simply defined as “The Process of
enabling a MODEL to recognize the
text in the speech or audio sample”.
Apparently, it is a way of
communicating with the computer
through speech. It involves
processing the audio which is to be
discussed in detail in a little while.
CHALLENGES
Yet, we have to overcome a lot of
challenges. What if we have to
recognize the speech of the following
person? (Not that of Rowan Atkinson
but MR.BEAN!)
Natural speech is highly variable due
to differences in accents, speaking
rates, and individual styles. This
variability poses a significant
challenge for speech recognition
systems. Environmental noise can
degrade the performance of speech
recognition systems. Robust
algorithms are required to filter out
background noise and focus on the
relevant speech signal. Speech
recognition systems need to handle
input from different speakers, each
with their unique characteristics.
Training models to be speaker
independent is crucial for
widespread applicability. The
system's ability to recognize a wide
vocabulary is essential.
SOUND: WHAT IS IT