Speech Recognition Using DSP PDF
Speech Recognition Using DSP PDF
Speech Recognition Using DSP PDF
By:
Mohammed Wahhab Abdulrazzaq
Abstract
Speech recognition is the process of converting spoken words into text or commands that a
computer can understand.
It involves several steps, including signal processing, feature extraction, and classification.
Digital signal processing (DSP) plays a crucial role in speech recognition by processing and
analyzing the raw speech signal to extract relevant features such as pitch, zero crossing level
and short term energy level.
These features are then used to recognize the male and female speech by using Digital filters
(low pass filter) and features extraction.
In this project 13 audio were tested and recognized as male and female.
After these tests, we found the females speech has more values than males for the above
mentioned characteristics.
Contents
Introduction
Problem to be Discussed
Apparatus of Articulation
Voice Production
Speech Signal
Conclusion
References
Introduction
• Speech recognition using DSP is the process of converting speech signals into
digital representations, which can be processed and analyzed to recognize
spoken words or phrases.
• The first step in speech recognition using DSP is to process the analog speech
signal, typically by digitizing it using an analog-to-digital converter (ADC). This
digital representation of the speech signal can then be processed using various
DSP techniques, such as filtering, feature extraction, and pattern recognition.
Problem to be Discussed
• The mouth, called also oral cavity, in addition to its primary role as
the beginning of the digestive system, also plays a significant role in
communication in humans.
• While primary aspects of the voice are produced in the throat, the
tongue, lips, and jaw are also needed to produce the range of sounds
included in human language.
Step 4: Feature extraction, Features are extracted from the preprocessed speech
signal.
Step 5: Feature analyzing, the extracted features are analyzed and compared with
the proved standards.
Step 6: Signal recognition, judging the signal is either for male or female according
to feature analyzing.
Simulation and Results
• For implementing this project, the MATLAB simulation tool is used.
• Simulation Parameters:
1. 13 sounds are tested.
2. Low Pass filter with cut of freq.=1000 Hz
3. fundamental_freq_level=135
4. zero_crossing_level=12
5. short_energy_level=0.5
Simulation and Results
Simulation and Results
Simulation and Results
Simulation and Results
Simulation and Results
Simulation and Results
Simulation and Results
Simulation and Results
Simulation and Results
Simulation and Results
Conclusion
• Speech signal differs from one human to another.
• Speech signal has different characteristics.
• DSP techniques can be used to extract and analyze the signal
features.
• It is demonstrated the female speech signal has higher values in term
of Fundamental Frequencies, Zero crossing level and short term
energy than male speech signal.
References
[1] M. Saundade P. Kurle, “Speech Recognition using Digital Signal Processing”, International
Journal of Electronics, Communication & Soft Computing Science and Engineering, volume 2, issue
6, 2013.
[2] A. Palumbo, B. Calabrese, P. Vizza, N. Lombardo, A. Garozzo,M. Cannataro, F. Amato and P.
Veltri,” A Novel Portable Device for Laryngeal Pathologies Analysis and Classification”, Springer-
Verlag Berlin Heidelberg 2010.
[3] D. Jurafsky, J. H. Martin "Speech and Language Processing“, second Edition, Prentice Hall, 2008.
[4] J. Layer and P. Trudgill, “Phonetic and linguistic markers in speech”, Cambridge University Press,
Language in Society in 1972.
[5] S. K. Banchhor, “TEXT-DEPENDENT METHOD FOR GENDER IDENTIFICATION THROUGH
SYNTHESIS OF VOICED SEGMENTS”, International Journal of Engineering Science and Technology
(IJEST), Vol. 3 No. 6 June 2011.
[6] D. Kovačićb, E. Balaban,“Voice gender perception by cochlear implantees”, Journal of the
Acoustical Society of America, September 2009.
[7] “Short Term Time Domain Processing of Speech”, Retrieved 19 February 2023, from
vlab.amrita.edu/?sub=3&brch=164&sim=857&cnt=1.
Thanks