hedha houa

Uploaded by

khiarihiba7

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views

hedha houa

Uploaded by

khiarihiba7

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

2018 3rd International Conference for Convergence in Technology (I2CT)

The Gateway Hotel, XION Complex, Wakad Road, Pune, India. Apr 06-08, 2018

Speaker Recognition Techniques: A review

Satyam P. Todkar1, Snehal S. Babar2, Dr. J. R. Prasad

Rudrendra U. Ambike3, Prasad B. Suryakar4 Department of Computer Engineering
Department of Computer Engineering Sinhgad College of Engineering
Sinhgad College of Engineering Pune, India
Pune, India [email protected]
1
[email protected], [email protected],
3
[email protected], [email protected]

Abstract—Speaker Recognition is the process of recognizing case of text dependent speaker recognition system, the speaker
the speaker from the individual's speech biometrics. The voice utters the same phrase that was used on which the system was
characteristics of every speaker are different and thus can be trained. Whereas in case of text independent speaker
used to construct a model. This model is later used to recognize recognition system, the speaker is identified irrespective of the
an enrolled speaker from the list of available speakers. The paper
spoken phrase.
makes an effort to discuss different speaker modeling techniques
like Vector Quantization (VQ), Gaussian Mixture Model (GMM), The task of speaker identification s primarily composed of
Neural Networks (NN), etc. Also, different techniques for two modules: feature extraction and feature matching. Feature
extraction of voice characteristics like Mel Frequency Cepstral Extraction deals with finding the feature vector for an input
Coefficients (MFCC), Linear Predictive Coding (LPC) are speech and is related to dimensionality reduction. Features are
discussed. Further, an in-depth analysis of these surveyed the unique attributes that characterize different speakers and
techniques is made to identify their advantages and limitations. hence can be used to model templates for the speaker in the
The work in the field of Speaker Recognition Systems began in training phase. While in the testing phase first the feature
the 1950’s and is evolving since then, it has wide applications in extraction is performed and then these extracted features are
the fields of security, forensics, authentication etc.
matched to the speaker templates by the feature matching
Index Terms—Linear Predictive Coding(LPC); Mel Frequency module.
Cepstral Coefficient(MFCC); Formants Wavelet Entropy
(FWE); Vector Quantization(VQ); Hidden Markov
Model(HMM); Gaussian Mixture Model(GMM); Neural
Network(NN)

I. INTRODUCTION
Extensive work in the field of Speaker Recognition has
been done in the past two to three decades, however, the goal
of these Speaker Recognition Algorithms remain the same.
They are either aimed to identify the speaker from different
speakers available or verify a particular speaker. The voice of
every individual sounds different as they are attributed to II. TYPE STYLE AND FONTS
different features that create the voice, this may be- pitch,
Fig. 1. Voice Recognition Hierarchy
length of the vocal tract, sound frequency etc. The devised
algorithms may use a feature or a combination of features at
different stages to perform the task of recognition. The idea Fig. 1. Shows the hierarchical representation of a voice
behind this Automatic Speaker Recognition (ASR) system is to recognition system. It also presents the different modeling
create a machine that will extract, characterize and then techniques that are used to construct the speaker template.
identify the speaker by the inputted voice samples.
Speaker Recognition can be divided into two types: speaker
identification and speaker verification. Speaker Identification is
the process of identifying a particular speaker from the set of
enrolled speakers whereas the task of speaker verification deals
with validating the affirmed identity of the speaker. The Fig. 2. Block Diagram of Speaker Recognition System
difference between the two is the user explicitly states the
identity in the later. The task of speaker identification can Fig. 2. Represents the block diagram of a speaker recognition
further be divided into text dependent and text independent. In system. It represents the different phases that are involved in

the process to identify the speaker along with the algorithms in the final stage of MFCC forms the acoustic vector for every
that are used to implement them. speech utterance. Fig. 4. Represents the MFCC Block diagram.
The rest of the paper is organized as follows. Section II The MFCC algorithm consists of various phases [2].The
describes pre-processing step, section III describes various pre-emphasis stage is used to artificially boost the higher
feature extraction techniques, section IV describes different frequencies and hence increases the sound to noise ratio.
feature matching techniques, section V deals with modern Framing divides the input voice signal into frames of equal
approaches that are used in the recognition field. sizes, this is done because voice is a stationary signal only for a
small duration. The size of the frame must be optimal since it
II. PRE PROCESSING may affect the time and the frequency resolution. The window
Pre-processing is one of the most important steps in the function is applied in order to remove the discontinuities at the
process to recognize a speaker [1]. The speech of the speaker is frame boundaries which will eliminate the undesirable effect in
nonstationary one and consists of different components that frequency response. The Fast Fourier Transform is applied in
may or may not be useful in the process to identify the speaker. order to convert from the time domain to the frequency
Performing this step removes the unwanted components like domain. It applies DFT algorithm in a speedy manner. The
silenced and unvoiced regions from the voiced regions of the next stage is the Mel-frequency wrapping which is simulated
speech. This reduces the time complexity and the processing by the use of a mel frequency filter bank, which has a
power. triangular band pass frequency response. The Mel-frequency
scale has a linear frequency spacing below 1000Hz and has
III. FEATURE EXTRACTION TECHNIQUES logarithmic spacing for frequencies above 1000Hz. If f is the
frequency of a signal in Hz then Mel (f) can be given by the
A. Linear Predictive Coding (LPC) formula:
LPC is one of the earliest discoveries that is simple and a
popular technique to derive feature vectors. It is able to analyze Mel (f) = 2595*log10 (1+f/700)
speech and can encode good quality speech at a low bit rate. It
is widely used from standard telephony to military The final step is called as Discrete Cosine Transform (DCT).
communication. LPC consists of a predictor that predicts the It is used to convert the log Mel spectrum in the time domain.
current output as a linear combination of previous output. The result hence obtained is called as Mel Frequency Cepstrum
Coefficient and the set of such coefficients forms the acoustic
vector.

Fig. 4. MFCC Block Diagram

Fig. 3. LPC Block Diagram
The user’s speech is taken as an input to the pre-emphasis
stage [1]. This stage acts as an input to the frame blocking C. Formants Wavelet Entropy (FWE)
stage. The frame blocking stage is used block the signal into N
frames. The windowing function is applied to remove the
discontinuities at the frame boundary. The autocorrelated step
is the next step wherein autocorrelation value is calculated for
every windowed frame and then the highest autocorrelation
value is found out. This gives the order of LPC analysis and
then LPC coefficients are derived. The Fig. 3 represents the
LPC Block diagram and the various steps involved in it.

B. Mel Frequency Cepstrum Coefficients (MFCC)

MFCC is the most widely used algorithm for speaker
recognition. It is resilient to a noisy environment and hence can Fig. 5. FWE Block Diagram
recognize the speaker efficiently. It is more effective than the
previously discussed LPC algorithm. The coefficients obtained

2
FWE is a novel approach in the field of speaker recognition speech recognition, bioinformatics and pattern recognition
system. It works by calculating the formants and wavelet problems.
entropy of the filtered input speech. FWE is more efficient as
compared to the MFCC since it has a fixed number of feature
vectors and only these twelve extracted feature coefficients are C. Gaussian Mixture Model (GMM)
used to model the speaker voice template [3]. FWE works on Gaussian Mixture Model is one of the most successful
partially recorded voice samples and is majorly used in speaker classification techniques, this is due to the fact that
forensics. FWE can work for both vowel dependent and Gaussian mixture probability density function is used. GMM is
independent input speech, however, the speaker recognition close to the natural modeling techniques and hence can be used
efficiency is better for the vowel dependent approach. The to model a scenario comprising of higher dimensions [5].
FWE block diagram is shown in the Fig. 5. The speech attributes can readily be Gaussian distributed
FWE has two stages: recording and filtering the speech and hence GMM can be used effectively. A Gaussian Mixture
signals and extracting features. Firstly the input speech is density is a weighted sum of M component densities where M
recorded and then passed to the filter bank. The filter bank is represents the number of Gaussians. Each speaker can be
used to filter the unwanted signals from the speech. The feature effectively modeled and represented by a GMM and is referred
extraction is further divided into two parts for calculating by a model associated with him/her. This model is represented
formants and entropies. Formants represent the acoustic by using λ.
resonance of the speaker’s vocal tract. The Power Spectrum D. VQ & GMM: A Hybrid Approach
Density (PSD) is used to calculate the formants by finding first
five formants as they are easily distinguishable for every The VQ model discussed previously is one of the efficient
speaker. Then the entropies are calculated by using Wavelet and easy approaches to identify the speaker. However with an
Packets (WP). It calculates Shannon entropy for the seven increase in the number of code words the time complexity of
nodes of the wavelet packet, thus enhancing the recognition the algorithm increases along with a decrease in accuracy. One
rate. such modification to the VQ is combining it with the more
sophisticated techniques like GMM [6]. Here, in the process of
IV. FEATURE MATCHING TECHNIQUES recognizing the speaker both the techniques will recognize the
The feature matching algorithms are used in both the speaker by themselves. If both the techniques recognize the
training and the testing phase. In the training phase, the system same speaker then the speaker is readily recognized, however,
is trained by using the extracted feature vectors to construct a if there is a disagreement then the relative index is calculated
speaker model. Whereas in testing phase this model is and the confidence ratio is found. This ratio is used to identify
validated by the system by recognizing speakers or voice the true speaker or may be helpful to detect the outliers.
samples that were not used in the training phase.
A. Vector Quantization (VQ) E. GMM & Pitch Detection Algorithm
VQ is one of the most popular and easy to use feature The sophisticated, and unsupervised algorithm GMM is
matching algorithm. It works by using the extracted feature efficient in itself, however with the advancements in
vectors to construct a model [2]. VQ is a type of unsupervised technology efforts are made to minimize the time complexity
learning algorithm which creates clusters, they represent the of the speaker recognition task. The pitch of a female voice is
models of the enrolled speakers. higher as compared to her male counterpart. GMM is coupled
Initially, the feature vectors are obtained and then they are with Pitch Detection Algorithm (PDA) where the gender of the
classified into different clusters. Feature vectors belonging to speaker is identified by using the pitch [7].
the same cluster have similar properties and model the The pre-processing is an important step in the process of
attributes of the speaker. When next time a feature vector is speaker recognition as it improves the performance of the
obtained, it is compared with all of the existing clusters system. Pre-processing includes: down-sampling which is used
centroids by calculating the Euclidean Distance. The feature to reduce the sampling the rate of a signal; pre-emphasis stage
vector is assigned to the cluster with the minimum distance & that decreases the amplitude of low frequency bands whereas
the centroid of the cluster is updated. The centroids are also increases the amplitude of high frequency bands; The human
termed as code words and the collection of such code words is speech is not continuous in nature and may contain some parts
called a codebook. where there is no speech utterance, elimination of such parts
will increase the speed of identification as the number of
frames are greatly reduced. This is performed by the silence
B. Hidden Markov Model (HMM) removal stage.
HMM is a better and efficient feature matching algorithm The PDA makes use of autocorrelation method waveforms
as compared to the traditional VQ model. HMM is able to where autocorrelation is a function that is a correlation of a
model the statistical variations of the features, give a statistical waveform with itself. The PDA estimates the pitch of an
representation in a way speaker produces the sound [4]. The irregular periodic signal and hence reduces the time complexity
applications of HMM are vivid in nature viz. signal processing, by reducing the number of comparisons to half.

3
F. Neural Networks (NN)
Neural Network is basically an information processing Sr. Techniques Remarks
system. It consists of processing elements which are highly No
interconnected with each other. It is actually used to solve Accuracy gets reduced by the
problems of pattern recognition through the process of various speaker and transmission
learning. It has approaches like feed-forward neural network related effects and it also does
(FFNN) [3] and probabilistic neural network (PNN) [4]. not generalizes well.
Feed-forward neural networks are one of the earliest neural Signifies vocal tract features
networks and very simple to implement. Basically, it consists
of three layers which are input layer, hidden layer (if any), Uses probabilistic model
3. GMM
output layer. The flow of information is unidirectional i.e.
forward from the input layer to the hidden layer and to the Inefficient to handle high
output layer. There is no formation of a cycle or closed loops in dimensional data.
the nodes of the feed-forward network. Used for text dependent
Probabilistic Neural Network is an unsupervised feed- VQ &
forward network. PNN composed of four different layers Uses relative index as
GMM: A
which are: input, pattern, summation, and output. Statistical 4. confidence measures.
Hybrid
algorithms can be implemented with the help of PNN. Approach
A Gaussian function can be used as a probabilistic function Increases complexity of the
for each pattern node. According to the input patterns, the system.
network weights get updated. Then nearest neighborhood 50% reduction in time
function is used to classify the patterns. The following tables I processing.
and II compare the different feature extraction and matching GMM &
techniques. Achieves improved recognition
Pitch
5. rate.
Detection
TABLE I. COMPARISON OF FEATURE EXTRACTION TECHNIQUES Algorithm
Gives erroneous results in case if
a male has a high pitch or a
Sr. Techniques Remarks female has a low pitch.
No Requires less statistical training.
Useful in synthesis of Speech
1. LPC 6. NN Convergence speed is slow, less
Loss of compression information generalizing performance,
Immune to Noise problems of over-fitting.
2. MFCC V. MODERN APPROACHES TO SPEAKER RECOGNITION
Provides not so good correlation
and smooth transition
Has a fixed number of feature A. Denoising
vector coefficients The speaker recognition systems that were proposed made
3. FWE an effort to recognize the speaker with a high degree of
Provides more accuracy in vowel accuracy, however, different factors like noise, the inefficiency
dependent speaker recognition of the recording device and other environmental factors possess
a challenge on these speaker recognition systems.
TABLE II. COMPARISON OF FEATURE MATCHING TECHNIQUE The efficiency of such speaker recognition systems can be
increased if they are trained on pure voice samples. This pure
voice samples can be achieved by subtracting the pure noise
Sr. Techniques Remarks
from the distorted voice signal- distortion occurs due to stray
No
Used for text dependent pickups, etc. This task is implemented by a denoiser [8], which
Clustering technique and approximately finds out the pure speech without noise.
formation of Codebook for every B. Wavelet Cepstral Coefficient (WCC)
1. VQ speaker The MFCC algorithm discussed so far is immune to noise,
however, the Fourier transform used in the MFCC is only
Loss of temporal information restricted in time domain whereas the Wavelet transform is
causing system inaccuracy. restricted in both time and frequency domain [9]. WCC is
Temporal Information is well robust and can be used in the noisy environment along with
2. HMM modeled. fuzzy logic systems to increase the speaker recognition
accuracy.

4
REFERENCES [5] Bagul, S. G., & Shastri, R. K. (2013, August). “Text independent
speaker recognition system using gmm”. In Human Computer
Interactions (ICHCI), 2013 International Conference on (pp. 1-5).
[1] Subhashini, P. P. S., & Pratap, T. “TEXT-INDEPENDENT SPEAKER IEEE.
RECOGNITION USING COMBINED LPC AND MFC [6] Desai, D., & Joshi, M. (2014). “Speaker recognition using MFCC and
COEFFICIENTS”. International Journal of Research in Engineering hybrid model of VQ and GMM”. In Recent Advances in Intelligent
and Technology, 2014. Informatics (pp. 53-63). Springer International Publishing.
[2] Martinez, J., Perez, H., Escamilla, E., & Suzuki, M. M. (2012, [7] AboElenein, N. M., Amin, K. M., Ibrahim, M., & Hadhoud, M. M.
February). “Speaker recognition using Mel frequency Cepstral (2016, May). “Improved text-independent speaker identification system
Coefficients (MFCC) and Vector quantization (VQ) techniques”. In for real time applications”. In Electronics, Communications and
Electrical Communications and Computers (CONIELECOMP), 2012 Computers (JEC-ECC), 2016 Fourth International Japan-Egypt
22nd International Conference on (pp. 248-251). IEEE. Conference on (pp. 58-62). IEEE.
[3] Daqrouq, K., & Tutunji, T. A. (2015). “Speaker identification using [8] Tkachenko, M., Yamshinin, A., Lyubimov, N., Kotov, M., &
vowels features through a combined method of formants, wavelets, and Nastasenko, M. (2017, September). “Speech Enhancement for Speaker
neural network classifiers”. Applied Soft Computing, 27, 231-239. Recognition Using Deep Recurrent Neural Networks”. In International
[4] Ahmad, K. S., Thosar, A. S., Nirmal, J. H., & Pande, V. S. (2015, Conference on Speech and Computer (pp. 690-699). Springer, Cham.
January). “A unique approach in text independent speaker recognition [9] Rathor, S., & Jadon, R. S. (2017, July). “Text independent speaker
using MFCC feature sets and probabilistic neural network”. In Advances recognition using wavelet cepstral coefficient and butter worth filter”.
in Pattern Recognition (ICAPR), 2015 Eighth International Conference In 2017 8th International Conference on Computing, Communication
on (pp. 1-6). IEEE. and Networking Technologies (ICCCNT) (pp. 1-5). IEEE.

Simulation of Digital Communication Systems Using Matlab
From Everand
Simulation of Digital Communication Systems Using Matlab
Mathuranathan Viswanathan
3.5/5 (22)
Azure Data Factory
No ratings yet
Azure Data Factory
4 pages
EEL6586 Final Project:: A Speaker Identification and Verification System
No ratings yet
EEL6586 Final Project:: A Speaker Identification and Verification System
16 pages
Maretext Independent Speaker Identification Based On K-Mean Algorithm
No ratings yet
Maretext Independent Speaker Identification Based On K-Mean Algorithm
9 pages
Speaker Recognition System Using MFCC and Vector Quantization
No ratings yet
Speaker Recognition System Using MFCC and Vector Quantization
7 pages
Digital Signal Processing "Speech Recognition": Paper Presentation On
No ratings yet
Digital Signal Processing "Speech Recognition": Paper Presentation On
12 pages
Abstract:: Text-Independent and Dependent Methods. in A Text
No ratings yet
Abstract:: Text-Independent and Dependent Methods. in A Text
11 pages
Digital Signal Processing: The Final
No ratings yet
Digital Signal Processing: The Final
13 pages
Voice Recognition
100% (1)
Voice Recognition
18 pages
DC Motor Control
No ratings yet
DC Motor Control
2 pages
MFCC and Vector Quantization For Arabic Fricatives2012
No ratings yet
MFCC and Vector Quantization For Arabic Fricatives2012
6 pages
Speaker Recognition
100% (1)
Speaker Recognition
15 pages
Speaker Recognition Using Matlab
No ratings yet
Speaker Recognition Using Matlab
14 pages
Voice Activation Using Speaker Recognition For Controlling Humanoid Robot
No ratings yet
Voice Activation Using Speaker Recognition For Controlling Humanoid Robot
6 pages
Feature Extraction Methods LPC, PLP and MFCC in Speech Recognition
No ratings yet
Feature Extraction Methods LPC, PLP and MFCC in Speech Recognition
5 pages
Feature Extraction Methods LPC, PLP and MFCC in Speech Recognition
No ratings yet
Feature Extraction Methods LPC, PLP and MFCC in Speech Recognition
5 pages
pxc3872774 PDF
No ratings yet
pxc3872774 PDF
7 pages
An Approach To Extract Feature Using MFC
No ratings yet
An Approach To Extract Feature Using MFC
5 pages
Advanced Signal Processing Using Matlab
No ratings yet
Advanced Signal Processing Using Matlab
20 pages
DWT and Mfccs Based Feature Extraction Methods For Isolated Word Recognition
No ratings yet
DWT and Mfccs Based Feature Extraction Methods For Isolated Word Recognition
6 pages
Speech Recognition Using MFCC: September 2015
No ratings yet
Speech Recognition Using MFCC: September 2015
5 pages
Speaker Recognition
No ratings yet
Speaker Recognition
11 pages
MFCC Step
100% (1)
MFCC Step
5 pages
The Development Process and Current State of The Speech Recognition Technology
No ratings yet
The Development Process and Current State of The Speech Recognition Technology
8 pages
Feature Extraction Methods LPC, PLP and MFCC
100% (1)
Feature Extraction Methods LPC, PLP and MFCC
5 pages
Automatic Speech Recognition Using Cepstral and Itakura-Saito Distances For Vocal Command
No ratings yet
Automatic Speech Recognition Using Cepstral and Itakura-Saito Distances For Vocal Command
5 pages
Speaker Recognition Publish
No ratings yet
Speaker Recognition Publish
6 pages
Reconocimiento de Voz - MATLAB
No ratings yet
Reconocimiento de Voz - MATLAB
5 pages
Ma Kale
No ratings yet
Ma Kale
3 pages
Speech Processing Unit 4 Notes
No ratings yet
Speech Processing Unit 4 Notes
16 pages
Acoustic Parameters For Speaker Verification
No ratings yet
Acoustic Parameters For Speaker Verification
16 pages
Mel-Frequency Cepstrum Coefficients (MFCC) Melalui Jaringan Syaraf
No ratings yet
Mel-Frequency Cepstrum Coefficients (MFCC) Melalui Jaringan Syaraf
7 pages
Speaker Recognition Using Vocal Tract Features
No ratings yet
Speaker Recognition Using Vocal Tract Features
5 pages
Methodology For Speaker Identification and Recognition System
100% (1)
Methodology For Speaker Identification and Recognition System
13 pages
Comp Sci - Speech Recognition - Sandeep Kaur
No ratings yet
Comp Sci - Speech Recognition - Sandeep Kaur
6 pages
Voice Recognition Using MFCC Algorithm
No ratings yet
Voice Recognition Using MFCC Algorithm
4 pages
Isolated Digit Recognition System
100% (1)
Isolated Digit Recognition System
3 pages
Performance Evaluation of MLP For Speech Recognition in Noisy Environments Using MFCC & Wavelets
No ratings yet
Performance Evaluation of MLP For Speech Recognition in Noisy Environments Using MFCC & Wavelets
5 pages
(IJCST-V10I3P32) :rizwan K Rahim, Tharikh Bin Siyad, Muhammed Ameen M.A, Muhammed Salim K.T, Selin M
No ratings yet
(IJCST-V10I3P32) :rizwan K Rahim, Tharikh Bin Siyad, Muhammed Ameen M.A, Muhammed Salim K.T, Selin M
6 pages
Speaker Identification Using Mel Frequency Cepstral Coefficients
No ratings yet
Speaker Identification Using Mel Frequency Cepstral Coefficients
5 pages
2_CNN based speaker recognition in language and text independent small scale system
No ratings yet
2_CNN based speaker recognition in language and text independent small scale system
4 pages
Speaker Recognition System - v1
No ratings yet
Speaker Recognition System - v1
7 pages
JAWS (Screen Reader)
No ratings yet
JAWS (Screen Reader)
18 pages
Utterance Based Speaker Identification
No ratings yet
Utterance Based Speaker Identification
14 pages
Speech Recognition Using Discrete Hidden Markov Model: Department of ECE, Saveetha Engineering College, Chennai, India
No ratings yet
Speech Recognition Using Discrete Hidden Markov Model: Department of ECE, Saveetha Engineering College, Chennai, India
6 pages
Monalisha_barik_paper
No ratings yet
Monalisha_barik_paper
5 pages
Voice Recognition
No ratings yet
Voice Recognition
6 pages
Performance Improvement of Speaker Recognition System
No ratings yet
Performance Improvement of Speaker Recognition System
6 pages
Isolated Word Recognition Using LPC & Vector Quantization: M. K. Linga Murthy, G.L.N. Murthy
No ratings yet
Isolated Word Recognition Using LPC & Vector Quantization: M. K. Linga Murthy, G.L.N. Murthy
4 pages
134 Rashid Bicet2021
No ratings yet
134 Rashid Bicet2021
9 pages
Speech Feature Extraction and Classification Techniques: Kamakshi and Sumanlata Gautam
No ratings yet
Speech Feature Extraction and Classification Techniques: Kamakshi and Sumanlata Gautam
3 pages
Spoken Language Identification Using Hybrid Feature Extraction Methods
No ratings yet
Spoken Language Identification Using Hybrid Feature Extraction Methods
5 pages
Voice Recognition With Neural Networks, Type-2 Fuzzy Logic and Genetic Algorithms
No ratings yet
Voice Recognition With Neural Networks, Type-2 Fuzzy Logic and Genetic Algorithms
8 pages
8834 PDF
No ratings yet
8834 PDF
8 pages
2 Text Independent Voice Based Students Attendance System Under Noisy Environment Using RASTA-MFCC Feature
No ratings yet
2 Text Independent Voice Based Students Attendance System Under Noisy Environment Using RASTA-MFCC Feature
6 pages
Speaker Recognition Using MFCC and VQ
No ratings yet
Speaker Recognition Using MFCC and VQ
2 pages
AJSAT Vol.5 No.2 July Dece 2016 pp.23 30
No ratings yet
AJSAT Vol.5 No.2 July Dece 2016 pp.23 30
8 pages
Voice Analysis Using Short Time Fourier Transform and Cross Correlation Methods
No ratings yet
Voice Analysis Using Short Time Fourier Transform and Cross Correlation Methods
6 pages
Some Case Studies on Signal, Audio and Image Processing Using Matlab
From Everand
Some Case Studies on Signal, Audio and Image Processing Using Matlab
Dr. Hedaya Mahmood Alasooly
No ratings yet
Error-Correction on Non-Standard Communication Channels
From Everand
Error-Correction on Non-Standard Communication Channels
Edward A. Ratzer
No ratings yet
Signal, Audio and Image Processing
From Everand
Signal, Audio and Image Processing
Dr. Hidaia Mahmood Alassouli
No ratings yet
Linkers and Loaders
No ratings yet
Linkers and Loaders
4 pages
Sangeetha S FlowCV Resume 20231130
No ratings yet
Sangeetha S FlowCV Resume 20231130
2 pages
1 Kongo Gumi Mix and Match Braid
No ratings yet
1 Kongo Gumi Mix and Match Braid
4 pages
Currency Converter Application
No ratings yet
Currency Converter Application
11 pages
C1SE.19 Proposal SRS v2.0
No ratings yet
C1SE.19 Proposal SRS v2.0
19 pages
Heart Disease Prediction Using ML
No ratings yet
Heart Disease Prediction Using ML
18 pages
VirtualAccountSystemV03
No ratings yet
VirtualAccountSystemV03
23 pages
Final Proejct
No ratings yet
Final Proejct
55 pages
Hold Harmless and Indemnity Agreement
100% (1)
Hold Harmless and Indemnity Agreement
9 pages
Recording Calls Study
No ratings yet
Recording Calls Study
97 pages
Tutorialquestion Bank: Course Title Operations Research Course Code AME021 Class Semester Year Team of Instructors
No ratings yet
Tutorialquestion Bank: Course Title Operations Research Course Code AME021 Class Semester Year Team of Instructors
17 pages
Onboarding Checklist
No ratings yet
Onboarding Checklist
78 pages
Grade 7 Information Technology Notes
No ratings yet
Grade 7 Information Technology Notes
3 pages
Dbms Assignment Indexes by Shivanshu Mishra
No ratings yet
Dbms Assignment Indexes by Shivanshu Mishra
5 pages
Medical Statistics With R
No ratings yet
Medical Statistics With R
85 pages
F30926498_S230952590_DSR109233240
No ratings yet
F30926498_S230952590_DSR109233240
7 pages
Final CN Handbook Revised
No ratings yet
Final CN Handbook Revised
192 pages
Ivi 1853 Endpoint Security For Endpoint Manager
No ratings yet
Ivi 1853 Endpoint Security For Endpoint Manager
2 pages
Python Notes
No ratings yet
Python Notes
9 pages
Guide to Agentic AI Multi Agent Pattern 1741332267
No ratings yet
Guide to Agentic AI Multi Agent Pattern 1741332267
11 pages
Campus Mobile Navigation System Based On
No ratings yet
Campus Mobile Navigation System Based On
2 pages
Krishna Teja
No ratings yet
Krishna Teja
6 pages
Completing The Square
100% (1)
Completing The Square
17 pages
Oracle Forms and Reports Training Course Highlights
No ratings yet
Oracle Forms and Reports Training Course Highlights
4 pages
22428-2023-Winter-question-paper[Msbte study resources]
No ratings yet
22428-2023-Winter-question-paper[Msbte study resources]
4 pages
Closing A Practice Guide
No ratings yet
Closing A Practice Guide
9 pages
Project 1914198
No ratings yet
Project 1914198
30 pages
ReleaseNotes 4.07.13.2243
No ratings yet
ReleaseNotes 4.07.13.2243
5 pages
MU Online Exam Guidelines-July'24 Session
No ratings yet
MU Online Exam Guidelines-July'24 Session
14 pages

hedha houa

Uploaded by

hedha houa

Uploaded by

2018 3rd International Conference for Convergence in Technology (I2CT)

Speaker Recognition Techniques: A review

Satyam P. Todkar1, Snehal S. Babar2, Dr. J. R. Prasad

978-1-5386-4273-3/18/$31.00 ©2018 IEEE 1

Fig. 4. MFCC Block Diagram

B. Mel Frequency Cepstrum Coefficients (MFCC)

You might also like