Speaker Recognition using MFCC and GMM

I've run the system using the following pipeline.
For training: speech data (NTIMIT) --> MFCC (feature extraction) --> GMM (modeling)
For testing: speech data (NTIMIT) --> MFCC (feature extraction) --> EM (scores)
The accuracy I am getting is 44% for 461 speakers. At least two sources (1. Reynolds, 2. Patra) confirm that such a system should give an accuracy of 60.8% for 630 speakers. I have tried many changes to the sampling frequency (mainly 8000 or 16000 Hz), the number of MFCC cepstral coefficients, the number of GMM mixtures and iterations, and the window size, and 44% was the best accuracy I could get.
I am using MFCC and GMM code that gave good results with TIMIT.
Any advice would be really appreciated.
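
For readers following the pipeline above: EM is normally the algorithm that fits each speaker's GMM during training, while the test step usually scores the MFCC frames of the test utterance against every speaker's trained GMM and picks the speaker with the highest log-likelihood. The following is only a minimal sketch of that scoring step in plain Python, assuming diagonal-covariance GMMs that have already been trained. The function names and the toy two-speaker models are hypothetical stand-ins, not the poster's code or the VOICEBOX API.

```python
import math

def frame_log_likelihood(x, weights, means, variances):
    """Log-likelihood of one feature vector under a diagonal-covariance GMM."""
    component_logs = []
    for w, mu, var in zip(weights, means, variances):
        log_p = math.log(w)
        for xi, mi, vi in zip(x, mu, var):
            log_p += -0.5 * (math.log(2.0 * math.pi * vi) + (xi - mi) ** 2 / vi)
        component_logs.append(log_p)
    # log-sum-exp over the mixture components for numerical stability
    peak = max(component_logs)
    return peak + math.log(sum(math.exp(c - peak) for c in component_logs))

def utterance_score(frames, gmm):
    """Average per-frame log-likelihood of an utterance under one speaker model."""
    weights, means, variances = gmm
    total = sum(frame_log_likelihood(x, weights, means, variances) for x in frames)
    return total / len(frames)

def identify(frames, speaker_models):
    """Return the name of the speaker model with the highest score."""
    return max(speaker_models,
               key=lambda name: utterance_score(frames, speaker_models[name]))

# Hypothetical toy models: (weights, means, variances), 2 mixtures, 2-D features.
speaker_models = {
    "spk_a": ([0.5, 0.5], [[0.0, 0.0], [1.0, 1.0]], [[1.0, 1.0], [1.0, 1.0]]),
    "spk_b": ([0.5, 0.5], [[5.0, 5.0], [6.0, 6.0]], [[1.0, 1.0], [1.0, 1.0]]),
}
test_frames = [[0.2, -0.1], [0.9, 1.1], [0.4, 0.6]]  # stand-in for MFCC frames
best_speaker = identify(test_frames, speaker_models)
```

In a real system the frames would be MFCC vectors and each model would come from EM training on that speaker's enrollment data. Averaging over frames keeps scores comparable between utterances of different length.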

Accepted Answer

mamdouh on 4 May 2011
I would suggest giving more attention to the training data and the preprocessing step. You could use the PCA algorithm for dimensionality reduction together with a class separability measure; I think it may help.
And if you could help me with the code for the MFCC and the GMM, I'd be thankful. Regards.
  2 Comments
Shaikha Hajri on 6 May 2011
PCA didn't work as expected: dimensionality reduction mostly helps execution time, and it degraded the accuracy!
You can get the code from VOICEBOX: http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html
VINAY on 16 Apr 2013
Hi, you can use the DTW algorithm for matching the speech files and for speaker recognition.
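
As a rough illustration of the DTW suggestion, here is a minimal sketch in plain Python. It assumes each utterance is a list of feature vectors (for example MFCC frames) and uses Euclidean distance between frames; the function name is hypothetical. DTW aligns two sequences in time, so it tends to suit cases where the test and reference utterances contain the same words.

```python
import math

def dtw_distance(seq_a, seq_b):
    """Accumulated cost of the best time alignment between two frame sequences."""
    n, m = len(seq_a), len(seq_b)
    inf = float("inf")
    # cost[i][j] = best accumulated cost aligning seq_a[:i] with seq_b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(seq_a[i - 1], seq_b[j - 1])  # Euclidean frame distance
            cost[i][j] = d + min(cost[i - 1][j],      # repeat frame of seq_b
                                 cost[i][j - 1],      # repeat frame of seq_a
                                 cost[i - 1][j - 1])  # advance both sequences
    return cost[n][m]
```

To use it for recognition, one would compare the test utterance against a stored reference for each speaker and choose the speaker with the smallest distance.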


More Answers (1)

Brian Hemmat on 20 Mar 2020
Audio Toolbox provides several examples for speaker recognition (both identification and verification).
