Speaker Recognition System Based On MFCC and DCT
Garima Vyas1, Barkha Kumari2
1Garima Vyas, Department of ECE, Amity University, Noida, India.
2Barkha Kumari, Department of ECE, Amity University, Noida, India.
Manuscript received on May 16, 2013. | Revised Manuscript received on June 13, 2013. | Manuscript published on June 30, 2013. | PP: 167-169 | Volume-2, Issue-5, June 2013. | Retrieval Number: E1736062513/2013©BEIESP
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: This paper presents an approach to speech signal recognition that uses spectral information on the mel frequency scale, a dominant feature for speech recognition. Mel-frequency cepstral coefficients (MFCCs) collectively represent the short-term power spectrum of a sound, based on a linear cosine transform of a log power spectrum on a nonlinear mel scale of frequency. The performance of MFCCs is affected by the number of filters, the shape of the filters, the way the filters are spaced, and the way the power spectrum is warped. In this paper, the optimum values of these parameters are chosen to achieve an efficiency of 99.5% on very short audio files.
Keywords: Speech recognition, Feature extraction, Feature Matching, DCT, MFCCs.
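
The MFCC computation summarized in the abstract (short-term power spectrum, mel-spaced filters, logarithm, cosine transform) can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names, the triangular filter shape, the common mel formula 2595·log10(1 + f/700), and every parameter value (16 kHz sampling, 400-sample frames, 26 filters, 13 coefficients) are assumptions chosen for illustration, not the optimum values reported in the paper.

```python
import numpy as np

def hz_to_mel(f):
    # Common mel-scale formula (assumed; the paper may use another warping).
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    # Triangular filters whose edges are equally spaced on the mel scale.
    mel_pts = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_pts) / sample_rate).astype(int)
    fbank = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, center, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, center):      # rising edge
            fbank[i - 1, k] = (k - left) / (center - left)
        for k in range(center, right):     # falling edge
            fbank[i - 1, k] = (right - k) / (right - center)
    return fbank

def dct_ii(x, n_coeffs):
    # Unnormalized DCT-II along the last axis, keeping the first n_coeffs terms.
    n = x.shape[-1]
    k = np.arange(n_coeffs)[:, None]
    m = np.arange(n)[None, :]
    basis = np.cos(np.pi * k * (2 * m + 1) / (2 * n))
    return x @ basis.T

def mfcc(signal, sample_rate=16000, frame_len=400, hop=160,
         n_fft=512, n_filters=26, n_coeffs=13):
    # All default values are illustrative assumptions.
    # The signal must contain at least frame_len samples.
    signal = np.asarray(signal, dtype=float)
    n_frames = 1 + (len(signal) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hamming(frame_len)           # windowed short-term frames
    power = np.abs(np.fft.rfft(frames, n=n_fft, axis=1)) ** 2 / n_fft
    energies = power @ mel_filterbank(n_filters, n_fft, sample_rate).T
    log_energies = np.log(energies + 1e-10)                # small offset avoids log(0)
    return dct_ii(log_energies, n_coeffs)                  # one row of MFCCs per frame

# Usage: MFCCs of one second of a 440 Hz tone.
t = np.arange(16000) / 16000.0
features = mfcc(np.sin(2 * np.pi * 440.0 * t))
```

In this sketch, `n_filters`, the triangular shape built in `mel_filterbank`, the mel spacing of the filter edges, and the warping in `hz_to_mel` correspond to the four factors the abstract names as affecting MFCC performance, so each can be varied independently when experimenting.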