Text Dependent Speaker Identification And Intruder Detection System
Abstract
This research paper will present an automatic Speaker Identification System using MFCC (Mel-Frequency Cepstral Coefficients) and BPNN (Back Propagation Neural Network). The objective of this work is to classify 20 speakers’ pattern and to identify each registered speaker correctly while testing with new input speech without any false identification. MFCC is used for the extraction of speech features from each speaker and BPNN is used for identification of the test speaker. The developed classifier model is tested with both registered and unregistered speakers and found that it successfully identifies all the registered speakers correctly and reject the intruder speakers. Scaled conjugate gradient training function is used for training the BPNN. A speech database consisting of 20 speakers is created from a group of 10 male and 10 female speakers with the same sentence spoken twice. The classification accuracy rate obtained from the classification is 92.1% and the correct identification rate obtained is 100%. Matlab simulation tool is used in this work
References
Bhargab Medhi1, Prof. P.H.Talukdar2 “Assamese Speaker Recognition Using Artificial Neural Network” International Journal of Advanced Research in Computer and Communication Engineering Vol. 4, Issue 3, pp 321-324, March 2015.
E.C. Gordon,Signal and Linear System Analysis.John Wiley & Sons Ltd., New York, USA,1998.
J Sirisha Devi “Language and Text Independent Speaker Recognition System using Artificial Neural Networks and Fuzzy Logic” International Journal of Recent Technology and Engineering (IJRTE), Volume-7, Issue-6, pp 327-330, March 2019.
Jorge MARTINEZ*, Hector PEREZ, Enrique ESCAMILLA, Masahisa Mabo SUZUKI, “Speaker recognition using Mel Frequency Cepstral Coefficients (MFCC) and Vector Quantization (VQ) Techniques” CONIELECOMP 2012, 22nd International Conference on Electrical Communications and Computers, 27-29 Feb. 2012 pages: 248 - 251,IEEE Conference Publications
Khan Suhail Ahmad1, Anil S. Thosar2, Jagannath H. Nirmal3 and Vinay S. Pande4 “A Unique Approach in Text Independent Speaker Recognition using MFCC Feature Sets and Probabilistic Neural Network” 2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR) Year: 5-7 June2015 Pages: 1 - 6, IEEE Conference Publications.
Koustav Chakraborty, AsmitaTalele Prof.SavithaUpadhya, “Voice Recognition Using MFCC Algorithm” International Journal of Innovative Research in Advanced Engineering (IJIRAE) Volume 1 Issue 10 (November 2014), page 158-161.
Md. Ali Hossain1, Md. Mijanur Rahman2, Uzzal Kumar Prodhan3, Md. Farukuzzaman Khan4 “Implementation Of Back-Propagation Neural Network For Isolated Bangla Speech Recognition” International Journal of Information Sciences and Techniques (IJIST) Vol.3, No.4, pp 1-9 July 2013.
N.AYSHWARYA1,G.LOGESHWARI2,G.S.ANANDHA MALA3 “FEED FORWARD BACK PROPAGATION NEURAL NETWORK FOR SPEAKER INDEPENDENT SPEECH RECOGNITION”, International Journal of Industrial Electronics and Electrical Engineering, ISSN: 2347-6982 Volume-2, Issue-8, pp 36-39, Aug.-2014.
Neha Chauhan “Speaker recognition using pattern recognition neural network and feedforward neural network, International Journal of Scientific & Engineering Research, Volume 8, Issue 3, pp 1444-1446, March-2017.
Roger Achkar*, Mustafa El-Halabi*, ElieBassil*, Rayan Fakhro*, Marny Khalil* “Voice Identity Finder Using the Back Propagation Algorithm of an Artificial Neural Network” Complex Adaptive Systems, Publication 6, Conference Organized by Missouri University of Science and Technology 2016 - Los Angeles, CA.
S. S.Wali, S.M. Hatture, S. Nandyal “MFCC Based Text-Dependent Speaker Identification Using BPNN” International Journal of Signal Processing System VOL.3, No.1, pp 30-34, June 2015.
Yi Wang, Dr. Bob Lawlor, “Speaker Recognition Based on MFCC and BP Neural Networks” 28th Irish Signals And systems Conference Year: 20-21 June 2017, Pages: 1 - 4, IEEE Conference publication.
Sajeer, K., & Rodrigues, P.. “Speaker Recognition System Based on Wavelet Features and Gaussian Mixture Models”. International Journal of Engineering and Advanced Technology, 9(1), 5363-5367, 2019.
Singh, P. P., & Rani, P. “ An approach to extract feature using mfcc” IOSR Journal of Engineering, 4(8), 21-25. 2014
Matsui, T., & Tanabe, K. “Comparative study of speaker identification methods: dPLRM, SVM and GMM. IEICE” transactions on information and systems, 89(3), 1066-1073. (2006).
Doddington, G. R. “Speaker recognition -Identifying people by their voices” Proceedings of the IEEE, 73(11), 1651-1664. Fang, (1985).
Fang, C.. From dynamic time warping (DTW) to hidden markov model (HMM). University of Cincinnati, 3, 19. (2009).
Rakesh, D. R., & Jayasimhan, N. K. “Speaker recognition and authentication” International Journal of Computer Science and Mobile Computing, 2(5), 402-407. 2013.
Maesa, A., Garzia, F., Scarpiniti, M., & Cusani, R. “ Text independent automatic speaker recognition system using mel-frequency cepstrum coefficient and gaussian mixture models” Journal of Information Security, 3(04), 335-340. 2012
Reynolds, D. A. (2002, May). An overview of automatic speaker recognition technology. In 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing (Vol. 4, pp. IV-4072). IEEE