Speaker Identification using GFCC with PITCH & ZCR

Krithish Goli, Vaibhav Jain, J.V. Vidhya

Speaker Identification using GFCC with PITCH & ZCR

Krithish Goli, Vaibhav Jain, J.V. Vidhya

Abstract

Speaker Recognition as a biometric technique used for audio classification The sound generated by a person is said to be altogether exceptional and relies upon larynx or the voice box. Mel Frequency Cepstral Coefficients are said to be less superior and more robust to noise than the less commonly used Gammtone Frequency Cepstral Coefficients. Adaptive whitening noise filtering is used over the given audio wave and calculating gammatone frequency cepstral coefficients features with addition of Pitch and Zero Crossing Rate are used as an input. Decision making models such as Neural network, Support Vector Machine ,XG Boost algorithm and K-means clustering are used for the purpose of classification of speakers and a correlation is made for the equivalent.

Requires Subscription PDF

Published

2020-04-21

How to Cite

Krithish Goli, Vaibhav Jain, J.V. Vidhya. (2020). Speaker Identification using GFCC with PITCH & ZCR. International Journal of Advanced Science and Technology, 29(06), 26 - 33. Retrieved from http://sersc.org/journals/index.php/IJAST/article/view/11290

Download Citation

Issue

Vol. 29 No. 06 (2020): Vol. 29 No. 06 (2020)

Section

Articles