CNN WITH DATA AUGMENTATION FOR VOICE PATHALOGY IDENTIFICATION

C.S.Kanimozhiselvi, T.Sathiyawathi

Abstract

Deep learning techniques play an important role in speech analysis, and their performance has improved greatly in the past few years. In this work, voice disorders are classified using deep convolutional neural networks. Deep learning based techniques are well suited to identifying voice disorders, but they typically require large training datasets. Because real-world speech data is limited, speech augmentation techniques are employed to increase the amount of training data. Audio augmentation proves useful for training the neural network and making effective predictions. It also helps avoid overfitting and improves the robustness of the model. With audio augmentation, the classification accuracy of the model improves from 87% to 98%.
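The abstract does not say which augmentation techniques were used. As a rough illustration of how a small set of recordings can be expanded, the sketch below applies three common waveform perturbations (additive noise, circular time shift, gain scaling). The function names, the 20 dB signal-to-noise ratio, the gain range, and the synthetic tone are all assumptions made for this example and are not taken from the paper.

```python
import math
import random


def add_noise(signal, snr_db, rng):
    """Add white Gaussian noise at the requested signal-to-noise ratio (dB)."""
    power = sum(x * x for x in signal) / len(signal)
    noise_std = math.sqrt(power / (10 ** (snr_db / 10)))
    return [x + rng.gauss(0.0, noise_std) for x in signal]


def time_shift(signal, shift):
    """Circularly shift the waveform by `shift` samples."""
    shift %= len(signal)
    return signal[-shift:] + signal[:-shift] if shift else list(signal)


def scale_gain(signal, gain):
    """Multiply every sample by a constant gain factor."""
    return [gain * x for x in signal]


def augment(signal, rng):
    """Return the original waveform plus three perturbed copies."""
    return [
        list(signal),
        add_noise(signal, snr_db=20.0, rng=rng),            # assumed SNR
        time_shift(signal, shift=rng.randrange(1, len(signal))),
        scale_gain(signal, gain=rng.uniform(0.8, 1.2)),     # assumed gain range
    ]


# A toy 440 Hz tone sampled at 8 kHz stands in for a voice recording.
rate = 8000
tone = [math.sin(2 * math.pi * 440 * n / rate) for n in range(rate // 10)]
variants = augment(tone, random.Random(0))
```

Each input recording yields four training examples here, all carrying the same label. In practice the perturbed waveforms would typically be converted to spectrograms or similar features before being passed to a CNN.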

Published

2020-03-11

How to Cite

Kanimozhiselvi, C. S., & Sathiyawathi, T. (2020). CNN WITH DATA AUGMENTATION FOR VOICE PATHALOGY IDENTIFICATION. International Journal of Advanced Science and Technology, 29(3s), 1182-1188. Retrieved from http://sersc.org/journals/index.php/IJAST/article/view/5981

Issue

Vol. 29 No. 3s (2020) (Special Issue)

Section

Articles