Journal of Voice
Volume 25, Issue 1 , Pages 38-43, January 2011

Discrimination Between Pathological and Normal Voices Using GMM-SVM Approach

Thinkit Speech Lab, Institute of Acoustics, Chinese Academy of Science, Beijing, China

Accepted 4 August 2009. published online 08 February 2010.

Summary 

Acoustic features of vocal tract function are used widely in the study of pathological voices detection. Classification of normal and pathological voices by acoustic parameters is a useful way to diagnose voice diseases. In this aspect, mel-frequency cepstral coefficients are proved to be effective with traditional classifiers such as Gaussian Mixture Model (GMM). However, the accuracy of the classification method can be further improved. In this article, a Gaussian mixture model supervector kernel-support vector machine (GMM-SVM) classifier is compared with GMM classifier for the detection of voice pathology. We found that a sustain vowel phonation can be classified as normal or pathological with an accuracy of 96.1%. Voice recordings are selected from the Kay database to carry out the experiments. Experimental results show that equal error rates decrease from 8.0% for GMM to 4.6% for GMM-SVM.

Key Words: Pathological voices, GMM-SVM

To access this article, please choose from the options below

Login to an existing account or Register a new account.

  • Purchase this article for 31.50 USD (You must login/register to purchase this article)

    Online access for 24 hours. The PDF version can be downloaded as your permanent record.

  • Subscribe to this title

    Get unlimited online access to this article and all other articles in this title 24/7 for one year.

  • Claim access now

    For current subscribers with Society Membership or Account Number.

  • Visit SciVerse ScienceDirect to see if you have access via your institution.
 

PII: S0892-1997(09)00119-2

doi:10.1016/j.jvoice.2009.08.002

Journal of Voice
Volume 25, Issue 1 , Pages 38-43, January 2011