Journal of Voice
Volume 24, Issue 6 , Pages 667-677 , November 2010

Pathological Likelihood Index as a Measurement of the Degree of Voice Normality and Perceived Hoarseness

  • Juan Ignacio Godino-Llorente

      Affiliations

    • Universidad Politécnica de Madrid, Circuits & Systems Engineering Department, Ctra. de Valencia, Madrid, Spain
    • Corresponding Author InformationAddress correspondence and reprint requests to Juan Ignacio Godino-Llorente, Universidad Politécnica de Madrid, Ctra. de Valencia, km. 7, 28031, Madrid, Spain.
  • ,
  • Pedro Gómez-Vilda

      Affiliations

    • Universidad Politécnica de Madrid, Circuits & Systems Engineering Department, Ctra. de Valencia, Madrid, Spain
  • ,
  • Fernando Cruz-Roldán

      Affiliations

    • Universidad de Alcalá, Escuela Politécnica, Signal Theory and Communications Department, Ctra. de Madrid-Barcelona, Alcalá de Henares, Madrid, Spain
  • ,
  • Manuel Blanco-Velasco

      Affiliations

    • Universidad de Alcalá, Escuela Politécnica, Signal Theory and Communications Department, Ctra. de Madrid-Barcelona, Alcalá de Henares, Madrid, Spain
  • ,
  • Rubén Fraile

      Affiliations

    • Universidad Politécnica de Madrid, Circuits & Systems Engineering Department, Ctra. de Valencia, Madrid, Spain

,Accepted 20 April 2009.

References 

  1. Smits I, Ceuppens P, De Bodt M. A comparative study of acoustic voice measurements by means of Dr. Speech and Computerized Speech Lab. J Voice. 2005;19:187–196
  2. Baken RJ, Orlikoff R. Clinical Measurement of Speech and Voice. 2nd ed. San Diego, CA: Singular Publishing Group; 2000;
  3. Feijoo S, Hernández-Espinosa C. Short-term stability measures for the evaluation of vocal quality. J Speech Hear Res. 1990;33:324–334
  4. Klingholtz F, Martin F. The measurement of the signal-to-noise ratio (SNR) in continuous speech. Speech Commun. 1987;6:15–26
  5. Yumoto E, Gould WJ, Baer T. Harmonics-to-noise ratio as an index of the degree of hoarseness. J Acoust Soc Am. 1982;71:1544–1550
  6. Qi Y, Weinberg B, Bi N, Hess W. Minimizing the effect of period determination on the computation of amplitude perturbation of voice. J Acoust Soc Am. 1995;97:2525–2532
  7. Qi Y, Hillman RE. Temporal and spectral estimations of harmonics-to-noise ratio in human voice signals. J Acoust Soc Am. 1997;102:537–543
  8. de Krom G. A cepstrum-based technique for determining a harmonics-to-noise ratio in speech signals. J Speech Hear Res. 1993;36:254–266
  9. Kasuya H, Ogawa S, Mashima K, Ebihara S. Normalized noise energy as an acoustic measure to evaluate pathologic voice. J Acoust Soc Am. 1986;80:1329–1334
  10. Deliyski D. Acoustic model and evaluation of pathological voice production. In: Proceedings of Eurospeech '93. Vol. 3:1993;p. 1969–1972Berlin, Germany
  11. Prosek RA, Montgomery AA, Walden BE, Hawkins DB. An evaluation of residue features as correlates of voice disorders. J Commun Disord. 1987;20:105–117
  12. Michaelis D, Gramss T, Strube HW. Glottal-to-noise excitation ratio—a new measure for describing pathological voices. Acustica/Acta Acustica. 1997;83:700–706
  13. Winholtz W. Vocal tremor analysis with the vocal demodulator. J Speech Hear Res. 1992;35:562–563
  14. Boyanov B, Hadjitodorov S. Acoustic analysis of pathological voices. A voice analysis system for the screening of laryngeal diseases. IEEE Eng Med Biol Mag. 1997;16:74–82
  15. Yumoto E, Sasaki Y, Okamura H. Harmonics-to-noise ratio and psychophysical measurement of the degree of hoarseness. J Speech Hear Res. 1984;27:2–6
  16. Hadjitodorov S, Boyanov B, Teston B. Laryngeal pathology detection by means of class-specific neural maps. IEEE Trans Inf Technol Biomed. 2000;4:68–73
  17. Parsa V, Jamieson DG. Identification of pathological voices using glottal noise measures. J Speech Lang Hear Res. 2000;43:469–485
  18. Hadjitodorov S, Mitev P. A computer system for acoustic analysis of pathological voices and laryngeal disease screening. Med Eng Phys. 2002;24:419–429
  19. Ritchings RT, McGillion MA, Moore CJ. Pathological voice quality assessment using artificial neural networks. Med Eng Phys. 2002;24:561–564
  20. Godino-Llorente JI, Gómez-Vilda P, Blanco-Velasco M. Dimensionality reduction of a pathological voice quality assessment system based on Gaussian mixture models and short-term cepstral parameters. IEEE Trans Biomed Eng. 2006;53:1943–1953
  21. Deller JR, Proakis JG, Hansen JHL. Discrete-Time Processing of Speech Signals. New York: Macmillan Series for Prentice Hall; 1993;
  22. Reynolds DA, Rose RC. Robust text-independent speaker identification using Gaussian mixture speaker models. IEEE Trans Speech Audio Processing. 1995;3:72–83
  23. Reynolds DA. Speaker identification using Gaussian mixture speaker models. Speech Commun. 1995;17:91–108
  24. Weiss NA. Introductory Statistics. 6th ed. Reading, MA: Addison Wesley; 2000;
  25. Manfredi C, D'Aniello M, Bruscaglioni P, Ismaelli A. A comparative analysis of fundamental frequency estimation methods with application to pathological voices. Med Eng Phys. 2000;22:135–147
  26. Martin AF, Doddington GR, Kamm T, Ordowski M, Przybocki MA. The DET curve in assessment of detection task performance. In: Proceedings of Eurospeech '97. Vol. IV:1997;p. 1895–1898Rhodes, Crete
  27. Hanley JA, McNeil BJ. A method of comparing the areas under receiver operating characteristics curves derived from the same cases. Radiology. 1983;148:839–843
  28. Hanley JA, McNeil BJ. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology. 1982;143:29–36
  29. Kay Elemetrics Corp . Disordered Voice Database. Version 1.03. Lincoln Park, NJ: Kay Elemetrics Corp; 1994;[CD-ROM]
  30. Hirano M. Psycho-Acoustic Evaluation of Voice. New York: Springer-Verlag; 1981;
  31. Dejonkere PH, Remacle M, Fresnel-Elbaz E, Woisard W, Crevier-Euchman L, Millet B. Differentiated perceptual evaluation of pathological voice quality: reliability and correlations with acoustic measurements. Rev Laringol Otol Rhinol. 1996;117:219–224
  32. Hirano M, Hibi S, Yoshida T, Hirade Y, Kasuya H, Kikuchi Y. Acoustic analysis of pathological voice. Acta Otolaringol. 1988;105(2):432–438
  33. Revis J, Giovanni A, Wuyts F. Comparison of different types of vowel fragments for the evaluation of voice quality. In: Proceedings of Voicedata'98. 1998;p. 80–85
  34. Bou-Ghazale SE, Hansen JHL. A comparative study of traditional and newly proposed features for recognition of speech under stress. IEEE Trans Speech Audio Processing. 2000;8:429–442
  35. Childers DG, Sung-Bae K. Detection of laryngeal function using speech and electroglottographic data. IEEE Trans Biomed Eng. 1992;39:19–25
  36. Oppenheim AV, Schafer RW, Buck JR. Discrete-Time Signal Processing. 2nd ed. New Jersey: Prentice Hall; 1999;
  37. Schalkoff RJ. Pattern Recognition: Statistical, Structural and Neural Approaches. New York: John Wiley & Sons; 1991;
  38. Moon TK. The expectation-maximization algorithm. IEEE Signal Processing Mag. 1996;13(6):47–60
  39. Tax DMJ, van Breukelen M, Duin RPW, Kittler J. Combining multiple classifiers by averaging of by multiplying. Pattern Recognit. 2000;33:1475–1485
  40. Kittler J, Hatef M, Duin RPW, Matas J. On combining classifiers. IEEE Trans Pattern Anal Mach Intell. 1998;20:226–239
  41. Bishop CM. Neural Networks for Pattern Recognition. 2nd ed. Oxford, UK: Oxford University Press; 1995;
  42. Michaelis D, Fröhlich M, Strube HW. Selection and combination of acoustic features for the description of pathologic voices. J Acoust Soc Am. 1998;103:1628–1639
  43. Fröhlich M, Michaelis D, Strube HW. Acoustic “breathiness measures” in the description of pathological voices,”. In: Proceedings of ICASSP'98. 1998;p. 937–940Seattle, WA, USA
  44. Kasuya H, Ogawa S, Kikuchi Y, Ebihara S. An acoustic analysis of pathological voice and its application to the evaluation of laryngeal pathology. Speech Commun. 1986;5:171–181

PII: S0892-1997(09)00056-3

doi: 10.1016/j.jvoice.2009.04.003

Journal of Voice
Volume 24, Issue 6 , Pages 667-677 , November 2010