« Previous
Next »
Journal of Voice
Volume 25, Issue 1
, Pages 38-43
, January 2011
Discrimination Between Pathological and Normal Voices Using GMM-SVM Approach
References
- . A comparative study of acoustic voice measurements by means of Dr. Speech and Computerized Speech Lab. J Voice. 2005;19:187–196
- . Objective measures of voice production in normal subjects following prolonged voice use. J Voice. 1995;9:127–133
- . Quantitative spectral evaluation of shimmer and jitter. J Speech Lang Hear Res. 1985;28:169–174
- . A cepstrum-based technique for determining a harmonic-to-noise ratio in speech signals. J Speech Hear Res. 1993;36:254–266
- . Method for evaluation of the noise-to-harmonic-component ratios in pathological and normal voices. Acustica. 1990;70:89–91
- . Harmonics-to-noise ratio as an index of the degree of hoarseness. J Acoust Soc Am. 1982;71:1544
- . Acoustic correlates of vocal quality. J Speech Lang Hear Res. 1990;33:298–306
- . An evaluation of residue features as correlates of voice disorders. J Commun Disord. 1987;20:105–117
- . Temporal and spectral estimations of harmonics-to-noise ratio in human voice signals. J Acoust Soc Am. 1997;102:537
- Cepstral peak prominence: a more reliable measure of dysphonia. Ann Otol Rhinol Laryngol. 2003;112:324
- . Discrimination of pathological voices using a time-frequency approach. IEEE Trans Biomed Eng. 2005;52:421–430
- . Automatic detection of voice impairments by means of short-term cepstral parameters and neural network based detectors. IEEE Trans Biomed Eng. 2004;51:380–384
- . Pathological voice quality assessment using artificial neural networks. Med Eng Phys. 2002;24:561–564
- Chen W, Peng C, Zhu X, Wan B, Wei D. SVM-based identification of pathological voices. In: Engineering in Medicine and Biology Society. EMBS 2007. 29th Annual International Conference of the IEEE. August 22–26, 2007:3786–3789.
- . Robust text-independent speaker identification using Gaussian mixture speaker models. IEEE Trans Speech Audio Process. 1995;3:72–83
- . Support vector machines using GMM supervectors for speaker verification. IEEE Signal Process Lett. 2006;13:308–311
- Matejka R, Burget L, Schwarz P, et al. STBU system for the NIST 2006 speaker recognition evaluation. In: IEEE International Conference on Acoustics, Speech and Signal Processing, 2007. ICASSP 2007; 2007.
- . Dimensionality reduction of a pathological voice quality assessment system based on Gaussian mixture models and short-term cepstral parameters. IEEE Trans Biomed Eng. 2006;53:1943–1953
- Kay Elemetrics, Corp. Disordered Voice Database, 1.03 ed. Lincoln Park, NJ, 1994.
- . Analog-to-digital converters and their applications in radio receivers. IEEE Commun Mag. 1995;33:39–45
- L. Rabiner, Juang B-H. Fundamentals of Speech Recognition. Upper Saddle River, NJ: Prentice-Hall, Inc., 1993.
- . Speaker verification using adapted Gaussian mixture models. Digit Signal Process. 2000;10:19–41
- . Support vector machines for classification and regression. ISIS Tech Rep. 1998;14:
- . SVMLight: Support Vector Machine. University of Dortmund; 1999;http://svmlight.joachims.org
- . From regularization operators to support vector kernels. In: Advances in Neural Information Process Systems 10. Cambridge, MA: MIT press; 1998;p. 343–349
- . A Gentle Tutorial on the EM Algorithm and Its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models. Vol. 4. Berkeley, CA: International Computer Science Institute; 1998;
- . A hybrid algorithm for speaker adaptation using MAP transformation and adaptation. IEEE Signal Process Lett. 1997;4:167–169
- . Information Theory and Statistics. New York, NY: Wiley; 1959;
- . Fast approximation of Kullback-Leibler distance for dependence trees and hidden Markov models. IEEE Signal Process Lett. 2003;10:115–118
- Wan V, Campbell W. Support vector machines for speaker verification and identification. In: Neural Networks Signal Processing Proceedings of IEEE. December 11–13, 2000;2:775–784.
- . Receiver operating characteristic (ROC) methodology: the state of the art. Crit Rev Diagn Imaging. 1989;29:307–355
- Martin A, Doddington G, Kamm T., Ordowski M., Przybocki M. The DET curve in assessment of detection task performance. In: Fifth European Conference on Speech Communication and Technology. April 15–20, 1997.
- Cheng JM, Wang HC. A method of estimating the equal error rate for automatic speaker verification. In: Chinese Spoken Language Processing, 2004.
- . A new kernel for SVM MLLR based speaker recognition. Proc Interspeech. 2007;
- . Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models. Comput Speech Lang. 1995;9:171
PII: S0892-1997(09)00119-2
doi: 10.1016/j.jvoice.2009.08.002
© 2011 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
« Previous
Next »
Journal of Voice
Volume 25, Issue 1
, Pages 38-43
, January 2011
