This paper introduces an iterative nonlinear weighted method, based on the variation in the spectral energy distribution of a voice signal, for differentiating between four voice types: type 1 signals are nearly periodic, type 2 signals have strong modulations and subharmonics, type 3 signals are chaotic, and type 4 signals are dominated by stochastic noise.
A total of 135 voice signal samples of the sustained vowel /a/ were obtained from the Disordered Voice Database and individually categorized by voice type according to the classification system described in Sprecher et al (2010). The samples were analyzed using three nonlinear methods, spectrum convergence ratio, rate of divergence, and nonlinear energy difference ratio (NEDR), to compare their efficacy as classifiers.
An iterative nonlinear weighted method based on the derivative of instantaneous frequency and Fourier transformations was applied to calculate spectral energy distributions. These distributions were then used to calculate the NEDR, which in turn was used to classify voice signal types.
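The abstract does not give the NEDR formula or the weighting scheme, so the following is only a minimal sketch of the general idea of a spectral energy distribution, not the authors' method. It computes the normalized energy per frequency bin of a short frame with a direct discrete Fourier transform and then summarizes it with a hypothetical statistic, the share of energy at or above a chosen bin. The function names, the split bin, and the synthetic test signals are all assumptions made for illustration.

```python
import cmath
import math
import random


def spectral_energy_distribution(signal):
    """Return the normalized energy in each non-negative DFT bin."""
    n = len(signal)
    half = n // 2 + 1
    energies = []
    for k in range(half):
        # Direct O(n^2) DFT; adequate for a short illustrative frame.
        coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(signal))
        energies.append(abs(coeff) ** 2)
    total = sum(energies)
    # Normalize so the bins sum to 1 (left as zeros for a silent frame).
    return [e / total for e in energies] if total > 0 else energies


def high_band_energy_fraction(distribution, split_bin):
    """Hypothetical summary statistic (not the NEDR): share of the
    normalized energy located at or above split_bin."""
    return sum(distribution[split_bin:])


# Synthetic stand-ins for a nearly periodic and a noise-dominated signal.
n = 64
periodic = [math.sin(2 * math.pi * 4 * t / n) for t in range(n)]
rng = random.Random(0)
noisy = [rng.gauss(0.0, 1.0) for _ in range(n)]

periodic_dist = spectral_energy_distribution(periodic)
noisy_dist = spectral_energy_distribution(noisy)
periodic_fraction = high_band_energy_fraction(periodic_dist, 8)
noisy_fraction = high_band_energy_fraction(noisy_dist, 8)
```

In this toy setting the periodic signal concentrates its energy in a single low bin, while the noise spreads energy across the spectrum, which is the kind of contrast a spectral-energy-based metric can exploit.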
Statistical analysis revealed that NEDR effectively differentiated between all four voice types (P < 0.001). Subsequent multiclass receiver operating characteristic analysis demonstrated that NEDR (area under the curve [95% CI] = 0.99 [0.96–1.0]) had greater classification accuracy than spectrum convergence ratio and rate of divergence.
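The abstract does not state which multiclass receiver operating characteristic procedure was used. As a hedged illustration of one common generalization, the sketch below averages the two-class area under the curve over every pair of classes for a single scalar metric, assuming the metric tends to increase with the class label. The function names and the scores are hypothetical toy values, not data from the study.

```python
from itertools import combinations


def binary_auc(pos_scores, neg_scores):
    """AUC as the probability that a positive sample outscores a
    negative sample, counting ties as one half."""
    wins = 0.0
    for p in pos_scores:
        for q in neg_scores:
            if p > q:
                wins += 1.0
            elif p == q:
                wins += 0.5
    return wins / (len(pos_scores) * len(neg_scores))


def pairwise_mean_auc(scores_by_class):
    """Average the two-class AUC over all pairs of classes, treating the
    higher class label as the positive class in each pair."""
    classes = sorted(scores_by_class)
    aucs = [binary_auc(scores_by_class[hi], scores_by_class[lo])
            for lo, hi in combinations(classes, 2)]
    return sum(aucs) / len(aucs)


# Made-up metric values for voice types 1 to 4, for illustration only.
toy_scores = {
    1: [0.10, 0.20, 0.15],
    2: [0.30, 0.35, 0.25],
    3: [0.50, 0.45, 0.60],
    4: [0.80, 0.90, 0.55],
}
toy_auc = pairwise_mean_auc(toy_scores)
```

Only types 3 and 4 overlap in the toy data, so the averaged value falls just short of 1; a confidence interval like the one reported would additionally require a resampling or analytic variance estimate, which is not shown here.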
NEDR was shown to be an effective metric for objective differentiation between all four voice signal types. NEDR calculations were nearly instantaneous, a substantial improvement over the lengthy computation time required by previous nonlinear parameters. This metric could assist clinicians in diagnosing voice disorders and in monitoring treatment efficacy by tracking acoustic improvement in the voice over time.
- Exploiting nonlinear recurrence and fractal scaling properties for voice disorder detection. Biomed Eng Online. 2007; 6: 23
- Workshop on acoustic voice analysis: summary statement. In: National Center for Voice and Speech. 1995: 26-27 (Denver, CO; Available at:)
- Updating signal typing in voice: addition of type 4 signals. J Acoust Soc Am. 2010; 127: 3710-3716
- Measuring vocal quality with speech synthesis. J Acoust Soc Am. 2001; 110: 2560-2566
- Validity of rating scale measures of voice quality. J Acoust Soc Am. 1998; 104: 1598-1608
- What determines the differences in perceptual rating of dysphonia between experienced raters? Folia Phoniatr Logop. 1998; 50: 305-310
- Pathological speech processing: state-of-the-art, current challenges, and future directions. (IEEE Conference Proceedings) 2016: 6470-6474
- Nonlinear analyses of elicited modal, raised, and pressed rabbit phonation. J Voice. 2014; 28: 538-547
- Nonlinear dynamic-based analysis of severe dysphonia in patients with vocal fold scar and sulcus vocalis. J Voice. 2012; https://doi.org/10.1016/j.jvoice.2011.09.006
- Nonlinear dynamics of phonations in excised larynx experiments. J Acoust Soc Am. 2003; 114: 2198-2205
- Chaos in voice, from modeling to measurement. J Voice. 2005; 20: 2-17
- Bifurcations and chaos in newborn infant cries. Phys Lett. 1990; 145: 418-424
- Nonlinear dynamic analysis of disordered voice: the relationship between the correlation dimension (D2) and pre-/post-treatment change in perceived dysphonia severity. J Voice. 2010; 24: 285-293
- Suitability of acoustic perturbation measures in analysing periodic and nearly periodic voice signals. Folia Phoniatr Logop. 2005; 57: 38-47
- An objective parameter for quantifying the turbulent noise portion of voice signals. J Voice. 2016; 30: 664-669
- Using rate of divergence as an objective measure to differentiate between voice signal types based on the amount of disorder in the signal. J Voice. 2017; 31: 16-23
- Quantifying correlations in pitch- and amplitude contours of sustained phonation. Acta Acustica United Acustica. 2000; 86: 129-135
- The relationship between cepstral peak prominence and selected parameters of dysphonia. J Voice. 2002; 16: 20-27
- Predicting voice disorder status from smoothed measures of cepstral peak prominence using Praat and Analysis of Dysphonia in Speech and Voice (ADSV). J Voice. 2017; 31: 557-566
- Objective voice analysis for dysphonic patients: a multiparametric protocol including acoustic and aerodynamic measurements. J Voice. 2001; 15: 529-542
- Discrete-Time Signal Processing. 2nd ed. Prentice Hall, Upper Saddle River, NJ; 1999
- Weighted optimization-based distributed Kalman filter for nonlinear target tracking in collaborative sensor networks. IEEE Trans Cybern. 2016; 99: 1-14
- Uncertainty-aware frequency estimation algorithm for passive wireless resonant SAW sensor measurement. Sens Actuators A Phys. 2016; 237: 136-146
- Nonlinear image processing and filtering: a unified approach based on vertically weighted regression. Int J Appl Math Comput Sci. 2008; 18: 49-61
- Suboptimal FIR filtering of nonlinear models in additive white Gaussian noise. IEEE Trans Signal Process. 2012; 60: 5519-5527
- On effect size. Psychol Methods. 2012; 17: 137-152
- Digital Processing of Speech Signals. Prentice Hall, Upper Saddle River, NJ; 1978
- Perturbation and nonlinear dynamic analyses of voices from patients with unilateral laryngeal paralysis. J Voice. 2005; 19: 519-528
- Nonlinear dynamic analysis of voices before and after surgical excision of vocal polyps. J Acoust Soc Am. 2004; 115: 2270-2277
- Chaotic component obscured by strong periodicity in voice production system. Phys Rev E Stat Nonlin Soft Matter Phys. 2008; 77: 061922
- Nonlinear phenomena in contemporary vocal music. J Voice. 2004; 18: 1-12
- Nonlinear dynamical analysis of speech. J Acoust Soc Am. 1996; 100: 615-629
- Occurrence frequencies of acoustic patterns of vocal fry in American English speakers. J Voice. 2016; 30: 759.e11-759.e20
- Loud voice during environmental noise exposure in patients with vocal nodules. Logoped Phoniatr Vocol. 2007; 32: 60-70
- Outcomes measurement in voice disorders: application of an acoustic index of dysphonia severity. J Speech Hear Res. 2009; 52: 482-499
- Vowel-related differences in laryngeal articulatory and phonatory function. J Speech Hear Res. 1998; 41: 712-724
- Materials of acoustic analysis: sustained vowel versus sentence. J Voice. 2012; 26: 563-565
- Vowel selection and its effects on perturbation and nonlinear dynamic measures. Folia Phoniatr Logop. 2011; 63: 88-97
- The effects of vowels on voice perturbation measures. J Voice. 2004; 18: 318-324
- Vocal stability and vocal tract configuration: an acoustic and electroglottographic investigation. J Voice. 1995; 9: 173-181
- Acoustic discrimination of pathological voice: sustained vowels versus continuous speech. J Speech Hear Res. 2001; 44: 327-339
- Acoustic analyses of sustained and running voices from patients with laryngeal pathologies. J Voice. 2008; 22: 1-9
- Quantifying dysphonia severity using a spectral/cepstral-based acoustic index: comparisons with auditory-perceptual judgements from the CAPE-V. Clin Linguist Phon. 2010; 24: 742-758
- A nonlinear dynamical systems analysis of fricative consonants. J Acoust Soc Am. 1995; 97: 2511-2524
Published online: May 18, 2018
Accepted: February 14, 2018
Conflict of interest: None.
© 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.