Objective Indices of Perceived Vocal Strain



      A limited number of experiments have investigated the perception of strain compared to the voice qualities of breathiness and roughness despite its widespread occurrence in patients who have hyperfunctional voice disorders, adductor spasmodic dysphonia, and vocal fold paralysis among others.


      The purpose of this study is to determine the perceptual basis of strain through identification and exploration of acoustic and psychoacoustic measures.


      Twelve listeners evaluated the degree of strain for 28 dysphonic phonation samples on a five-point rating scale task. Computational estimates based on cepstrum, sharpness, and spectral moments (linear and transformed with auditory processing front-end) were correlated to the perceptual ratings.


      Perceived strain was strongly correlated with cepstral peak prominence, sharpness, and a subset of the spectral metrics. Spectral energy distribution measures from the output of an auditory processing front-end (ie, excitation pattern and specific loudness pattern) accounted for 77–79% of the model variance for strained voices in combination with the cepstral measure.


      Modeling the perception of strain using an auditory front-end prior to acoustic analysis provides better characterization of the perceptual ratings of strain, similar to our prior work on breathiness and roughness. Results also provide evidence that the sharpness model of Fastl and Zwicker (2007) is one of the strong predictors of strain perception.

      Key Words

      To read this article in full you will need to make a payment

      Purchase one-time access:

      Academic & Personal: 24 hour online accessCorporate R&D Professionals: 24 hour online access
      One-time access price info
      • For academic or personal research use, select 'Academic and Personal'
      • For corporate R&D use, select 'Corporate R&D Professionals'


      Subscribe to Journal of Voice
      Already a print subscriber? Claim online access
      Already an online subscriber? Sign in
      Institutional Access: Sign in to ScienceDirect


        • Hirano M.
        Psycho-acoustic evaluation of voice: GRBAS scale for evaluating the hoarse voice.
        Clinican Examination of Voice. Springer-Verlag, New York1981
        • Kempster G.B.
        • Gerratt B.R.
        • Abbott K.V.
        • et al.
        Consensus auditory-perceptual evaluation of voice: development of a standardized clinical protocol.
        Am J Speech-Lang Pathol. 2009; 18: 124-132
        • Lowell S.Y.
        • Kelley R.T.
        • Awan S.N.
        • et al.
        Spectral-and cepstral-based acoustic features of dysphonic, strained voice quality.
        Ann Otol Rhinol Laryngol. 2012; 121: 539-548
        • Netsell R.
        • Lotz W.
        • Shaughnessy A.L.
        Laryngeal aerodynamics associated with selected voice disorders.
        Am J Otolaryngol. 1984; 5: 397-403
        • Sundberg J.
        • Gauffin J.
        Waveform and spectrum of the glottal voice source.
        Speech Music Hearing Q Progress Status Report. 1978; 19: 35-50
        • Bergan C.C.
        • Titze I.R.
        • Story B.
        The perception of two vocal qualities in a synthesized vocal utterance: ring and pressed voice.
        J Voice. 2004; 18: 305-317
        • Moisik S.R.
        • Esling J.H.
        Modeling the biomechanical influence of epilaryngeal stricture on the vocal folds: A low-dimensional model of vocal–ventricular fold coupling.
        J Speech Lang Hear Res. 2014; 57: S687-S704
        • Titze I.R.
        Nonlinear source–filter coupling in phonation: theory.
        J Acoust Soc Am. 2008; 123: 1902-1915
        • Hillenbrand J.
        • Houde R.A.
        Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech.
        J Speech Lang Hear Res. 1996; 39: 311-321
        • Latoszek B.B.v.
        • Maryn Y.
        • Gerrits E.
        • et al.
        The Acoustic Breathiness Index (ABI): a multivariate acoustic model for breathiness.
        J Voice. 2017; 31 (511. e511-511. e527)
        • Samlan R.A.
        • Story B.H.
        • Bunton K.
        Relation of perceived breathiness to laryngeal kinematics and acoustic measures based on computational modeling.
        J Speech Lang Hear Res. 2013; 56: 1209-1223
        • Eddins D.A.
        • Anand S.
        • Camacho A.
        • et al.
        Modeling of breathy voice quality using pitch-strength estimates.
        J Voice. 2016; 30 (774. e771-774. e777)
        • Shrivastav R.
        The use of an auditory model in predicting perceptual ratings of breathy voice quality.
        J Voice. 2003; 17: 502-512
        • Shrivastav R.
        • Camacho A.
        • Patel S.
        • et al.
        A model for the prediction of breathiness in vowels.
        J Acoust Soc Am. 2011; 129: 1605-1615
        • Shrivastav R.
        • Sapienza C.M.
        Objective measures of breathy voice quality obtained using an auditory model.
        J Acoust Soc Am. 2003; 114: 2217-2224
        • Eddins D.A.
        • Kopf L.M.
        • Shrivastav R.
        The psychophysics of roughness applied to dysphonic voice.
        J Acoust Soc Am. 2015; 138: 3820-3825
        • Eddins D.A.
        • Shrivastav R.
        Psychometric properties associated with perceived vocal roughness using a matching task.
        J Acoust Soc Am. 2013; 134: EL294-EL300
        • Latoszek B.B.v.
        • De Bodt M.
        • Gerrits E.
        • et al.
        The exploration of an objective model for roughness with several acoustic markers.
        J Voice. 2018; 32: 149-161
        • Latoszek B.B.v.
        • Maryn Y.
        • Gerrits E.
        • et al.
        A meta-analysis: acoustic measurement of roughness and breathiness.
        J Speech Lang Hear Res. 2018; 61: 298-323
        • Wolfe V.
        • Martin D.
        Acoustic correlates of dysphonia: type and severity.
        J Commun Disord. 1997; 30: 403-416
        • Zwirner P.
        • Murry T.
        • Woodson G.E.
        Perceptual-acoustic relationships in spasmodic dysphonia.
        J Voice. 1993; 7: 165-171
        • Dejonckere P.H.
        • Remacle M.
        • Fresnel-Elbaz E.
        • et al.
        Differentiated perceptual evaluation of pathological voice quality: reliability and correlations with acoustic measurements.
        Rev Laryngologie-otologie-rhinol. 1996; 117: 219-224
        • Karlsson I.
        Modelling voice variations in female speech synthesis.
        Speech Commun. 1992; 11: 491-495
        • Bhuta T.
        • Patrick L.
        • Garnett J.D.
        Perceptual evaluation of voice quality and its correlation with acoustic measurements.
        J Voice. 2004; 18: 299-304
        • Awan S.N.
        • Roy N.
        • Jetté M.E.
        • et al.
        Quantifying dysphonia severity using a spectral/cepstral-based acoustic index: comparisons with auditory-perceptual judgements from the CAPE-V.
        Clin Linguist Phon. 2010; 24: 742-758
        • Deal R.E.
        • Emanuel F.W.
        Some waveform and spectral features of vowel roughness.
        J Speech Lang Hear Res. 1978; 21: 250-264
        • Hirano M.
        • Hibi S.
        • Yoshida T.
        • et al.
        Acoustic analysis of pathological voice: some results of clinical application.
        Acta oto-laryngol. 1988; 105: 432-438
        • Prosek R.A.
        • Montgomery A.A.
        • Walden B.E.
        • et al.
        An evaluation of residue features as correlates of voice disorders.
        J Commun Disorders. 1987; 20: 105-117
        • Titze I.R.
        Workshop on acoustic voice analysis: summary statement.
        Natl Center Voice Speech. 1994;
        • Shrivastav R.
        • Camacho A.
        A computational model to predict changes in breathiness resulting from variations in aspiration noise level.
        J Voice. 2010; 24: 395-405
        • Moore B.C.
        • Glasberg B.R.
        • Baer T.
        A model for the prediction of thresholds, loudness, and partial loudness.
        J Audio Eng Soc. 1997; 45: 224-240
        • Zwicker E.
        • Fastl H.
        Psychoacoustics: Facts and Models.
        Springer-Verlag, New York1990
        • Fastl H.
        • Zwicker E.
        Psychoacoustics: Facts and Models.
        3rd ed. Springer, Berlin2007
        • Zwicker E.
        • Fastl H.
        Pitch and pitch strength.
        Psychoacoustics: Facts and Models. Springer-Verlag, New York1990: 102-132
        • von Bismarck G.
        Sharpness as an attribute of the timbre of steady sounds.
        Acta Acust United Acust. 1974; 30: 159-172
        • Shrivastav R.
        • Eddins D.A.
        • Anand S.
        Pitch strength of normal and dysphonic voices.
        J Acoust Soc Am. 2012; 131: 2261-2269
        • Camacho A.
        On the use of auditory models' elements to enhance a sawtooth waveform inspired pitch estimator on telephone-quality signals.
        in: Paper presented at the 2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA); 2012. 2012
        • Patel S.
        • Shrivastav R.
        • Eddins D.A.
        Identifying a comparison for matching rough voice quality.
        J Speech Lang Hear Res. 2012; 55: 1407-1422
        • Shrivastav R.
        • Sapienza C.M.
        • Nandur V.
        Application of psychometric theory to the measurement of voice quality using rating scales.
        J Speech Lang Hearing Res. 2005; 48: 323-335
        • Hillenbrand J.
        • Cleveland R.A.
        • Erickson R.L.
        Acoustic correlates of breathy vocal quality.
        J Speech Lang Hear Res. 1994; 37: 769-778