Summary
Objectives
This study aimed to estimate the intertext variability of smoothed cepstral peak prominence
(CPPS), examine whether sound-processing techniques improved its variability and diagnostic
capability, and evaluate the degree of intertext variability in detail with reference
to the CPPS variabilities in sustained vowels.
Study design
This was a retrospective study.
Methods
Text readings of 58 Japanese syllables were recorded from 210 speakers with different
diagnoses and varying degrees of dysphonia, and were divided into six passages. Applying
the sound-processing techniques to those passages, we prepared three sample types:
(1) nonprocessed, (2) only-loud, and (3) only-voiced samples. The intertext CPPS variability
and diagnostic properties were compared across the passages and sample types. For
detailed analysis, we subsequently extracted 63 normophonic speakers who maintained
constant quality in their vowel utterances to evaluate the degree of intertext CPPS
variability in relation to the variabilities between repeated identical vowels and
across different vowels.
Results
Although several combinations of passages showed moderate-to-large CPPS variabilities,
those variabilities were decreased by either technique, especially the deletion of
silent segments, which resulted in the best diagnostic accuracy. The degree of intertext
CPPS variability for the only-voiced samples was comparable to that of the CPPS variabilities
in sustained vowels.
Conclusions
The sound-processing technique removing silent segments should be applied to enhance
the diagnostic properties of CPPS. The additional technique of deleting unvoiced segments
is worth adopting if clinicians and researchers seek to attenuate the influence of
text differences in calculating CPPS values.
Key Words
To read this article in full you will need to make a payment
Purchase one-time access:
Academic & Personal: 24 hour online accessCorporate R&D Professionals: 24 hour online accessOne-time access price info
- For academic or personal research use, select 'Academic and Personal'
- For corporate R&D use, select 'Corporate R&D Professionals'
Subscribe:
Subscribe to Journal of VoiceAlready a print subscriber? Claim online access
Already an online subscriber? Sign in
Register: Create an account
Institutional Access: Sign in to ScienceDirect
References
- A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques. Guideline elaborated by the Committee on Phoniatrics of the European Laryngological Society (ELS).Eur Arch Otorhinolaryngol. 2001; 258: 77-82
- Measuring voice outcomes: state of the science review.J Laryngol Otol. 2009; 123: 823-829
- Perceptual assessment of voice quality: past, present, and future.Persp Voice Voice Disord. 2010; 20: 62-67
- American National Standard Bioacoustical Terminology.Acoustical Society of America, New York, NY2015 (ANSI S3.20-2015)
- Collinearity and sample coverage issues in the objective measurement of vocal quality: the case of roughness and breathiness.J Speech Lang Hear Res. 2018; 61: 1-24
- Perceptual evaluation of voice quality: review, tutorial, and a framework for future research.J Speech Lang Hear Res. 1993; 36: 21-40
- Differential diagnosis of hoarseness.Folia Phoniatr. 1969; 21: 9-19
- Psycho-acoustic evaluation of voice.in: Arnold GE Winckel F Wyke BD Clinical Examination of Voice. Disorders of Human Communication 5. Springer-Verlag, Vienna, Austria1981: 81-84
- Consensus auditory-perceptual evaluation of voice: development of a standardized clinical protocol.Am J Speech Lang Pathol. 2009; 18: 124-132
- Some acoustic measures of fundamental periodicity of normal and pathologic larynges.J Acoust Soc Am. 1963; 35: 344-353
- Vowel amplitude modulations in patients with laryngeal diseases.J Acoust Soc Am. 1969; 45: 839-844
- Application of some acoustic measures for the evaluation of laryngeal dysfunction.Stud Phonol. 1973; 7: 17-23
- Vocal shimmer in sustained phonation of normal and pathologic voice.Ann Otol Rhinol Laryngol. 1976; 85: 377-381
- Harmonics-to-noise ratio as an index of the degree of hoarseness.J Acoust Soc Am. 1982; 71: 1544-1549
- Normalized noise energy as an acoustic measure to evaluate pathologic voice.J Acoust Soc Am. 1986; 80: 1329-1334
- Long-term average speech spectra for normal and hearing-impaired adolescents.J Acoust Soc Am. 1982; 71: 196-202
- Analysis, synthesis, and perception of voice quality variations among female and male talkers.J Acoust Soc Am. 1990; 87: 820-857
- Acoustic correlates of breathy vocal quality.J Speech Hear Res. 1994; 37: 769-778
- Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech.J Speech Hear Res. 1996; 39: 311-321
- Listener experience and perception of voice quality.J Speech Hear Res. 1990; 33: 103-115
- Individual differences in voice quality perception.J Speech Hear Res. 1992; 35: 512-520
- Perceptual evaluation.J Voice. 1992; 6: 155-158
- When and why listeners disagree in voice quality assessment tasks.J Acoust Soc Am. 2007; 122: 2354-2364
- Application of psychometric theory to the measurement of voice quality using rating scales.J Speech Lang Hear Res. 2005; 48: 323-335
- The effect of experience on classification of voice quality.J Voice. 2012; 26: 299-303
- Speech tasks and interrater reliability in perceptual voice evaluation.J Voice. 2014; 28: 725-732
- A comparison of voice quality ratings made by Japanese and American listeners using the GRBAS scale.Folia Phoniatr Logop. 2003; 55: 147-157
- Test-retest study of the GRBAS scale: influence of experience and professional background on perceptual rating of voice quality.J Voice. 1997; 11: 74-80
- Acoustic analyses of sustained and running voices from patients with laryngeal pathologies.J Voice. 2008; 22: 1-9
- Acoustic discrimination of pathological voice: sustained vowels versus continuous speech.J Speech Hear Res. 2001; 44: 327-339
- A pitch-synchronous analysis of hoarseness in running speech.J Acoust Soc Am. 1988; 84: 1292-1301
- Consistency and reliability of voice quality ratings for different types of speech fragments.J Speech Hear Res. 1994; 37: 985-1000
- Workshop on Acoustic Voice Analysis: Summary Statement.National Center for Voice and Speech, Denver, CO1995
- Acoustic measurement of overall voice quality: a meta-analysis.J Acoust Soc Am. 2009; 126: 2619-2634
- A meta-analysis: acoustic measurement of roughness and breathiness.J Speech Hear Res. 2018; 61: 1-26
- Short-time spectrum and ''cepstrum'' techniques for vocal-pitch detection.J Acoust Soc Am. 1964; 36: 296-302
- Cepstrum pitch determination.J Acoust Soc Am. 1967; 41: 293-309
- Cepstrum analysis of pathologic voices.J Phonetics. 1986; 14: 501-507
- The relationship between cepstral peak prominence and selected parameters of dysphonia.J Voice. 2002; 16: 20-27
- Cepstral peak prominence: a more reliable measure of dysphonia.Ann Otol Rhinol Laryngol. 2003; 112: 324-333
- Acoustic and perceptual parameters relating to connected speech are more reliable measures of hoarseness than parameters relating to sustained vowels.ORL J Otorhinolaryngol Relat Spec. 2004; 66: 70-73
- Spectral- and cepstral-based measures during continuous speech: capacity to distinguish dysphonia and consistency within a speaker.J Voice. 2011; 25: e223-e232
- Vowel- and text-based cepstral analysis of chronic hoarseness.J Voice. 2012; 26: 416-424
- Predictive value and discriminant capacity of cepstral- and spectral-based measures during continuous speech.J Voice. 2013; 27: 393-400
- Use of cepstral analyses for differentiating normal from dysphonic voices: a comparative study of connected speech versus sustained vowel in European Portuguese female speakers.J Voice. 2014; 28: 282-286
- A cepstral analysis of normal and pathologic voice qualities in Iranian adults: a comparative study.J Voice. 2017; 31 (508.e17–508.e23)
- Perceptual structure of adductor spasmodic dysphonia and its acoustic correlates.J Voice. 2012; 26 (818.e5–818.e13)
- Cepstral analysis of hypokinetic and ataxic voices: correlations with perceptual and other acoustic measures.J Voice. 2014; 28: 673-680
- Praat, a system for doing phonetics by computer.Glot Int. 2001; 5: 341-345
- Intrasubject variability of objective voice measures.J Voice. 1996; 10: 166-174
- Consistency of fundamental frequency and perturbation in repeated phonations of sustained vowels, reading, and connected speech.J Speech Hear Disord. 1990; 55: 360-363
- Measures of vocal function during changes in vocal effort level.J Voice. 1995; 9: 429-438
- Consistency of acoustic and aerodynamic measures of voice production over 28 days under various testing conditions.J Voice. 1999; 13: 477-483
- Jitter and shimmer differences among sustained vowel phonations.J Speech Hear Res. 1982; 25: 12-14
- Vocal stability and vocal tract configuration: an acoustic and electroglottographic investigation.J Voice. 1995; 9: 173-181
- Reliability of objective voice measures of normal speaking voices.J Voice. 2013; 27: 170-176
- Consistency of voice frequency and perturbation measures in children using cepstral analyses: a movement toward increased recording stability.JAMA Otolaryngol Head Neck Surg. 2013; 139: 811-816
- Effects of vocal intensity and vowel type on cepstral analysis of voice.J Voice. 2012; 26 (670.e15–670.e20)
- Toward improved ecological validity in the acoustic measurement of overall voice quality: combining continuous speech and sustained vowels.J Voice. 2010; 24: 540-555
- External validation of the acoustic voice quality index version 03.01 with extended representativity.Ann Otol Rhinol Laryngol. 2016; 125: 571-583
- Validation of the acoustic voice quality index in the Japanese language.J Voice. 2017; 31 (260.e1–260.e9)
- The acoustic voice quality index version 03.01 for the Japanese-speaking population.J Voice. 2018; (In press)
- The influence of gender and age on the acoustic voice quality index and dysphonia severity index: a normative study.J Voice. 2018; (In press)
- Examination—collecting information. Clinical Manual of Voice Disorders for Speech Therapists.in: Hirose H. ST no tame no onsei shogai shinryo manual. Interuna Publishers Inc., Tokyo, Japan2008: 13-32
- Acoustic analysis using recorded samples of sustained vowels and text readings: utility of smoothed cepstral peak prominence.in: Proceedings of the 117th annual Meeting of the ORL Society of Japan. 2016: 587
- Ordinal Methods for Behavioral Data Analysis.Psychology Press Ltd, London, UK1996
- Appropriate statistics for ordinal level data: should we really be using t-test and Cohen's d for evaluating group differences on the NSSE and other surveys?.in: Annual Meeting of the Florida Association of Institutional Research. 2006
- Investigating Communication: An Introduction to Research Methods.Prentice Hall, Englewood Cliffs, NJ1991
- Probability, Statistics and Estimation: Propagation of Uncertainties in Experimental Measurement.Creative Commons, Mountain View, CA2017
- The acoustic voice quality index. Toward expanded measurement of dysphonia severity in German subjects.HNO. 2012; 60: 715-720
- Objective assessment of pediatric voice disorders with the acoustic voice quality index.J Voice. 2012; 26 (672.e1–672.e7)
- The value of the acoustic voice quality index as a measure of dysphonia severity in subjects speaking different languages.Eur Arch Otorhinolaryngol. 2014; 271: 1609-1619
- A preliminary study of the acoustic voice quality index in Finnish speaking population.in: 11th Pan European Voice Conference, Florence, Italy2015 (August 31–September 4)
- Auditory-perceptual and acoustic methods in measuring dysphonia severity of Korean speech.J Voice. 2016; 30: 587-594
- Validation of the acoustic voice quality index in the Korean language.J Voice. 2018; (In press)
- Validation of the acoustic voice quality index in the Lithuanian language.J Voice. 2017; 31 (257.e1–257.e11)
- Validation of the acoustic voice quality index version 03.01 and the acoustic breathiness index in the Spanish language.Ann Otol Rhinol Laryngol. 2018; (In press)
- Effective glottal instant detection and electroglottographic parameter extraction for automated voice pathology assessment.IEEE J Biomed Health Inform. 2018; 22: 398-408
- Frequency and amplitude perturbation analysis of electroglottograph during sustained phonation.J Acoust Soc Am. 1986; 80: 58-62
- Glottographic signal perturbation in biomechanically different types of dysphonia.Laryngoscope. 1998; 108: 18-25
- Glottographic measures before and after levodopa treatment in Parkinson's disease.Laryngoscope. 1999; 109: 1287-1294
- Effectiveness of the computed analysis of electroglottographic signals in muscle tension dysphonia.Folia Phoniatr Logop. 2012; 64: 145-150
- Statistical analysis of the reliability of acoustic and electroglottographic perturbation parameters for the detection of vocal roughness.J Voice. 2014; 28 (263.e9–263.e16)
- Immediate effects of humming on computed electroglottographic parameters in patients with muscle tension dysphonia.J Voice. 2014; 28: 733-741
- Investigation of the immediate effects of humming on vocal fold vibration irregularity using electroglottography and high-speed laryngoscopy in patients with organic voice disorders.J Voice. 2017; 31: 48-56
Article info
Publication history
Published online: October 30, 2018
Accepted:
September 25,
2018
Identification
Copyright
© 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.