Summary
Objectives
Cepstral analysis does not require the detection of pitch within waveforms, which
makes it suitable for acoustic evaluation of connected speech contexts and severely
disordered voice. Although the utility of cepstral measurements, including cepstral
peak prominence (CPP) and cepstral spectral index of dysphonia (CSID), has been reported
for several languages, it has yet to be demonstrated in the Japanese language. The
current study aimed to investigate the utility of cepstral acoustic analysis for the
Japanese language as an indicator of dysphonia and the degree of dysphonia severity.
Methods
Ninety-five patients with dysphonia and thirty volunteers without voice complaint
uttered the sustained vowel /a/ and read four Japanese sentences designed to elicit
different laryngeal behaviors. The recorded voice samples were evaluated perceptually
by three raters according to the GRBAS scale (grade) and overall severity (OS) on
a visual analog scale. Participants were then divided into four groups based on grade
and OS: non-, mildly, moderately, and severely dysphonic groups. For the acoustic
analysis, CPP and CSID were computed using the Analysis of Dysphonia in Speech and
Voice, while jitter percentage (Jitt), shimmer percentage (Shim), and noise to harmonic
ratio were computed using the Multi-Dimensional Voice Program.
Results
Statistical analysis revealed that both CPP and CSID differed significantly between
all groups, except for grade between the non-dysphonic and mildly dysphonic groups.
Pearson correlation analysis between the acoustic measurements and the perceptual
ratings revealed that the absolute correlation coefficients for CPP, CSID, and Jitt
were greater than 0.7. Specifically, those for CPP and CSID were greater than 0.8
for OS. Receiver operating characteristic curve analysis showed that the AUC for CPP,
CSID, Jitt, and Shim was greater than 0.8 for both grade and OS. The cut-off values
for CPP and CSID, as determined by the Youden Index, were 6.74–7.18 and 12.16–20.39,
respectively.
Conclusion
The current study demonstrated the validity of CPP and CSID as indicators of dysphonia
and indices of dysphonia severity in the Japanese language.
Key Words
To read this article in full you will need to make a payment
Purchase one-time access:
Academic & Personal: 24 hour online accessCorporate R&D Professionals: 24 hour online accessOne-time access price info
- For academic or personal research use, select 'Academic and Personal'
- For corporate R&D use, select 'Corporate R&D Professionals'
Subscribe:
Subscribe to Journal of VoiceAlready a print subscriber? Claim online access
Already an online subscriber? Sign in
Register: Create an account
Institutional Access: Sign in to ScienceDirect
References
- The usefulness of auditory perceptual assessment and acoustic analysis as a screening test for voice problems.Phoniatr Logop. 2019; : 1-8https://doi.org/10.1159/000504220
- Validation of the Cepstral Spectral Index of Dysphonia (CSID) as a screening tool for voice disorders: development of clinical cutoff scores.J Voice. 2016; 30: 130-144https://doi.org/10.1016/j.jvoice.2015.04.009
- Validation of the measures of cepstral peak prominence as a measure of dysphonia severity in Spanish-speaking subjects.Acta Otorrinolaringol Esp. 2019; 70: 222-228https://doi.org/10.1016/j.otorri.2018.04.008
- A meta-analysis: acoustic measurement of roughness and breathiness.J Speech Lang Hear Res. 2018; 61: 298-323https://doi.org/10.1044/2017_JSLHR-S-16-0188
- Vowel- and text-based cepstral analysis of chronic hoarseness.J Voice. 2012; 26: 416-424https://doi.org/10.1016/j.jvoice.2011.05.001
- Acoustic Analysis of Voice, in Laryngeal Function & Voice Disoeders.Thieme Medical Publishers, New York2019
- Cepstral peak prominence: a more reliable measure of dysphonia.Ann Otol Rhinol Laryngol. 2003; 112: 324-333https://doi.org/10.1177/000348940311200406
- Estimating dysphonia severity in continuous speech: application of a multi-parameter spectral/cepstral model.Clin Linguist Phon. 2009; 23: 825-841https://doi.org/10.3109/02699200903242988
- Quantifying dysphonia severity using a spectral/cepstral-based acoustic index: comparisons with auditory-perceptual judgements from the CAPE-V.Clin Linguist Phon. 2010; 24: 742-758https://doi.org/10.3109/02699206.2010.492446
- The relationship between cepstral peak prominence and selected parameters of dysphonia.J Voice. 2002; 16: 20-27https://doi.org/10.1016/s0892-1997(02)00067-x
- Quantifying the cepstral peak prominence, a measure of dysphonia.J Voice. 2014; 28: 783-788https://doi.org/10.1016/j.jvoice.2014.05.005
- The acoustic voice quality index: toward improved treatment outcomes assessment in voice disorders.J Commun Disord. 2010; 43: 161-174https://doi.org/10.1016/j.jcomdis.2009.12.004
- Toward validation of the cepstral spectral index of dysphonia (CSID) as an objective treatment outcomes measure.J Voice. 2013; 27: 401-410https://doi.org/10.1016/j.jvoice.2013.04.002
- Spectral-cepstral estimation of dysphonia severity: external validation.Ann Otol Rhinol Laryngol. 2013; 122: 40-48https://doi.org/10.1177/000348941312200108
- Validation of the acoustic voice quality index in the Japanese language.J Voice. 2017; 31https://doi.org/10.1016/j.jvoice.2016.05.010
- Reliability of clinician-based (GRBAS and CAPE-V) and patient-based (V-RQOL and IPVI) documentation of voice disorders.J Voice. 2007; 21: 576-590https://doi.org/10.1016/j.jvoice.2006.05.001
- Clinical Examination of Voice.Springer, New York1981
Article info
Publication history
Published online: September 17, 2020
Footnotes
The authors report no conflicts of interest.
This study was supported by a research grant from the Japan Society of Logopedics and Phoniatrics.
Identification
Copyright
© 2020 The Voice Foundation. Published by Elsevier Inc. All rights reserved.