Abstract
Objectives
Study design
Methods
Results
Conclusions
Key Words
1. INTRODUCTION
2. THEORY
2.1 Time-domain EGG analysis
- (1)The Ap–p metric potentially holds one answer to our question: we would expect it to decrease with the decreasing adduction that might result from increased tracheal pull at high LV.23Unfortunately, the strength of the EGG signal depends on many things, including skin conductance, tissue distribution, and especially the distance between the skin electrodes and the glottis. In singing, VLP relative to the EGG electrodes changes a great deal, and we do not wish to fixate a subject and her larynx while the LV is exercised during singing. For this reason, Ap–p will be treated sceptically in this study.
- (2)The Qc metric, too, reports on the degree of adduction; and fortunately, it is independent of Ap–p. Several ways of calculating Qc are found in the literature, giving quite diverse results.26,27We propose here a threshold-free definition, Qci, being simply the integral over time, of the EGG pulse normalized from zero to one in amplitude and zero to one in cycle time, in other words, the area under the normalized curve. The greater this area, the more contact there has been between the vocal folds, over the cycle. Because the vast majority of EGG pulses have only one peak, Qci is also highly correlated with conventional Qc metrics. In principle, the value of this integral can range from zero to one. In the present material, we have seen Qci values from 0.18 to 0.6 (Supplementary file A-13). The advantages of this metric are that it considers the relative amount of contacting over the entire cycle waveform, rather than the relative time of a threshold crossing or peak; it does not rely on the existence of identifiable peaks in the EGG derivative; and it is less sensitive to noise in the signal. Figure 2 shows how the integral Qci differentiates between three EGG pulse shapes that would have received the same Qc if calculated using a conventional threshold at 3/7 of Ap–p. In this study, we are looking for effects that may be subtle, which Qci may reveal.FIGURE 2Here, both time and amplitude are normalized to the EGG cycle. The integrated area Qci under the normalized EGG pulse is preferred over a threshold-based contact quotient QcT, because Qci accounts for the shape of the whole pulse. The three pulse shapes shown (gray-blue-orange) have the same QcT, but receive different Qci values that better represent the amount of VF contact.
- (3)If tracheal pull does affect the VF configuration, then it might have an effect also on the speed of contacting, ie, the dEGGmax metric. In order to make the dEGGmax metric independent of Ap–p, we normalize it such that dEGGmaxN for a pure sine wave (no VF collision) receives the value one (representing the maximum derivative, or the peak of the cosine). Any VF contacting will give values greater than one, due to the contacting flank being steeper than that of the sine wave. This allows the comparison of maxima in the dEGG. In the present material, we have seen values of dEGGmaxN of up to six (Supplementary file A-14).

2.2 Harmonic-domain EGG analysis
- Selamtzis A
- Ternström S
2.3 Clustering of wave shapes
2.4 Measuring the percentage of vital capacity λ
2.5 The fo-SPL plane

3. METHOD
3.1 Participants
Subject | Age | Voice classification | Bunch and Chapman taxonomy |
---|---|---|---|
S1 | 44 | Contralto | 5.1 Local community club singers |
S2 | 43 | Mezzosoprano | 3.1 National singer |
S3 | 28 | Soprano | 4.1 Regional singer |
S4 | 24 | Mezzosoprano | 7.2 Full-time student in singing |
S5 | 40 | Mezzosoprano | 3.1 National singer |
S6 | 40 | Soprano | 2.1 International operatic singer |
S7 | 38 | Mezzosoprano | 5.1 Local community club singers |
S8 | 24 | Soprano | 7.2 Full-time student in singing |
3.2 Design and task


3.3 Equipment
Track # | Based on | Description |
---|---|---|
1 | clock | Starting time in seconds, for this cycle(monotonically increasing, but not contiguous) (A-11, A-12) |
2 | audio | fo in floating-point semitones; 57.0 = 220 Hz. Updated at a fixed interval of 21.53 ms. |
3 | audio | SPL @ 0.3 m in calibrated dB re 20 µPa. |
4 | audio | clarity, a metric of periodicity 0…1 (cycles for which clarity <0.96 were rejected) |
5 | audio | Crest factor of the audio signal, the peak-to-rms ratio in dB (not reported here) (A-17, A-18) |
6 | egg | Cluster number (1…5) as assigned to egg cycle wave shapes by FonaDyn (supervised learning) |
7 | egg | Running estimate of the sample entropy SampEn of the egg cycle data (not discussed here) (A-19) |
8…17 | egg | Level, in Bels re. full scale, of egg harmonic 1…N (here, N = 10 was used throughout) |
18 | egg | Level in Bels of the residual (the power remaining when harmonics 1…N are accounted for) (A-16) |
19…28 | egg | Phase in radians of egg harmonic 1…N |
29 | egg | Twice the phase of harmonic 1 (for internal use in FonaDyn) |
30 | a/d | Vertical laryngeal position (VLP), updated every 10 ms |
31 | a/d | Intraoral pressure during [p]-occlusion, pio, updated for every /pa/ (A-15) |
32 | a/d | λ (% vital capacity), updated every 10 ms |
33 | egg | EGG peak-to-peak amplitude Ap-p (based on FDs 1…10) |
34 | egg | Integrated contact quotient Qci (based on FDs 1…10) (A-13) |
35 | egg | Normalized dEGG peak amplitudedEGGmaxN (ditto) (A-14) |
3.4 Procedure

3.5 Cluster analysis of EGG wave shapes
3.6 Relating EGG metrics to other physiological signals


3.7 Comparing EGGs across conditions
4. RESULTS
4.1 Overviews of the EGG over the range of the song


4.2 Variation achieved in λ
(1) | (2) | (3) | (4) |
---|---|---|---|
Subject | Mean λ (%) “low LV” | Mean λ (%) “high LV” | Change in λ (%) |
S1 | 4.97 | 61.3 | 56.3 |
S2 | 21.3 | 72.8 | 51.5 |
S3 | -0.62 | 90.9 | 91.5 |
S4 | -4.34 | 71.9 | 76.2 |
S5 | 18.7 | 49.6 | 30.9 |
S6 | 13.2 | 59.3 | 46.1 |
S7 | 6.48 | 60.9 | 54.4 |
S8 | 11.2 | 65.3 | 54.1 |
Average | 8.9% | 66.5% | 57.6% |
4.3 Subject consistency across replications

Subject | [ɑ:] to [ɑ:] (replications) | [ɑ:] to [pɑ:] | [ɑ:] to lyric | All contexts ʻhigh λʼ to ʻlow λʼ | |
---|---|---|---|---|---|
Tokens per set | 6:6 | 12:12 | 12:12 | 18:18 | |
Medians of all per-cell distances | |||||
S1 | 0.69 | 0.88 | 0.86 | 0.80 | |
S2 | 0.32 | 0.71 | 0.51 | 0.30 | |
S3 | 0.24 | 0.56 | 0.53 | 0.32 | |
S4 | 0.56 | 0.62 | 0.76 | 0.63 | |
S5 | 0.48 | 0.38 | 0.44 | 0.43 | |
S6 | 0.53 | 0.51 | 0.58 | 0.58 | |
S7 | 0.46 | 0.60 | 0.86 | 0.72 | |
S8 | 0.43 | 0.40 | 0.43 | 0.31 | All distances between |
Average | 0.46 | 0.58 | 0.62 | 0.51 | all five clusters |
Means of all per-cell distances | Means (SD) | ||||
S1 | 1.23 | 1.41 | 1.49 | 1.24 | 4.16 (1.96) |
S2 | 0.45 | 1.61 | 0.64 | 0.47 | 2.59 (0.87) |
S3 | 0.46 | 1.04 | 0.87 | 0.54 | 2.99 (1.42) |
S4 | 0.72 | 0.82 | 0.99 | 0.80 | 3.62 (1.40) |
S5 | 0.56 | 0.54 | 0.58 | 0.55 | 2.40 (0.98) |
S6 | 0.66 | 0.66 | 0.72 | 0.70 | 2.43 (0.78) |
S7 | 0.59 | 0.81 | 1.01 | 0.88 | 2.76 (1.10) |
S8 | 0.48 | 0.45 | 0.50 | 0.39 | 1.51 (0.27) |
Average | 0.64 | 0.92 | 0.85 | 0.70 | 2.81 (1.41) |
4.4 Effect of conditions
4.5 Correlations



4.6 λ effect on EGG time-domain metrics


5. DISCUSSION AND CONCLUSION
- •The shape of the EGG waveform is quite personal, yet within individuals varies consistently over fo and SPL. This variation in the EGG wave shape was much larger than the variations due to phonetic context or percent vital capacity λ. It was accounted for by making so-called delta-VRPs, in which EGG wave shapes are always compared at the same fo and SPL.
- •Within subjects, the variation in EGG wave shape across the range of the song was found to be very similar in the different singing tasks [ɑ:], [pɑ:], or a lyric (section 4.1), with good reproducibility across replications (section 4.3). This means that EGG studies may be ecologically valid for nonphonetic conclusions about normal singing, even if only [ɑ:] vocalizations are used.
- •At the lowest SPLs, the higher λ values were associated with small reductions in the normalized maximum contacting speed dEGGmaxN (Figure 15) and small increases in the contact quotient Qci (Figure 16). These observed effects of λ on the cycle-normalized EGG waveform are consistent with the abducting effect that is postulated by the tracheal-pull hypothesis. Note that the lowest SPLs shown here resulted from the productions sung in the softest dynamic, piano, which were still well above the threshold of phonation.
- •Also at moderate and high SPLs, the eight subjects exhibited small yet statistically significant changes in dEGGmaxN and Qci with λ, but these changes were different from subject to subject, and not consistently in line with the tracheal pull hypothesis.
Pabon P, Ternström S. Feature Maps of the Acoustic Spectrum of the Voice. J Voice, e-publication available online 27 September 2018, https://doi.org/10.1016/j.jvoice.2018.08.014, open access.
Acknowledgments
Appendix. Supplementary materials
References
Doscher BM. The Functional Unity of the Singing Voice. Metuchen NJ: Scarecrow Press; 1994.
- Singing: The Mechanism and the Technic.Fisher, New York, NY1967
- Kinematics of the chest wall during speech production: volume displacements of the rib cage, abdomen, and lung.J Speech Lang Hear Res. 1973; 16: 78-115https://doi.org/10.1044/jshr.1601.78
- Dynamics of the chest wall during speech production: function of the thorax, rib cage, diaphragm, and abdomen.J Speech Lang Hear Res. 1976; 19: 297-356https://doi.org/10.1044/jshr.1902.297
- Kinetic aspects of singing.J Appl Physiol. 1966; 21: 483-496https://doi.org/10.1152/jappl.1966.21.2.483
- Respiratory kinematics in classical (opera) singers.J Speech Lang Hear Res. 1985; 28: 104-122https://doi.org/10.1044/jshr.2801.104
- Respiratory kinematics in female classical singers.J Voice. 1990; 4: 120-128https://doi.org/10.1016/S0892-1997(05)80136-5
- Lung volume levels in professional classical singing.Logop Phoniatr Vocology. 1997; 22: 61-70https://doi.org/10.3109/14015439709075316
- Respiratory behavior during the learning of a novel aria by a highly trained classical singer.in: Davis PJ Fletcher NH Vocal Fold Physiology/Controlling Complexity and Chaos. Singular Publishing Group, Inc., San Diego, CA1996: 325-343
- Respiratory function during speaking and singing in professional country singers.J Voice. 1996; 10: 39-49https://doi.org/10.1016/S0892-1997(96)80017-8
- X-ray studies on bronchial movements.Am J Anatomy. 1922; 35: 303-320
- Die Stärke des Trachealzuges beim lebenden Menschen und seine Bedeutung für die Kehlkopfmechanik.Ztschr Biol. 1959; 111: 155-164
- Lung volume and phonation: a methodological study.Logop Phoniatr Vocol. 1996; 21: 13-20https://doi.org/10.3109/14015439609099198
- Effects of lung volume on the glottal voice source.J Voice. 1998; 12: 424-433https://doi.org/10.1016/S0892-1997(98)80051-9
- Tracking multi-channel electroglottograph measurement of larynx height in singers.Scand J Logop Phoniatr. 1993; 18: 143-152https://doi.org/10.3109/14015439309101360
- Vertical laryngeal position during continuous and discrete vocal frequency change.J Speech Lang Hear Res. 1975; 18: 707-718https://doi.org/10.1044/jshr.1804.707
- Effects of inhalatory abdominal wall movement on vertical laryngeal position during phonation.J Voice. 2001; 15: 384-394https://doi.org/10.1016/S0892-1997(01)00040-6
- Belly-in or belly-out? Effects of inhalatory behaviour and lung volume on voice function in male opera singers.Tmh-Qpsr. 2003; 45: 61-74
- Physiological characteristics of the supported singing voice. A preliminary study.J Voice. 1995; 9: 45-56https://doi.org/10.1016/S0892-1997(05)80222-X
- Consistency of inhalatory breathing patterns in professional operatic singers.J Voice. 2001; 15: 373-383https://doi.org/10.1016/S0892-1997(01)00039-X
- Questions regarding the function of external laryngeal muscles.in: Brewer D. Research Potentials in Voice Physiology. State University of NY, Syracuse, NY1964: 20-40
- Laryngeal function associated with changes in lung volume during voice and speech production in normal speaking women.Unpublished doctoral dissertation. University of Arizona, 1999 (Available at:) (Accessed April 25, 2018)
- Effects of lung volume on the glottal voice source and the vertical laryngeal position in male professional opera singers.Tmh-Qpsr. 2003; 45 (Accessed April 19, 2018)
- Electroglottography.J Voice. 1992; 6: 98-110https://doi.org/10.1016/S0892-1997(05)80123-7
- Electroglottography and direct measurement of vocal fold contact area.in: Fujimura O Vocal Physiology: Voice Production, Mechanisms and Function. Raven Press, New York1988: 279-291
- Relationship between the electroglottographic signal and vocal fold contact area.J Voice. 2016; 30: 161-171https://doi.org/10.1016/j.jvoice.2015.03.018
- A comparison of different methods to measure the EGG contact quotient.Logop Phoniatr Vocol. 2006; 31: 126-138https://doi.org/10.1080/14015430500376580
- FonaDyn—a system for real-time analysis of the electroglottogram, over the voice range.SoftwareX. 2018; 7: 74-80https://doi.org/10.1016/j.softx.2018.03.002
- Measurement of the separate volume changes of rib cage and abdomen during breathing.J Appl Physiol. 1967; 22: 407-422https://doi.org/10.1152/jappl.1967.22.3.407
- Electroglottographic Analysis of Phonatory Dynamics and States.KTH Royal Institute of Technology, 2014 (Licentiate thesis, Stockholm, Sweden, 2014. ISBN 978-91-7595-189-8)
- Analysis of vibratory states in phonation using spectral features of the electroglottographic signal.J Acoust Soc Am. 2014; 136: 2773-2783https://doi.org/10.1121/1.4896466
- Investigation of the relationship between electroglottogram waveform, fundamental frequency, and sound pressure level using clustering.J Voice. 2016; https://doi.org/10.1016/j.jvoice.2016.11.003
- More like this.(Doctoral thesis in Computer Science) University of California at San Diego, 2012: 151-152 (Available online at) (September 2018)
- Electroglottography in Real-Time Feedback for Healthy Singing.KTH Royal Institute of Technology, Stockholm, Sweden2016 (M.Sc. degree thesis in computer science and communication. Available online at) (February 2018)
- The voice range profile: Its function, applications, pitfalls and potential.Acta Acust United Acust. 2016; 102: 268-283https://doi.org/10.3813/AAA.918943
- Taxonomy of singers used as subjects in scientific research.J Voice. 2000; 14: 363-369https://doi.org/10.1016/S0892-1997(00)80081-8
Pabon P, Ternström S. Feature Maps of the Acoustic Spectrum of the Voice. J Voice, e-publication available online 27 September 2018, https://doi.org/10.1016/j.jvoice.2018.08.014, open access.
Article info
Publication history
Identification
Copyright
User license
Creative Commons Attribution (CC BY 4.0) |
Permitted
- Read, print & download
- Redistribute or republish the final article
- Text & data mine
- Translate the article
- Reuse portions or extracts from the article in other works
- Sell or re-use for commercial purposes
Elsevier's open access license policy