Analysis of Vocal-fold Vibrations from High-Speed Laryngeal Images Using a Hilbert Transform-Based Methodology

Published:September 08, 2004DOI:


      This paper presents a Hilbert transform-based approach to analyze vocal fold vibrations in human subjects exhibiting normal and abnormal voice productions. This new approach is applied to the analysis of glottal area waveform (GAW) and is capable of providing useful information on the vocal fold vibration. The GAW is extracted from high-speed laryngeal images by delineating the glottal edge for each image frame. An analytic signal is generated through the Hilbert transform of the GAW, which yields a recognizable pattern of the vocal fold vibration in the analytic phase plane. The vibratory pattern is comprehensive and can be correlated with specific voice conditions. Quantitative measures of the glottal perturbation are introduced using the analytic amplitude and instantaneous frequency obtained from the analysis. Examples of clinical voice recordings are used to evaluate and test the effectiveness of this approach in providing qualitative representation and quantitative characteristics of vocal fold vibratory behavior. The results demonstrate the potential of using this new analytical tool incorporated with the high-speed laryngeal imaging modality for clinical voice assessment.

      Key Words

      To read this article in full you will need to make a payment


      Subscribe to Journal of Voice
      Already a print subscriber? Claim online access
      Already an online subscriber? Sign in
      Institutional Access: Sign in to ScienceDirect


        • Hirano M.
        Examination of vocal fold vibration.
        in: Arnold G.E. Winckel F. Wyke B.D. Disorders of Human Communication. Springer-Verlag, Vienna1981
        • Bless D.M.
        • Hirano M.
        • Feder R.J.
        Videostroboscopic evaluation of the larynx.
        Ear Nose Throat J. 1987; 66: 289-296
        • Booth J.R.
        • Childers D.G.
        Automated analysis of ultra high-speed laryngeal films.
        IEEE Trans Biomed Eng. 1979; 26: 185-192
        • Woo P.
        Quantification of videostrobolaryngoscopic findings—measurements of the normal glottal cycle.
        Laryngoscope. 1996; 106: 1-27
        • Noordzij J.P.
        • Woo P.
        Glottal area waveform analysis of benign vocal fold lesions before and after surgery.
        Ann Otol Rhinol. Laryngol. 2000; 109: 441-446
        • Kitzing P.
        Stroboscopy—a pertinent laryngological examination.
        J Otolaryngol. 1985; 14: 151-157
        • Timke R.
        • von Leden H.
        • Moore P.
        Laryngeal vibrations: Measurements of the glottic wave; Part I. The normal vibratory cycle.
        Archives Otolaryngology. 1958; 68: 1-19
        • Timke R.
        • von Leden H.
        • Moore P.
        Laryngeal vibrations: measurements of the glottic wave; Part II. physiological variations. A.M.A.
        Archives Otolaryngology. 1959; 69: 438-444
        • von Leden H.
        • Moore P.
        • Timke R.
        Laryngeal vibrations: measurements of the glottic wave; Part III. The pathologic larynx.
        Archives Otolaryngology. 1960; 71: 16-35
        • Wittenberg T.
        • Tigges M.
        • Mergell P.
        • Eysholdt U.
        Functional imaging of vocal fold vibration: Digital multislice high-speed kymography.
        J Voice. 2000; 14: 422-442
        • Colton R.H.
        • Casper J.K.
        • Brewer D.W.
        • Conture E.G.
        Digital processing of laryngeal images: A preliminary report.
        J. of Voice. 1989; 3: 132-142
        • Sataloff R.T.
        • Spiegel J.R.
        • Caroll L.M.
        • Schiebel B.R.
        • Darby K.S.
        • Rulnick R.
        Strobovideolaryngoscopy in professional voice users: Results and clinical value.
        J. Voice. 1988; 1: 359-364
        • Woo P.
        • Colton R.H.
        • Casper J.K.
        • Brewer D.W.
        Diagnostic value of stroboscopic examination in hoarse patients.
        J. Voice. 1991; 5: 231-238
        • Hirano M.
        • Bless D.M.
        Videostroboscopic Examination.
        Singular Publishing Group, San Diego, CA1993
        • Hirose H.
        • Kiritani S.
        • Imagawa H.
        High-speed digital image analysis of laryngeal behavior in running speech.
        in: Fukimura O. Vocal Physiology: Voice Production. Raven Press Ltd., New York1988
        • Kiritani S.
        High-speed digital image recording for observing vocal fold vibration.
        in: Kent R.D. Ball M. Voice Quality Measurement. Singular Publishing Group, San Diego, CA2000
        • Honda K.
        • Kiritani S.
        • Iwagawa H.
        • Hirose H.
        High-speed digital recording of vocal fold vibrations using a solid-state image sensor.
        in: Baer T. Sasaki C. Harris K.S. Laryngeal Function in Phonation and Respiration. Little, Brown and Co., Boston, MA1987: 485-491
        • Gonzales R.
        • Woods R.
        Digital Image Processing.
        Addison-Wesley, Reading, MA1992
        • Jain A.
        Fundamentals of Digital Image Processing.
        Prentice-Hall, Englewood Cliffs, NJ1986 (408)
        • Davies E.
        Machine Vision.
        Academic Press, New York1990 (91–96)
        • Mallat S.
        A Wavelet Tour of Signal Processing.
        Academic Press, New York1999
      1. Huang NE, Shen Z, Long SR, Wu ML, Shih HH. The empirical mode decomposition and Hilbert spectrum for non-linear and non-stationary time series analysis. Proc. R. Soc. London. Ser. A 1998;54:903–995.

        • Hahn S.L.
        Hilbert Transforms in Signal Processing.
        Artech House, Norwood, MA1996 (305)
        • Titze I.R.
        Principles of Voice Production.
        Allyn & Bacon, Boston, MA1994
      2. Titze IR. Workshop on acoustic voice analysis, summary statement. National Center for Voice and Speech, Wendell Johnson Speech and Hearing Center, The University of Iowa; 1995.