A Novel Source-Filter Stochastic Model for Voice Production

  • E. Cataldo
    Address correspondence and reprint requests to E. Cataldo, Universidade Federal Fluminense, Rua Passo da Patria, 156, Niteroi, Brazil.
    Universidade Federal Fluminense, Graduate program in Electrical and Telecommunications Engineering, Niterói, RJ, Brazil
  • L. Monteiro
    Universidade Federal Fluminense, Graduate program in Electrical and Telecommunications Engineering, Niterói, RJ, Brazil
  • C. Soize
    Université Gustave Eiffel, Laboratoire Modélisation et Simulation Multi Echelle, Marne-La-Vallée, France
Published: January 01, 2021


      This paper proposes a novel stochastic model for producing voiced sounds. The model uses Fant's source-filter theory to generate voice signals and, consequently, does not consider the coupling between the vocal tract and the vocal folds. Two novelties are proposed. The first is a new model obtained by unifying two deterministic one-mass spring-damper models from the literature. The second is a stochastic model that can generate and control the level of jitter, producing even hoarse voice signals or signals with pathological characteristics, while remaining simpler than the models discussed in the literature. An inverse stochastic problem is then solved for two cases: a normal voice and a voice obtained from a case of vocal fold paralysis. The model parameters are identified in both cases, which allows the model to be validated.
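
      The source-filter idea and the notion of jitter mentioned above can be illustrated with a minimal sketch. This is not the authors' model: it assumes an impulse-train source with independent Gaussian perturbations of each glottal period, a single two-pole resonator standing in for the vocal tract, and one common definition of local jitter (mean absolute difference between consecutive periods divided by the mean period). All function names and parameter values (`f0`, `jitter`, `freq`, `bandwidth`, `fs`) are hypothetical choices made for illustration only.

```python
import math
import random


def jittered_periods(n_cycles, f0=120.0, jitter=0.01, seed=0):
    """Glottal periods (s) with a relative Gaussian perturbation (assumed form).

    Assumes jitter is small enough that every period stays positive.
    """
    rng = random.Random(seed)
    t0 = 1.0 / f0
    return [t0 * (1.0 + jitter * rng.gauss(0.0, 1.0)) for _ in range(n_cycles)]


def local_jitter(periods):
    """Mean absolute consecutive-period difference divided by the mean period."""
    diffs = [abs(a - b) for a, b in zip(periods[1:], periods[:-1])]
    return (sum(diffs) / len(diffs)) / (sum(periods) / len(periods))


def impulse_train(periods, fs=16000):
    """Source: one unit impulse at the start of each glottal cycle."""
    n = int(sum(periods) * fs) + 1
    x = [0.0] * n
    t = 0.0
    for p in periods:
        x[int(t * fs)] = 1.0
        t += p
    return x


def resonator(x, fs=16000, freq=700.0, bandwidth=100.0):
    """Filter: two-pole resonator used as a stand-in for one formant."""
    r = math.exp(-math.pi * bandwidth / fs)          # pole radius, r < 1
    a1 = 2.0 * r * math.cos(2.0 * math.pi * freq / fs)
    a2 = -r * r
    y, y1, y2 = [], 0.0, 0.0
    for s in x:
        y0 = s + a1 * y1 + a2 * y2                   # y[n] = x[n] + a1*y[n-1] + a2*y[n-2]
        y.append(y0)
        y2, y1 = y1, y0
    return y


# Source and filter are computed independently (no vocal tract / vocal fold coupling).
periods = jittered_periods(50, f0=120.0, jitter=0.01)
voice = resonator(impulse_train(periods))
print(f"local jitter: {100 * local_jitter(periods):.2f} %")
```

      Raising the hypothetical `jitter` parameter increases the cycle-to-cycle variation of the periods, which is the quantity a stochastic model of this kind aims to generate and control.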



