Advertisement

Improving the Quality and Intelligibility of Electrolaryngeal Speech during Mobile Communication with Landline Analogous Bandpass Filtering

  • Lani Rachel Mathew
    Correspondence
    Address correspondence and reprint requests to Lani Rachel Mathew, Dept. of Electronics & Communication, Mar Baselios College of Engineering & Technology, LBS Center for Science and Technology, University of Kerala, Kerala,India
    Affiliations
    Dept. of Electronics & Communication, Mar Baselios College of Engineering & Technology, LBS Center for Science and Technology, University of Kerala, Kerala,India
    Search for articles by this author
  • K. Gopakumar
    Affiliations
    APJ Abdul Kalam Technological University, Kerala, India
    Search for articles by this author

      Abstract

      The normal functioning of the larynx or voicebox gives humans the ability to converse with others effortlessly. However, if the larynx is permanently removed due to injury or illness, speech cannot be produced. The electrolarynx, a noninvasive device used to produce speech for such persons, has a motor-induced hum which hinders telephone-based communication among electrolaryngeal speakers. This adversely affects the morale of electrolarynx users, leading to abandonment of technology, psychological issues and decreased quality of life. This paper examines a technique based on bandpass filtering of electrolaryngeal speech which has the potential of increasing the quality and intelligibility of speech produced by means of an electrolarynx. Finite impulse response and infinite impulse response filtering with bandpass frequencies analogous to telephone landline and wideband filtering were investigated. Objective testing and subjective testing of quality and intelligibility of electrolaryngeal speech have been performed in each phase of the work. Statistical analysis based on one-way analysis of variance (ANOVA) has been conducted after each round of subjective testing. Results indicate that the quality as well as intelligibility of electrolaryngeal speech can be improved with the help of bandpass filtering.

      Keywords

      To read this article in full you will need to make a payment

      Purchase one-time access:

      Academic & Personal: 24 hour online accessCorporate R&D Professionals: 24 hour online access
      One-time access price info
      • For academic or personal research use, select 'Academic and Personal'
      • For corporate R&D use, select 'Corporate R&D Professionals'

      Subscribe:

      Subscribe to Journal of Voice
      Already a print subscriber? Claim online access
      Already an online subscriber? Sign in
      Institutional Access: Sign in to ScienceDirect

      References

        • Clements K.S.
        • Rassekh C.H.
        • Seikaly H.
        • et al.
        Communication after laryngectomy: an assessment of patient satisfaction.
        Arch Otolaryngol Head Neck Surg. 1997; 123: 493-496
        • Law I.K.-Y.
        • Ma E.P.-M.
        • Yiu E.M.-L.
        Speech intelligibility, acceptability, and communication-related quality of life in chinese alaryngeal speakers.
        Arch Otolaryngol Head Neck Surg. 2009; 135: 704-711
        • Liu H.
        • Wan M.
        • Wang S.
        • et al.
        Aerodynamic characteristics of laryngectomees breathing quietly and speaking with the electrolarynx.
        J Voice. 2004; 18: 567-577
        • Tartter V.C.
        Whats in a whisper?.
        The Journal of the Acoustical Society of America. 1989; 86: 1678-1683
        • Sharifzadeh H.R.
        • HajiRassouliha A.
        • McLoughlin I.V.
        • et al.
        A training-based speech regeneration approach with cascading mapping models.
        Comput Electr Eng. 2017; 62: 601-611
        • Kubert H.L.
        • Stepp C.E.
        • Zeitels S.M.
        • et al.
        Electromyographic control of a hands-free electrolarynx using neck strap muscles.
        J Commun Disord. 2009; 42: 211-225
        • Goldstein E.
        • Heaton J.
        • Kobler J.
        • et al.
        Design and implementation of a hands-free electrolarynx device controlled by neck strap muscle electromyographic activity.
        First International IEEE EMBS Conference on Neural Engineering. 2003: 169-172
        • Xiao K.
        • Wang S.
        • Wan M.
        • et al.
        Radiated noise suppression for electrolarynx speech based on multiband time-domain amplitude modulation.
        IEEE/ACM Trans Audio Speech Lang Process. 2018; 26: 1585-1593
        • Espy-Wilson C.Y.
        • Chari V.R.
        • MacAuslan J.M.
        • et al.
        Enhancement of electrolaryngeal speech by adaptive filtering.
        J Speech Lang Hearing Res. 1998; 41: 1253-1264
        • Haijun N.
        • Wan M.
        • Wang S.
        • Liu H.
        Enhancement of electrolarynx speech using adaptive noise cancelling based on independent component analysis.
        Med Biol Eng Comput. 2003; 41: 670-678
        • Mathew L.R.
        • Gopakumar K.
        Evaluation of speech enhancement algorithms applied to electrolaryngeal speech degraded by noise.
        Appl Acoust. 2021; 174: 107771
        • Cox S.R.
        • Doyle P.C.
        The influence of electrolarynx use on postlaryngectomy voice-related quality of life.
        Otolaryngol Head Neck Surg. 2014; 150: 1005-1009
        • Ferlay J.
        • Soerjomataram I.
        • Dikshit R.
        • et al.
        Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012.
        Int J Cancer. 2014; 136: E359E386
        • Bobdey S.
        • Jain A.
        • Balasubramanium G.
        Epidemiological review of laryngeal cancer: an Indian perspective.
        Indian J Med Paediatr Oncol. 2015; 36: 154-160
        • Takahashi H.
        • Nakao M.
        • Kikuchi Y.
        • et al.
        Intra-oral pressure-based voicing control of electrolaryngeal speech with intra-oral vibrator.
        J Voice. 2008; 22: 420-429
        • TRAI
        Consultation paper on issues related to internet telephony.
        Telecom Regulatory Authority of India. 2008;
        • Mitra D.
        Technology review: network convergence and voice over IP.
        Tata Consultancy Services. 2001; : 2
        • TEC
        Generic requirements: electronic telephone instrument.
        Telecommunication Engineering Centre, India. 2017;
        • Onishi Y.
        • Kuroda J.
        • Murata Y.
        • Komoda M.
        • Toki N.
        • Sasaki Y.
        • et al.
        The development of ultra thin speakers for mobile phones.
        Proceedings of 20th International Congress on Acoustics, ICA. 2010: 23-27
        • Coll E.C.
        Telecom 101.
        5. Teracom Training Institute, 2020
        • McClellan J.
        • Parks T.
        A personal history of the parks-mcclellan algorithm.
        IEEE Signal Process Mag. 2005; 22: 82-86
        • Loizou P.C.
        Speech quality assessment.
        Multimedia Analysis, Processing and Communications: Studies in Computational Intelligence. 2. 2011
        • Loizou P.C.
        Speech Enhancement: Theory and Practice. vol. 346. CRC Press, 2013
        • Beerends J.G.
        • Hekstra A.P.
        • Rix A.W.
        • Hollier M.P.
        Perceptual evaluation of speech quality (PESQ) the new ITU standard for end-to-end speech quality assessment Part II: Psychoacoustic model.
        J Audio Eng Soc. 2002; 50: 765-778
        • Taal C.H.
        • Hendriks R.C.
        • Heusdens R.
        • et al.
        An algorithm for intelligibility prediction of time frequency weighted noisy speech.
        IEEE Trans Audio Speech Lang Process. 2011; 19: 2125-2136
        • Taal C.H.
        • Hendriks R.C.
        • Heusdens R.
        • et al.
        A short-time objective intelligibility measure for time-frequency weighted noisy speech.
        2010 IEEE International Conference on Acoustics, Speech and Signal Processing. 2010: 4214-4217
        • Vekkot S.and Gupta D.
        Speaker-independent expressive voice synthesis using learning-based hybrid network model.
        Int J Speech Technol. 2020; 23: 597-613
        • Naderi B.
        • Cutler R.
        An open source implementation of ITU-T Recommendation P. 808 with Validation.
        Proc Interspeech. 2020; : 2862-2866
        • Fairbanks G.
        Voice and Articulation Drillbook. 2. Harper & Row, New York, 1960
        • Cox R.
        • Campos S.
        • Lamblin C.
        • Sherif M.
        ITU-T coders for wideband, superwideband, and fullband speech communication [series editorial].
        Commun Mag, IEEE. 2009; 47: 106-109
      1. IEEE recommended practice for speech quality measurements.
        IEEE Trans Audio Electroacoust. 1969; 17: 225-246
        • Petkov P.N.
        • Kleijn W.B.
        Spectral dynamics recovery for enhanced speech intelligibility in noise.
        IEEE/ACM Trans Audio Speech Lang Process. 2015; 23: 327-338
        • Chen J.
        • Wang Y.
        • Wang D.
        A feature study for classification-based speech separation at low signal-to-noise ratios.
        IEEE/ACM Trans Audio Speech Lang Process. 2014; 22: 1993-2002
        • Cooke M.
        • Mayo C.
        • Valentini-Botinhao C.
        • et al.
        Evaluating the intelligibility benefit of speech modifications in known noise conditions.
        Speech Commun. 2013; 55: 572-585
        • Bennett S.
        • Weinberg B.
        Acceptability ratings of normal, esophageal, and artificial larynx speech.
        J Speech Hearing Res. 1973; 16: 608-615
        • Watson P.J.
        • Schlauch R.S.
        Fundamental frequency variation with an electrolarynx improves speech understanding: A case study.
        . 2009; 18: 162-167
        • Nagle K.F.
        • Eadie T.L.
        • Wright D.R.
        • et al.
        Effect of fundamental frequency on judgments of electrolaryngeal speech.
        . 2012; 21: 154-166
        • Hu Y.
        • Loizou P.C.
        Evaluation of objective measures for speech enhancement.
        Proc Interspeech. 2006; : 14471450
        • Hu Y.
        • Loizou P.C.
        Evaluation of objective quality measures for speech enhancement.
        IEEE/ACM Trans Audio Speech Lang Process. 2008; 16: 229-238
        • Kondo K.
        Subjective quality measurement of speech.
        Signals and Communication Technology. Springer-Verlag Berlin Heidelberg, 2012
        • Velavan T.P.
        • Meyer C.G.
        The covid-19 epidemic.
        Tropical Med Int Health. 2020; 25: 278-280
        • Yuki K.
        • Fujiogi M.
        • Koutsogiannaki S.
        COVID-19 pathophysiology: a review.
        Clin Immunol. 2020; 215: 108427
        • Cohen D.
        Precalculus: A Problems-Oriented Approach. 6. Cengage Learning, 2004
        • Howell D.
        Statistical Methods for Psychology.
        Duxbury, 2002