Advertisement
Review Article| Volume 36, ISSUE 5, P737.e1-737.e10, September 2022

Download started.

Ok

Blood Glucose Estimation From Voice: First Review of Successes and Challenges

Open AccessPublished:October 09, 2020DOI:https://doi.org/10.1016/j.jvoice.2020.08.034

      Summary

      The possibility to estimate glucose value from voice would make a breakthrough in diabetes treatment: namely, remove the delay in the nonintrusive instantaneous blood glucose estimation, relieve medical budgets and significantly improve wellbeing of diabetics. In this review, different approaches have been described and systematized, in order to provide an objective snapshot of the state of the art. Since nonintrusive glucose estimation is notoriously difficult, we included a Transparence and Reproducibility Score aimed at revealing the biases in the primary research articles. The review is completed with the discussion on future research pathways.

      Graphical Abstract

      Key Words

      INTRODUCTION

      A number of diseases and pathological conditions are reflected in the voice due to specific temporary or static changes that occur in the speaker's speech production organs or in the brain mechanisms controlling speech. Such voice changes can be heard and from a computational perspective can form learnable acoustic patterns generalizable across speakers and even across languages. The systems to detect such patterns have been built in the past decade to detect different neurological and psychological conditions and even in a disorder-independent way. The examples include stress recognition,
      • Muaremi A.
      • Arnrich B.
      • Tröster G.
      Towards measuring stress with smartphones and wearable devices during workday and sleep.
      • Van Segbroeck M.
      • Travadi R.
      • Vaz C.
      • et al.
      Classification of Cognitive Load From Speech Using an I-Vector Framework.
      • Aguiar A.C.
      • Kaiseler M.
      • Meinedo H.
      • et al.
      Speech Stress Assessment Using Physiological and Psychological Measures.
      autism spectrum,
      • Räsänen O.
      • Pohjalainen J.
      Random Subset Feature Selection in Automatic Recognition of Developmental Disorders, Affective States, and Level of Conflict From Speech.
      • Bone D.
      • Chaspari T.
      • Audhkhasi K.
      • et al.
      Classifying Language-Related Developmental Disorders From Speech Cues: The Promise and the Potential Confounds.
      physical load,
      • Li M.
      Automatic Recognition of Speaker Physical Load Using Posterior Probability Based Features From Acoustic and Phonetic Tokens.
      Parkinson's disease,
      • Bayestehtashk A.
      • Asgari M.
      • Shafran I.
      • et al.
      Fully automated assessment of the severity of Parkinson's disease from speech.
      • Bocklet T.
      • Steidl S.
      • Nöth E.
      • et al.
      Automatic Evaluation of Parkinson's Speech-Acoustic, Prosodic and Voice Related Cues.
      • Orozco-Arroyave J.R.
      • Hönig F.
      • Arias-Londoño J.D.
      • et al.
      Automatic detection of Parkinson's disease in running speech spoken in three different languages.
      • Kim J.
      • Nasir M.
      • Gupta R.
      • et al.
      Automatic Estimation of Parkinson's Disease Severity From Diverse Speech Tasks.
      Alzheimer's disease,
      • Lopez-de-Ipiña K.
      • Alonso J.B.
      • Solé-Casals J.
      • et al.
      On automatic diagnosis of Alzheimer's disease based on spontaneous speech analysis and emotional temperature.
      and intoxication.
      • Gajšek R.
      • Mihelic F.
      • Dobrišek S.
      Speaker state recognition using an HMM-based feature extraction method.
      ,
      • Suendermann-Oeft D.
      • Ramanarayanan V.
      • Teckenbrock M.
      • et al.
      HALEF: an open-source standard-compliant telephony-based modular spoken dialog system: a review and an outlook.
      Disorder-independent diagnostic support was proposed
      • Sidorova J.
      • Carlsson S.
      • Rosander O.
      • et al.
      Towards disorder-independent automatic assessment of emotional competence in neurological patients with a classical emotion recognition system: application in foreign accent syndrome.
      for the cases, when condition-specific methods are not applicable: little data available for a new or rare disorder and/or different acoustic parameters affected from subject to subject. Formally speaking, a biomarker is “a characteristic that is objectively measured and evaluated as an indicator of normal biological processes, pathogenic processes, or pharmacologic responses to a therapeutic intervention.”
      • Colburn W.
      • DeGruttola V.G.
      • DeMets D.L.
      • et al.
      Biomarkers and surrogate endpoints: preferred definitions and conceptual framework. Biomarkers Definitions Working Group..
      A vocal biomarker takes speech as an input and evaluates the subject's speech production (quality, competence or other aspects) at either a particular time moment or as a trend through a time interval (a recent review can be found in
      • Sidorova J.
      • Anisimova M.
      Impact of diabetes mellitus on voice: a methodological commentary.
      ). Vocal biomarkers possess an attractive combination of properties: noninvasive, instantaneous, and cost-effective. Currently, in diverse fields of healthcare, there is an urgent need for such solutions, to alleviate the ever growing medical budgets, better understand the disease, and improve therapies. Nonetheless, vocal biomarkers are only making it to become one of the core diagnostic criteria in neurology and are almost unheard of in endocrinology.
      Diabetes mellitus is a chronic disease associated to one of the two mechanisms: inadequate production of insulin by pancreas (Type 1), or inadequate sensitivity of cells to the action of insulin (Type 2). Diligent and timely control of blood glucose (BG) levels is a crucial element in diabetes treatment. An instantaneous and reliable way to measure glucose value is obtaining a blood sample with a lancet and then estimating a glucose value with a blood glucometer. Many diabetics do not test their blood frequently enough because of inconvenience, lack of time, discomfort, and cost of the test strips for a glucometer. Consequently, they fail to discover the presence of the dangerous conditions of hypo- or hyperglycemia, which are too low or too high blood sugar, respectively. Among the diabetics, who do not monitor their glucose level diligently, the development of severe complications is frequent. Unobtrusive continuous blood glucose monitoring (CBGM) with implants was invented, but, unfortunately, it introduces a considerable delay in the measurements, which is long enough to trigger negative clinical consequences. Regarding truly noninvasive technologies, despite many research efforts, at the moment none of them is entirely reliable and convenient, and many noninvasive glucose estimation products were retracted. An interested reader can be referred to critical surveys of the topic, eg,
      • Lin T.
      • Gal A.
      • Mayzel Y.
      • et al.
      Non-invasive glucose monitoring: a review of challenges and recent advances.
      or
      • Bolla A.S.
      • Priefer R.
      Diabetes and metabolic syndrome.
      that are focusing on the current obstacles.
      The problem of glucose value estimation is understood as the estimation of the numeric glucose value from a speech fragment or its mapping into hypoglycemia, normal BG, or hyperglycemia. In the adjacent field of research, glucose prediction refers to the prediction of future glucose values, eg,
      • Rodbard D.
      Continuos glucose monitoring: a review of successes, challenges, and opportunities.
      therefore, to avoid ambiguity, the term BG estimation rather than prediction is more appropriate.
      In this first literature survey on BG estimation from voice, we have answered the following questions.
      • 1.
        What makes one believe that the swings in BG trigger voice changes?
      • 2.
        What are the existing approaches to BG estimation from voice? Are there any research biases?
      • 3.
        What are the definite successes in the field? What remains to be an open problem?
      The rest of the article is structured as follows. “The Search Protocol” Section describes the search strategy to find the publications on glucose estimation from voice. The sections “Why BG Affects Voice?” and “Existing Systems for Glucose Estimation From Voice” aim at creating a judgment-neutral snapshot of the state of the art. (A reader uninterested in computational details of the existing solutions can skim through the “Computational Approaches” Subsection.) The Sections “Missing Values” and “Biases in Data Collection” describe the research biases in the primary research articles and the pitfalls of data collection. The remaining sections cover Future Paths, Discussion and Conclusions.

      THE SEARCH PROTOCOL

      • 1
        To search for the potentially applicable studies, we have searched in five databases of scientific literature covering the publications up to the article submission date in 2020: Cochrane library, PubMed, Scopus, Web of Science, and Google Scholar. The logical formula was: (speech OR voice) AND (diabetes OR glucose OR sugar OR hyperglycemia OR hypoglycemia). The queries are listed in Table 1, and the query was kept as large as possible, retrieving the articles containing the keywords anywhere. In Google Scholar the search was completed only over the titles, but the formula was extended with synonyms.
        TABLE 1Queries for the Literature Search
        DatabaseQuery# Hits (Not Listed, If Found in the Previous Database)# Manually Selected as Relevant and Relevant Articles That Cite the Hit
        Cochrane Library#1 speech MeSH

        #2 glucose OR hypoglycemia OR sugar OR hypoglycemia OR diabetes OR SMBG OR self-monitoring of glucose

        #3 #1 AND #2
        223N/A
        PubMed(("speech"[MeSH Terms] OR "speech"[All Fields]) OR ("voice"[MeSH Terms] OR "voice"[All Fields])) AND (("glucose"[MeSH Terms] OR "glucose"[All Fields]) OR ("hyperglycemia"[All Fields] OR "hyperglycemia"[MeSH Terms] OR "hyperglycemia"[All Fields]) OR ("hypoglycemia"[All Fields] OR "hypoglycemia"[MeSH Terms] OR "hypoglycemia"[All Fields]) OR ("sugars"[MeSH Terms] OR "sugars"[All Fields] OR "sugar"[All Fields]) OR ("diabetes mellitus"[MeSH Terms] OR ("diabetes"[All Fields] AND "mellitus"[All Fields]) OR "diabetes mellitus"[All Fields] OR "diabetes"[All Fields] OR "diabetes insipidus"[MeSH Terms] OR ("diabetes"[All Fields] AND "insipidus"[All Fields]) OR "diabetes insipidus"[All Fields]) OR SMBG[All Fields] OR (("ego"[MeSH Terms] OR "ego"[All Fields] OR "self"[All Fields]) AND monitoring[All Fields] AND ("glucose"[MeSH Terms] OR "glucose"[All Fields])))956 (more recent than 1999)N/A
        Web of Science(speech OR voice) AND (diabetes OR glucose OR sugar OR hyperglycemia OR hypoglycemia)808
        • Chitkara D.
        • Sharma R.K.
        Voice based detection of type 2 diabetes mellitus.
        ,
        • Pyniopodjanard S.
        • Suppakitjanusant P.
        • Lomprew P.
        • et al.
        Instrumental acoustic voice characteristics in adults with type 2 diabetes.
        ,
        • Tschope C.
        • Duckhorn F.
        • Wollf M.
        • et al.
        Estimating blood sugar from voice samples: a preliminary study.
        Scopus(“speech” OR “voice”) and (“diabetes” OR “glucose” OR “sugar” OR “hyperglycemia” OR “hypoglycemia”)886 hits (more recent than 2014)
        • Ravi R.
        • Gunjawate D.
        Effect of diabetes mellitus on voice: a systematic review.
        Google Scholarallintitle: diabetes OR glucose OR hypoglycemia OR hyperglycemia OR sugar vocal OR acoustic OR perceptual OR speech OR voice; time span of 2009-2019 in articles and patents.221
        • Hamdan A.
        • Jabbour J.
        • Nassar J.
        • et al.
        Vocal characteristics in patients with type 2 diabetes mellitus.
        ,

        Y. Ulanovsky, A. Frolov, A. Kozlova, et al., “Device for blood glucose level determination”, Patent WO2014072823, 2014.

        Y. Ulanovsky, A. Frolov, A. Kozlova, “Method of non-invasive determination of glucose concentration in blood and device for the implementation of thereof”, Patent WO2014/049438.

        P. R. Michaelis, “Detection of extreme hypoglycemia and hyperglycemia based on automatic analysis of speech patterns”, US patent US 7, 925,508 B1, 2011.

        J. Rasmusson, P. Karlsson, M. Svensson, et al., “Method and device for blood glucose monitoring”, Patient EP 3 574 830 A1, 2019.

      • 2
        Once the potentially applicable studies were retrieved, the irrelevant ones were discarded based on the manual scanning of the abstracts and full texts. A conservative approach was undertaken of including and summarizing every piece of research that describes either the construction of a vocal biomarker that detects glucose swings from voice or the voice changes in response to the change in glucose concentration.
      • 3
        The cited articles and the ones that cite the relevant articles were added to the pool of potentially applicable studies.
      • 4
        The research biases in the retrieved studies were analyzed: the description of the design alternatives is detailed with a Transparency and Reliability Matrix that was built to account for missing values, and the issues of data collection are addressed.
      The search returned the works published on glucose swings reflected in voice: one journal article,
      • Motorin V.
      Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.
      a conference article,
      • Tschope C.
      • Duckhorn F.
      • Wollf M.
      • et al.
      Estimating blood sugar from voice samples: a preliminary study.
      one abstract
      • Czupryniak. L.
      • Sielska-Badurek E.
      • Niebisz A.
      • et al.
      378-P: human voice is modulated by hypoglycemia and hyperglycemia in type 1 diabetes.
      and several patents.

      Y. Ulanovsky, A. Frolov, A. Kozlova, et al., “Device for blood glucose level determination”, Patent WO2014072823, 2014.

      Y. Ulanovsky, A. Frolov, A. Kozlova, “Method of non-invasive determination of glucose concentration in blood and device for the implementation of thereof”, Patent WO2014/049438.

      P. R. Michaelis, “Detection of extreme hypoglycemia and hyperglycemia based on automatic analysis of speech patterns”, US patent US 7, 925,508 B1, 2011.

      J. Rasmusson, P. Karlsson, M. Svensson, et al., “Method and device for blood glucose monitoring”, Patient EP 3 574 830 A1, 2019.

      Three more research articles and one literature survey describe the static changes in voice attributable to diabetes complications, and/or investigate the possibilities of diabetes detection from voice.
      • Ravi R.
      • Gunjawate D.
      Effect of diabetes mellitus on voice: a systematic review.
      • Chitkara D.
      • Sharma R.K.
      Voice based detection of type 2 diabetes mellitus.
      • Pyniopodjanard S.
      • Suppakitjanusant P.
      • Lomprew P.
      • et al.
      Instrumental acoustic voice characteristics in adults with type 2 diabetes.
      • Hamdan A.
      • Jabbour J.
      • Nassar J.
      • et al.
      Vocal characteristics in patients with type 2 diabetes mellitus.
      Three recent reviews
      • Lin T.
      • Gal A.
      • Mayzel Y.
      • et al.
      Non-invasive glucose monitoring: a review of challenges and recent advances.
      • Rodbard D.
      Continuos glucose monitoring: a review of successes, challenges, and opportunities.
      ,
      • Bolla A.S.
      • Priefer R.
      Diabetes and metabolic syndrome.
      on noninvasive glucose monitoring provide a broader context for the vocal biomarker. Miscellaneous research articles were retrieved answering specific queries, when the primary sources lacked information or references. We give full consideration to primary research articles from gray literature, since conference proceedings have been the publication venues for many important technological breakthroughs in speech technology and computer science, as well as, for example, the fact that emotion recognition from voice is feasible first appeared in a patent in the late 1970s.

      WHY BG AFFECTS VOICE?

      Some studies simply claim that glucose-related changes in voice can be heard: the persons, who know the diabetic subject well can hear when they are hypoglycemic,
      • Tschope C.
      • Duckhorn F.
      • Wollf M.
      • et al.
      Estimating blood sugar from voice samples: a preliminary study.
      and similarly according to
      • Czupryniak. L.
      • Sielska-Badurek E.
      • Niebisz A.
      • et al.
      378-P: human voice is modulated by hypoglycemia and hyperglycemia in type 1 diabetes.
      it is common knowledge that hypoglycemia is perceivable. Several studies provided scientific grounds to the not-obviously true statement that glucose swings are reflected in the voice. There are two explanations regarding the underlying mechanisms of voice changes as a response to BG shifts. One group of authors says that
      • the change in glucose level in the blood flowing in the larynx and the cords causes changes in the elastic properties of the biological tissue of these organs, which in turn results in the changes of spectral characteristics in compliance to the Hooke's law of physics.

        Y. Ulanovsky, A. Frolov, A. Kozlova, et al., “Device for blood glucose level determination”, Patent WO2014072823, 2014.

      The second explanation is emotion related

      P. R. Michaelis, “Detection of extreme hypoglycemia and hyperglycemia based on automatic analysis of speech patterns”, US patent US 7, 925,508 B1, 2011.

      :
      • hypoglycemia is often accompanied by a feeling of anxiety, which causes people to speak faster and with greater urgency, whereas hyperglycemia on the contrary is often accompanied by feelings of lethargy thereby causing speech patterns to be slower or slurred.
      Although no supporting references were provided, our investigation confirmed that both assumptions have scientific grounds. The subjective feelings by diabetics in the state of hypo- and hyperglycemia were reported in the 1990s, for example,
      • Weiniger K.
      • Jacobson A.M.
      • Draelos M.T.
      • et al.
      Blood glucose estimation and symptoms during hyperglycemia and hypoglycemia in patients with insulin-dependent diabetes mellitus.
      and the similarity of symptoms in anxiety and hypoglycemia were described in.
      • Polonsky W.
      • Davis C.
      • Jacobson A.
      • et al.
      Correlates of hypoglycemic fear in type I and type II diabetes mellitus.
      No related work regarding the other explanation about the change in elasticity of the larynx and cord was found by Google Scholar in English: (elasticity OR Hooke's law OR cord OR larynx) AND (hypoglycemia OR hyperglycemia OR glucose), but the search returned an article in Russian,
      • Motorin V.
      Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.
      where it is stated that it was demonstrated experimentally that with the increase of BG the elasticity of those muscles decreases.

      EXISTING SYSTEMS FOR GLUCOSE ESTIMATION FROM VOICE

      The study designs differ according to the initial assumption why BG affects voice resulting in five approaches coming from different research communities, and generally, since the problem is new and the alternative system designs have not been compared yet, a plephora of approaches have been tried, similar to the early days of speech emotion recognition
      • Sidorova J
      • McDonough J
      • Badia T
      Automatic Recognition of Emotive Voice and Speech.
      • Sidorova J
      . Below the existing systems are classified with respect to different options chosen in their respective system designs.

      Ground truth

      The BG value to estimate, called the ground truth, was of the two types:
      • numeric,
        • Tschope C.
        • Duckhorn F.
        • Wollf M.
        • et al.
        Estimating blood sugar from voice samples: a preliminary study.
        ,
        • Motorin V.
        Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.
        or
      • categorical.
        • Czupryniak. L.
        • Sielska-Badurek E.
        • Niebisz A.
        • et al.
        378-P: human voice is modulated by hypoglycemia and hyperglycemia in type 1 diabetes.
        ,

        Y. Ulanovsky, A. Frolov, A. Kozlova, et al., “Device for blood glucose level determination”, Patent WO2014072823, 2014.

        Y. Ulanovsky, A. Frolov, A. Kozlova, “Method of non-invasive determination of glucose concentration in blood and device for the implementation of thereof”, Patent WO2014/049438.

        P. R. Michaelis, “Detection of extreme hypoglycemia and hyperglycemia based on automatic analysis of speech patterns”, US patent US 7, 925,508 B1, 2011.

      Table 2 summarizes the literature with regard to different ground truth options.
      TABLE 2Ground Truth
      StudyGround TruthBlood Reading or CBGM
      Motorin, 2017
      • Motorin V.
      Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.
      NumericBG
      Ulanovsky et al, 2009

      Y. Ulanovsky, A. Frolov, A. Kozlova, et al., “Device for blood glucose level determination”, Patent WO2014072823, 2014.

      Y. Ulanovsky, A. Frolov, A. Kozlova, “Method of non-invasive determination of glucose concentration in blood and device for the implementation of thereof”, Patent WO2014/049438.

      Low: <3.5 mmol/L, Norm: 3.5-6.0 mmol/L,

      High: >6 mmol/L
      BG
      Michaelis, 2014

      P. R. Michaelis, “Detection of extreme hypoglycemia and hyperglycemia based on automatic analysis of speech patterns”, US patent US 7, 925,508 B1, 2011.

      Mild and extreme hypo- and hyperglycemiaBG
      Rasmusson et al, 2019

      J. Rasmusson, P. Karlsson, M. Svensson, et al., “Method and device for blood glucose monitoring”, Patient EP 3 574 830 A1, 2019.

      Numeric values are impliedCBGM
      Tschope et al, 2015
      • Tschope C.
      • Duckhorn F.
      • Wollf M.
      • et al.
      Estimating blood sugar from voice samples: a preliminary study.
      NumericBG
      Czupryniak et al, 2019
      • Czupryniak. L.
      • Sielska-Badurek E.
      • Niebisz A.
      • et al.
      378-P: human voice is modulated by hypoglycemia and hyperglycemia in type 1 diabetes.
      Low: <70 mg/dL (hypoglycemia),

      Norm: 70-200 mg/dL,

      High: >200 mg/dL (extreme hyperglycemia)
      CGBM
      Depending on the country, BG values are measured in mg/dL or mmol/dL, and there exist online calculators to convert between the units, eg,

      The converter among blood glucose units in different system. Available at:http://www.unit-conversion.info/blood-sugar.html. Accessed January 22, 2020.

      hypoglycemia (dangerously low blood sugar) is defined as <70 mg/dL or <3.8 mmol/L, and hyperglycemia (dangerously high blood sugar) is defined as >180 mg/dL or >10 mmol/dL.
      • Polonsky W.
      • Davis C.
      • Jacobson A.
      • et al.
      Correlates of hypoglycemic fear in type I and type II diabetes mellitus.
      Accordingly, numeric BG values are binned into low (hypoglycemia), normal, or high (hyperglycemia). In
      • Czupryniak. L.
      • Sielska-Badurek E.
      • Niebisz A.
      • et al.
      378-P: human voice is modulated by hypoglycemia and hyperglycemia in type 1 diabetes.
      the values >200 mg/dL are taken for hyperglycemia, which is higher than in all the other referenced articles, and in

      P. R. Michaelis, “Detection of extreme hypoglycemia and hyperglycemia based on automatic analysis of speech patterns”, US patent US 7, 925,508 B1, 2011.

      extreme hypo- and hyperglycemia was also mentioned as recognition classes.
      As far as the ways of obtaining BG measurements are concerned, the options are either
      • BG readings,
        • Tschope C.
        • Duckhorn F.
        • Wollf M.
        • et al.
        Estimating blood sugar from voice samples: a preliminary study.
        • Motorin V.
        Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.

        Y. Ulanovsky, A. Frolov, A. Kozlova, et al., “Device for blood glucose level determination”, Patent WO2014072823, 2014.

        Y. Ulanovsky, A. Frolov, A. Kozlova, “Method of non-invasive determination of glucose concentration in blood and device for the implementation of thereof”, Patent WO2014/049438.

        or
      • CBGM.
        • Czupryniak. L.
        • Sielska-Badurek E.
        • Niebisz A.
        • et al.
        378-P: human voice is modulated by hypoglycemia and hyperglycemia in type 1 diabetes.
        ,

        J. Rasmusson, P. Karlsson, M. Svensson, et al., “Method and device for blood glucose monitoring”, Patient EP 3 574 830 A1, 2019.

      BG readings are obtained from a drop of blood by means of a capillary finger-pricks with a lancet and then BG value is measured with a BG meter. In the case of CBGM, the BG value is obtained from a needle type sensor that is inserted into the subcutaneous system. The biases of ground truth values obtained with CBGM are covered in the “Biases of Data Collection” Section.

      Patient groups

      The studies were either clinical or involved volunteers (Table 3). The number of subjects (Table 6) was typically been below ten, except for a large-scale study with 7,000 subjects in.
      • Motorin V.
      Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.
      Some studies were carried out exclusively for T1D,
      • Czupryniak. L.
      • Sielska-Badurek E.
      • Niebisz A.
      • et al.
      378-P: human voice is modulated by hypoglycemia and hyperglycemia in type 1 diabetes.
      ,
      • Tschope C.
      • Duckhorn F.
      • Wollf M.
      • et al.
      Estimating blood sugar from voice samples: a preliminary study.
      while in others there was no special distinction made between T1D and T2D.
      • Motorin V.
      Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.

      Y. Ulanovsky, A. Frolov, A. Kozlova, et al., “Device for blood glucose level determination”, Patent WO2014072823, 2014.

      Y. Ulanovsky, A. Frolov, A. Kozlova, “Method of non-invasive determination of glucose concentration in blood and device for the implementation of thereof”, Patent WO2014/049438.

      P. R. Michaelis, “Detection of extreme hypoglycemia and hyperglycemia based on automatic analysis of speech patterns”, US patent US 7, 925,508 B1, 2011.

      TABLE 3Diabetes-related Aspects
      StudyClinical StudyT1D or T2D
      Motorin, 2017
      • Motorin V.
      Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.
      YesBoth implied
      Ulanovsky et al, 2009

      Y. Ulanovsky, A. Frolov, A. Kozlova, et al., “Device for blood glucose level determination”, Patent WO2014072823, 2014.

      Y. Ulanovsky, A. Frolov, A. Kozlova, “Method of non-invasive determination of glucose concentration in blood and device for the implementation of thereof”, Patent WO2014/049438.

      UnspecifiedBoth
      Michaelis, 2014

      P. R. Michaelis, “Detection of extreme hypoglycemia and hyperglycemia based on automatic analysis of speech patterns”, US patent US 7, 925,508 B1, 2011.

      UnspecifiedUnspecified
      Rasmusson et al, 2019

      J. Rasmusson, P. Karlsson, M. Svensson, et al., “Method and device for blood glucose monitoring”, Patient EP 3 574 830 A1, 2019.

      UnspecifiedBoth
      Tschope et al, 2015
      • Tschope C.
      • Duckhorn F.
      • Wollf M.
      • et al.
      Estimating blood sugar from voice samples: a preliminary study.
      No, volunteersT1D
      Czupryniak et al, 2019
      • Czupryniak. L.
      • Sielska-Badurek E.
      • Niebisz A.
      • et al.
      378-P: human voice is modulated by hypoglycemia and hyperglycemia in type 1 diabetes.
      UnspecifiedT1D
      In contrast to the tradition of voice pathology research, where the speakers are excluded that have any other conditions, which are known to affect speech production organs or the neurological mechanisms related to speech,
      • Pyniopodjanard S.
      • Suppakitjanusant P.
      • Lomprew P.
      • et al.
      Instrumental acoustic voice characteristics in adults with type 2 diabetes.
      ,
      • Hamdan A.
      • Jabbour J.
      • Nassar J.
      • et al.
      Vocal characteristics in patients with type 2 diabetes mellitus.
      the studies on glucose detection do not discuss and presumably do not have exclusion criteria. Only ref.
      • Czupryniak. L.
      • Sielska-Badurek E.
      • Niebisz A.
      • et al.
      378-P: human voice is modulated by hypoglycemia and hyperglycemia in type 1 diabetes.
      says that they excluded (only) smokers. It should be added that this study comes from the voice pathology research and concerns with the description of the patterns in the whole population rather than their speaker-dependent detection. Likely, this disregard of the exclusion criteria is by design, since the studies aim at building a speaker-dependent predictive model relying on the features responsible for the BG swings and assuming that the distributions for a speaker are stationary, even though if permanently distorted by some pathology. In such a case, temporary conditions, such as flue or intoxication, are of relevance rather than static deformations of voice patterns due to chronic neurological or complications of diabetes.

      Speech corpora

      The type of speech samples (Table 4) used were:
      • one phoneme /a/,
        • Czupryniak. L.
        • Sielska-Badurek E.
        • Niebisz A.
        • et al.
        378-P: human voice is modulated by hypoglycemia and hyperglycemia in type 1 diabetes.
      which comes from the tradition of the statical approach in vocal biomarkers,
      • Sidorova J.
      • Anisimova M.
      Impact of diabetes mellitus on voice: a methodological commentary.
      • matched sentences for the ease of comparison,
        • Tschope C.
        • Duckhorn F.
        • Wollf M.
        • et al.
        Estimating blood sugar from voice samples: a preliminary study.
      in the tradition of early works on emotion recognition, e.g.,
      • Sidorova J.
      Speech emotion recognition with TGI+.2 classifier.
      ,
      • Sidorova J.
      • Badia T.
      ESEDA: tool for enhanced speech emotion detection and analysis.
      and
      • any fragment of speech
        • Motorin V.
        Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.

        Y. Ulanovsky, A. Frolov, A. Kozlova, et al., “Device for blood glucose level determination”, Patent WO2014072823, 2014.

        Y. Ulanovsky, A. Frolov, A. Kozlova, “Method of non-invasive determination of glucose concentration in blood and device for the implementation of thereof”, Patent WO2014/049438.

        P. R. Michaelis, “Detection of extreme hypoglycemia and hyperglycemia based on automatic analysis of speech patterns”, US patent US 7, 925,508 B1, 2011.

      in the tradition of modern emotion recognition done for free speech
      • Sidorova J.
      • Badia T.
      Syntactic learning for ESEDA.1, tool for enhanced speech emotion detection and analysis.
      .
      TABLE 4Type of Voice Data
      StudySpeech UnitRecording Device
      Motorin, 2017
      • Motorin V.
      Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.
      Fragments of speech, where the vowels are more frequentSmartphone
      Ulanovsky et al, 2009

      Y. Ulanovsky, A. Frolov, A. Kozlova, et al., “Device for blood glucose level determination”, Patent WO2014072823, 2014.

      Y. Ulanovsky, A. Frolov, A. Kozlova, “Method of non-invasive determination of glucose concentration in blood and device for the implementation of thereof”, Patent WO2014/049438.

      Free speech mobile phoneSmartphone
      Michaelis, 2014

      P. R. Michaelis, “Detection of extreme hypoglycemia and hyperglycemia based on automatic analysis of speech patterns”, US patent US 7, 925,508 B1, 2011.

      Any fragment of free speechUnspecified
      Rasmusson et al, 2019

      J. Rasmusson, P. Karlsson, M. Svensson, et al., “Method and device for blood glucose monitoring”, Patient EP 3 574 830 A1, 2019.

      UnspecifiedUnspecified
      Tschope et al, 2015
      • Tschope C.
      • Duckhorn F.
      • Wollf M.
      • et al.
      Estimating blood sugar from voice samples: a preliminary study.
      Matching sentences in GermanUnspecified
      Czupryniak et al, 2019
      • Czupryniak. L.
      • Sielska-Badurek E.
      • Niebisz A.
      • et al.
      378-P: human voice is modulated by hypoglycemia and hyperglycemia in type 1 diabetes.
      Vowel /a/ for 3 secondsProfessional voice recorder
      The speech was either recorded
      • in a phonetic lab
        • Czupryniak. L.
        • Sielska-Badurek E.
        • Niebisz A.
        • et al.
        378-P: human voice is modulated by hypoglycemia and hyperglycemia in type 1 diabetes.
        ,

        Y. Ulanovsky, A. Frolov, A. Kozlova, et al., “Device for blood glucose level determination”, Patent WO2014072823, 2014.

        ,

        Y. Ulanovsky, A. Frolov, A. Kozlova, “Method of non-invasive determination of glucose concentration in blood and device for the implementation of thereof”, Patent WO2014/049438.

        or
      • via a smartphone.
        • Motorin V.
        Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.
      To be able to continuously monitor BG from speech in real-life applications, it is necessary to do recording with a smartphone or a wearable device.

      Computational approaches

      The study designs differ according to the initial assumption why BG affects voice (will be further detailed in Table 7) resulting in five approaches coming from different research communities. Below the existing systems are catalogued with respect to different options chosen in their respective system designs. The acoustic parameters and the computational analysis are further detailed in Tables 5 and 7, respectively.
      TABLE 5Acoustic Parameters for Analysis
      StudyVoice Parameters
      Motorin, 2017
      • Motorin V.
      Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.
      From the Fourier spectrum of the voice, ie, values in the coordinates of intensity vs frequency, the coefficients are determined for the solution of equations describing a physical system.
      Ulanovsky et al, 2009

      Y. Ulanovsky, A. Frolov, A. Kozlova, et al., “Device for blood glucose level determination”, Patent WO2014072823, 2014.

      Y. Ulanovsky, A. Frolov, A. Kozlova, “Method of non-invasive determination of glucose concentration in blood and device for the implementation of thereof”, Patent WO2014/049438.

      The Fourier spectrum. (The voice is transformed into spectrum, sound spectrum peaks in the areas of low (100-1,500 Hz) and high (7k-10k) frequencies are sampled, intensities of the selected peaks are determined by frequency, a ratio between the peaks of the selected low and high frequencies is obtained.)
      Michaelis, 2014

      P. R. Michaelis, “Detection of extreme hypoglycemia and hyperglycemia based on automatic analysis of speech patterns”, US patent US 7, 925,508 B1, 2011.

      Short-term features from formants, pitch, articulation rate (eg, number transitions between voiced and unvoiced sounds), intensity, number speech errors, response time, nonfluency, speech quality, “S” sounds are shifted to “SH”, “R” goes to “L”, “EZ” goes to “ES”, delayed responsiveness.
      Rasmusson et al, 2019

      J. Rasmusson, P. Karlsson, M. Svensson, et al., “Method and device for blood glucose monitoring”, Patient EP 3 574 830 A1, 2019.

      Features include frequency patterns and amplitude patterns in speech spectrum.
      Tschope et al, 2015
      • Tschope C.
      • Duckhorn F.
      • Wollf M.
      • et al.
      Estimating blood sugar from voice samples: a preliminary study.
      openSMILES extractor, 2,375 features.
      Czupryniak et al, 2019
      • Czupryniak. L.
      • Sielska-Badurek E.
      • Niebisz A.
      • et al.
      378-P: human voice is modulated by hypoglycemia and hyperglycemia in type 1 diabetes.
      Number of fundamental periods, time of fundamental periods, fundamental frequency, energy, amplitude of fundamental frequency, indicator of voiced probability, simple voice quality, relative average perturbation, shimmer, amplitude perturbation quotient, F1-F4 frequencies, harmonic perturbation quotients, residual to harmonic ratio, unharmonic to harmonic ratio, subharmonic to harmonic ratio, noise to harmonic ratio, F1 to F4 harmonic to all energy ratio.
      TABLE 6Transparency and Generalizability
      StudyNumber SubjectsExperimentsHow Generic the Method Is
      Motorin, 2017
      • Motorin V.
      Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.
      7,000UnspecifiedSpeaker-dependent
      Ulanovsky et al, 2009

      Y. Ulanovsky, A. Frolov, A. Kozlova, et al., “Device for blood glucose level determination”, Patent WO2014072823, 2014.

      Y. Ulanovsky, A. Frolov, A. Kozlova, “Method of non-invasive determination of glucose concentration in blood and device for the implementation of thereof”, Patent WO2014/049438.

      Five subjects T1D, two subjects T2D, three healthySome testing examples providedSpeaker-dependent
      Michaelis, 2014

      P. R. Michaelis, “Detection of extreme hypoglycemia and hyperglycemia based on automatic analysis of speech patterns”, US patent US 7, 925,508 B1, 2011.

      UnspecifiedUnspecifiedSpeaker-dependent
      Rasmusson et al, 2019

      J. Rasmusson, P. Karlsson, M. Svensson, et al., “Method and device for blood glucose monitoring”, Patient EP 3 574 830 A1, 2019.

      UnspecifiedUnspecifiedUnspecified
      Tschope et al, 2015
      • Tschope C.
      • Duckhorn F.
      • Wollf M.
      • et al.
      Estimating blood sugar from voice samples: a preliminary study.
      TwoYesTwo persons
      Czupryniak et al, 2019
      • Czupryniak. L.
      • Sielska-Badurek E.
      • Niebisz A.
      • et al.
      378-P: human voice is modulated by hypoglycemia and hyperglycemia in type 1 diabetes.
      4 men and 5 womenUnspecifiedWithin a gender group, but limited to 4-5 speakers
      TABLE 7Computational Approach and Results
      StudyComputational ApproachConclusion
      Tschope et al, 2015
      • Tschope C.
      • Duckhorn F.
      • Wollf M.
      • et al.
      Estimating blood sugar from voice samples: a preliminary study.
      R2 in linear regressionNot a random relation was detected between glucose and a set of voice features
      Czupryniak et al, 2019
      • Czupryniak. L.
      • Sielska-Badurek E.
      • Niebisz A.
      • et al.
      378-P: human voice is modulated by hypoglycemia and hyperglycemia in type 1 diabetes.
      Multivariate statistical comparisonThe values of those acoustic parameters are significantly altered for hypoglycemia and extreme hyperglycemia.
      Ulanovsky et al, 2009

      Y. Ulanovsky, A. Frolov, A. Kozlova, et al., “Device for blood glucose level determination”, Patent WO2014072823, 2014.

      Y. Ulanovsky, A. Frolov, A. Kozlova, “Method of non-invasive determination of glucose concentration in blood and device for the implementation of thereof”, Patent WO2014/049438.

      Generalized statistically average functional dependenciesIndividual examples on which the method works are given
      Michaelis, 2014

      P. R. Michaelis, “Detection of extreme hypoglycemia and hyperglycemia based on automatic analysis of speech patterns”, US patent US 7, 925,508 B1, 2011.

      Conditional probabilities, Hidden Markov ModelsHypo- and hyperglycemia detection in individual speakers
      Rasmusson et al, 2019

      J. Rasmusson, P. Karlsson, M. Svensson, et al., “Method and device for blood glucose monitoring”, Patient EP 3 574 830 A1, 2019.

      UnspecifiedIt is taken for granted that glucose can be accurately approximated from voice, the emphasis is laid on its further uses.
      Motorin, 2017
      • Motorin V.
      Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.
      Not probabilistic, no pattern recognition, a system of differential equations instead98% accurate glucose estimation in 7,000 patients
      In ref.,

      P. R. Michaelis, “Detection of extreme hypoglycemia and hyperglycemia based on automatic analysis of speech patterns”, US patent US 7, 925,508 B1, 2011.

      the emotion related mechanism was assumed and, the pipeline developed for emotion recognition implemented:
      • Approach I: Hidden Markov Models working on window-based features.
      This architecture is one of the alternatives for emotion detection, eg.
      • Nogueiras A.
      • Moreno A.
      • Bonafonte A.
      • et al.
      Speech emotion recognition using hidden Markov models.
      In ref.,
      • Tschope C.
      • Duckhorn F.
      • Wollf M.
      • et al.
      Estimating blood sugar from voice samples: a preliminary study.
      the mechanism behind voice changes is unknown, and the generic analysis pipeline for the detection of a variety of cognitive and physiological states
      • Muaremi A.
      • Arnrich B.
      • Tröster G.
      Towards measuring stress with smartphones and wearable devices during workday and sleep.
      • Van Segbroeck M.
      • Travadi R.
      • Vaz C.
      • et al.
      Classification of Cognitive Load From Speech Using an I-Vector Framework.
      • Aguiar A.C.
      • Kaiseler M.
      • Meinedo H.
      • et al.
      Speech Stress Assessment Using Physiological and Psychological Measures.
      • Räsänen O.
      • Pohjalainen J.
      Random Subset Feature Selection in Automatic Recognition of Developmental Disorders, Affective States, and Level of Conflict From Speech.
      • Bone D.
      • Chaspari T.
      • Audhkhasi K.
      • et al.
      Classifying Language-Related Developmental Disorders From Speech Cues: The Promise and the Potential Confounds.
      • Li M.
      Automatic Recognition of Speaker Physical Load Using Posterior Probability Based Features From Acoustic and Phonetic Tokens.
      • Bayestehtashk A.
      • Asgari M.
      • Shafran I.
      • et al.
      Fully automated assessment of the severity of Parkinson's disease from speech.
      • Bocklet T.
      • Steidl S.
      • Nöth E.
      • et al.
      Automatic Evaluation of Parkinson's Speech-Acoustic, Prosodic and Voice Related Cues.
      • Orozco-Arroyave J.R.
      • Hönig F.
      • Arias-Londoño J.D.
      • et al.
      Automatic detection of Parkinson's disease in running speech spoken in three different languages.
      • Kim J.
      • Nasir M.
      • Gupta R.
      • et al.
      Automatic Estimation of Parkinson's Disease Severity From Diverse Speech Tasks.
      implemented:
      • Approach II: a classification with a large set of global features for emotion recognition.
      The term global means that they were estimated over the whole phrase, not in the window-based fashion
      • Sidorova J
      Optimization techniques for speech emotion recognition.
      as in Approach I.
      Another way is to tackle the problem is the tradition of medical research:
      • Approach III: a small set of features and statistical tests to check the significance of the found changes.
        • Czupryniak. L.
        • Sielska-Badurek E.
        • Niebisz A.
        • et al.
        378-P: human voice is modulated by hypoglycemia and hyperglycemia in type 1 diabetes.
      The seventeen acoustic parameters (as well as the voice database) are similar to the methodology nicknamed as the “gold standard”
      • Pyniopodjanard S.
      • Suppakitjanusant P.
      • Lomprew P.
      • et al.
      Instrumental acoustic voice characteristics in adults with type 2 diabetes.
      for acoustic analysis of voice pathology that was also used to describe the static changes DM causes on voice.
      • Chitkara D.
      • Sharma R.K.
      Voice based detection of type 2 diabetes mellitus.
      • Pyniopodjanard S.
      • Suppakitjanusant P.
      • Lomprew P.
      • et al.
      Instrumental acoustic voice characteristics in adults with type 2 diabetes.
      • Hamdan A.
      • Jabbour J.
      • Nassar J.
      • et al.
      Vocal characteristics in patients with type 2 diabetes mellitus.
      Recently, this methodology was criticized in ref.
      • Sidorova J.
      • Anisimova M.
      Impact of diabetes mellitus on voice: a methodological commentary.
      In
      • Motorin V.
      Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.

      Y. Ulanovsky, A. Frolov, A. Kozlova, et al., “Device for blood glucose level determination”, Patent WO2014072823, 2014.

      Y. Ulanovsky, A. Frolov, A. Kozlova, “Method of non-invasive determination of glucose concentration in blood and device for the implementation of thereof”, Patent WO2014/049438.

      the elasticity coefficient changing in response to the glucose concentration in blood is thought to be the underlying reason of voice changes, and an approach from computational physics was followed
      • Approach IV: the spectral features based on the Fourier transform with the classification implementing a statistically averaged functional dependency.

        Y. Ulanovsky, A. Frolov, A. Kozlova, et al., “Device for blood glucose level determination”, Patent WO2014072823, 2014.

        Y. Ulanovsky, A. Frolov, A. Kozlova, “Method of non-invasive determination of glucose concentration in blood and device for the implementation of thereof”, Patent WO2014/049438.

      In a more recent work, statistical approximation was replaced with an analytical solution.
      • Approach V: a set of differential equations was proposed describing a physical model of the speech organs, which links the BG concentration and the coefficients from the Fourier transform.
        • Motorin V.
        Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.

      Experimental results

      As far as the results are concerned, all the studies report positive results and either
      • confirmed that there are nonrandom fluctuations in voice in response to glucose swings,
        • Czupryniak. L.
        • Sielska-Badurek E.
        • Niebisz A.
        • et al.
        378-P: human voice is modulated by hypoglycemia and hyperglycemia in type 1 diabetes.
        ,
        • Tschope C.
        • Duckhorn F.
        • Wollf M.
        • et al.
        Estimating blood sugar from voice samples: a preliminary study.
        or
      • achieved glucose estimation from voice.
        • Motorin V.
        Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.

        Y. Ulanovsky, A. Frolov, A. Kozlova, et al., “Device for blood glucose level determination”, Patent WO2014072823, 2014.

        Y. Ulanovsky, A. Frolov, A. Kozlova, “Method of non-invasive determination of glucose concentration in blood and device for the implementation of thereof”, Patent WO2014/049438.

        P. R. Michaelis, “Detection of extreme hypoglycemia and hyperglycemia based on automatic analysis of speech patterns”, US patent US 7, 925,508 B1, 2011.

        J. Rasmusson, P. Karlsson, M. Svensson, et al., “Method and device for blood glucose monitoring”, Patient EP 3 574 830 A1, 2019.

      In ref.,
      • Tschope C.
      • Duckhorn F.
      • Wollf M.
      • et al.
      Estimating blood sugar from voice samples: a preliminary study.
      via a linear regression analysis it was concluded that the relation between the BG and voice parameters is not random. Similarly, in ref.,
      • Czupryniak. L.
      • Sielska-Badurek E.
      • Niebisz A.
      • et al.
      378-P: human voice is modulated by hypoglycemia and hyperglycemia in type 1 diabetes.
      via a statistical analysis of acoustic features, it was concluded that acoustic parameters in hypo- and hyperglycemia patterns in voice were significantly altered. Predictive models that would work on new samples were reported in.
      • Motorin V.
      Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.

      Y. Ulanovsky, A. Frolov, A. Kozlova, et al., “Device for blood glucose level determination”, Patent WO2014072823, 2014.

      Y. Ulanovsky, A. Frolov, A. Kozlova, “Method of non-invasive determination of glucose concentration in blood and device for the implementation of thereof”, Patent WO2014/049438.

      P. R. Michaelis, “Detection of extreme hypoglycemia and hyperglycemia based on automatic analysis of speech patterns”, US patent US 7, 925,508 B1, 2011.

      J. Rasmusson, P. Karlsson, M. Svensson, et al., “Method and device for blood glucose monitoring”, Patient EP 3 574 830 A1, 2019.

      All the studies except

      Y. Ulanovsky, A. Frolov, A. Kozlova, “Method of non-invasive determination of glucose concentration in blood and device for the implementation of thereof”, Patent WO2014/049438.

      state that only speaker-dependent estimation was possible. The strongest result was reported in
      • Motorin V.
      Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.
      as a result of a large-scale study resulting in 98% accuracy for 7,000 subjects.

      MISSING VALUES

      The Transparency and Reproducibility matrix (Table 8) was built from the evidence tables (TABLE 2, TABLE 3, TABLE 4, TABLE 5, TABLE 6, TABLE 7). This matrix permits to give a different weight of consideration to the information reported in different sources. The idea of it is that, if some aspect is lacking description (such as type of diabetes, speech unit, way of recording speech material, experimental details, speaker dependency or independency of the model, or computational approach), "missing" was placed in the corresponding cell.
      TABLE 8The Transparency and Reproducibility Matrix
      StudyTID or TIDSpeech UnitLab or MobileExperimentsSpeaker-independencyComp. Appr.
      Tschope et al 2015
      • Tschope C.
      • Duckhorn F.
      • Wollf M.
      • et al.
      Estimating blood sugar from voice samples: a preliminary study.
      missing
      Czupryniak et al, 2019
      • Czupryniak. L.
      • Sielska-Badurek E.
      • Niebisz A.
      • et al.
      378-P: human voice is modulated by hypoglycemia and hyperglycemia in type 1 diabetes.
      missing
      Ulanovsky et al, 2009

      Y. Ulanovsky, A. Frolov, A. Kozlova, et al., “Device for blood glucose level determination”, Patent WO2014072823, 2014.

      Y. Ulanovsky, A. Frolov, A. Kozlova, “Method of non-invasive determination of glucose concentration in blood and device for the implementation of thereof”, Patent WO2014/049438.

      Michaelis, 2014

      P. R. Michaelis, “Detection of extreme hypoglycemia and hyperglycemia based on automatic analysis of speech patterns”, US patent US 7, 925,508 B1, 2011.

      missingmissingmissing
      Rasmusson et al, 2019

      J. Rasmusson, P. Karlsson, M. Svensson, et al., “Method and device for blood glucose monitoring”, Patient EP 3 574 830 A1, 2019.

      missingmissingmissingmissingmissing
      Motorin, 2017
      • Motorin V.
      Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.
      missingmissing
      The most significant advancement (a definite solution to the problem of glucose detection from voice clues) was reported by Motorin.
      • Motorin V.
      Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.
      This approach is radically different to the rest of the literature, since it models the speech tract with a set of differential equations in place of statistics or machine learning. The problem that
      • Motorin V.
      Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.
      explicitly states that the method has a know-how and lacks a detailed experimental section. The other path is a a better understood strategy and, for example, with deep learning should give an accurate approximation (discussed in the “Future Work” Subsection).

      BIASES OF DATA COLLECTION

      There are application-specific issues in data collection that, unless properly addressed, can undermine the conclusions of the study.

      CBGM values need to be correctly retrofitted

      The options of BG measurement are either obtaining it from a drop of blood by means of a capillary finger-pricks with a lancet and then estimating a glucose value with a BG meter (BG readings) as was done in
      • Tschope C.
      • Duckhorn F.
      • Wollf M.
      • et al.
      Estimating blood sugar from voice samples: a preliminary study.
      • Motorin V.
      Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.

      Y. Ulanovsky, A. Frolov, A. Kozlova, et al., “Device for blood glucose level determination”, Patent WO2014072823, 2014.

      Y. Ulanovsky, A. Frolov, A. Kozlova, “Method of non-invasive determination of glucose concentration in blood and device for the implementation of thereof”, Patent WO2014/049438.

      or using continuous glucose monitoring (CGM) sensors,
      • Czupryniak. L.
      • Sielska-Badurek E.
      • Niebisz A.
      • et al.
      378-P: human voice is modulated by hypoglycemia and hyperglycemia in type 1 diabetes.
      Table 2. The first option is an instantaneous and reliable way to the measure glucose value. As for the second option, currently available CGM systems have needle-type sensors that are inserted in the subcutaneous system and remain there for several days. Unfortunately, it is not trivial to get the values of the glucose concentration from a CGM trace, and a retrospective “retrofitting” algorithm is needed. CGM data are usually noisy and characterized with a significant bias.

      C. Cobelli, S. Del Favero, A. Facchinetti, et al., “Retrospective retrofitting method to generate a continuous glucose concentration profile by exploiting continuous glucose monitoring sensor data and blood glucose measurements”, patent US 2019/0223807.

      CGM readings are affected by a physiological and a technical time delay, when compared to BG readings. The accuracy of CGM systems depend on several factors, one of which is the rate of change in BG concentrations. Inaccuracy at rapidly changing BG concentrations is, in part, caused by the fact that CGM systems measure glucose values in the interstitial tissue, not on capillary blood. A physiological time delay (of less than 10 minutes) is thus expected under such conditions. Moreover, it appears that physiological delay can work both ways: with BG changes preceding interstitial glucose changes as well as interstitial glucose changes preceding BG changes.
      • Pleus S.
      • Schoemaker M.
      • Morgenstern K.
      • et al.
      Rate-of-change dependence of the performance of two CGM systems during induced glucose swings.
      Further time delay is introduced through the sensor architecture, since glucose has to pass through the membranes surrounding the electrodes, and more delay is added, which is equal to the running time of the algorithm of the CGM system. Based on the above, we recommend that in the data collection protocol, it should be explicitly explained that the values in the presence of rapid glucose fluctuations were not taken and how other CGM issues were accounted for.

      Subjects must be blinded to their glucose value

      In the literature, the speech sample and the BG were said to be simultaneous. Indeed, technically they have the same time stamp, but that would be a nontrivial multitasking, and we assume that reading a sentence precedes prickling the finger with a lancet or vice versa. The order is of significance. Suppose the subject first takes the glucose value, introduces the numeric value in the mobile app and then reads a prompted sentence. Then, there is a danger that the resulting voice pattern of hypoglycemia is overlaid with the pattern of emotion: the subject receives bad news about a much feared condition of hypoglycemia or annoying hyperglycemia (with a need to take an insulin shot, which is obtrusive and possibly expensive). The voice patterns for the normal BG values can be mixed with relief. The resulting system can detect the affect induced by the faulty data collection design, instead of the BG.
      Another connected problem stemming from the fact that the subjects were not blinded to their glucose value is a possible involuntary manipulation of the voice data, especially by the diabetic colleagues, who volunteered for the study and were familiar the working hypothesis of generalizable voice patterns in hypo- and hyperglycemia.
      In all the referenced studies with BG a necessary detail of the protocol is missing, mining endocrinology research we found it expressed as follows: “The subjects were blinded to their actual glucose value”.
      • Weiniger K.
      • Jacobson A.M.
      • Draelos M.T.
      • et al.
      Blood glucose estimation and symptoms during hyperglycemia and hypoglycemia in patients with insulin-dependent diabetes mellitus.

      FUTURE PATHS

      Special success/error measure

      The research on BG estimation has its specific error measure: Parkes Error (PE). Since the clinical consequence of any particular error depends on the absolute value of both predicted and actual values and not on the percentage of deviation, from mid 1980s, statistical metrics such as accuracy or linear regression score were considered to be inadequate for reporting errors in BG estimation and evaluating new methodologies.
      • Parkes J.L.
      • Slatin S.L.
      • Pardo S.
      • et al.
      A new consensus error grid to evaluate the clinical significance of inaccuracies in the measurement of blood glucose.
      In PE, the risk plane, on which the X-axis is the actual BG and the Y-axis is the measured BG, is divided into eight concentric zones (Figure 1). There are five risk levels associated to all the possible errors: A is the zone corresponding to clinically accurate measurement, i.e., the one with no effect on clinical action, B is the zone corresponding to altered clinical action, but no effect on clinical outcome, C is the zone corresponding to wrong actions that are likely to affect clinical outcome, D is the zone corresponding to altered clinical action with a significant clinical risk, and E is the zone corresponding to altered clinical action with dangerous consequences. The PE is defined in a slightly different way for T1D and T2D and was based on the aggregated expert knowledge of 100 endocrinologists and refined the previous version of the same idea initially proposed in the 1980s. It has become part of the standard for BG measurement ISO 1519 2013/2018.
      FIGURE 1
      FIGURE 1Definition of Parkes Error, reproduced from ref.
      • Parkes J.L.
      • Slatin S.L.
      • Pardo S.
      • et al.
      A new consensus error grid to evaluate the clinical significance of inaccuracies in the measurement of blood glucose.

      Fusion of different technologies

      At the moment, none of the noninvasive technologies for glucose monitoring is sufficiently reliable and convenient and their development remains to be an active research field. An interested reader can refer to Bolla and Priefer
      • Bolla A.S.
      • Priefer R.
      Diabetes and metabolic syndrome.
      for an overview of current and future noninvasive devices. Yet, a system implementing the fusion of the three modalities had a much better success rate than any of the constituent technologies alone.
      • Lin T.
      • Gal A.
      • Mayzel Y.
      • et al.
      Non-invasive glucose monitoring: a review of challenges and recent advances.
      Fusion of features from complementary sources enhances predicitive models in general, beyond pattern recognition from voice, eg. in the prediction of the biochemical activity
      • Sidorova J
      • Caltabiano G
      • Drougov A
      • et al.
      DUPROSY: Dual probabilistic system for biochemical activity prediction.
      . Voice can be expected to bring additional improvement in fusion of complementary sources of information, eg.
      • Padila W.R.
      • García J.
      • Molina J.M.
      Knowledge extraction and improved data fusion for sales prediction in local agricultural markets.
      Also a recent patent lays emphasis on the fusion of CBGM (a method with a significant delay but accurate) and BG estimation from voice (instantaneous).

      J. Rasmusson, P. Karlsson, M. Svensson, et al., “Method and device for blood glucose monitoring”, Patient EP 3 574 830 A1, 2019.

      Another possibility is a fusion of invasive and noninvasive methods: based on speech analysis, alert the patient when to do an additional invasive BG measurement.

      DISCUSSION

      The present review has covered the literature on BG estimation from voice.
      • 1)
        All the studies on glucose estimation from voice report positive results: from a nonrandom nature of the relation between the acoustic patterns and BG value in few subjects to 98% correct estimation in 7,000 subjects. Due to the novelty of the field and the authors being unaware of the published studies (eg, several claim they are the first ones), there are five types of computational system designs that come from different research fields.
      • 2)
        Unlike other vocal biomarkers, generalization beyond one speaker in instantaneous glucose estimation can be impossible, because in endocrinology research the trend is that many aspects regarding diabetes are highly individual and average responses are useless.
      • 3)
        The community of glucose estimation from voice should be aware of the specific practices in endocrinology regarding success/error measure, when it comes to comparing different methods, as well as avoid the bias of data collection, namely, (i) the subjects must be blinded about their BG; (ii) if CBGM values are used, then BG traces need to be correctly retrofitted.

      CONCLUSIONS

      We have reviewed and discussed the usefulness of voice as a biomarker of glucose variability taking into consideration variations across individuals. The two basic mechanisms for the association between BG level and voice are glucose-induced changes in vocal fold elasticity and/or anxiety. The relation between glucose level and voice is nonrandom and, even if future research shows that glucose level cannot be estimated from voice alone with sufficient accuracy, there is a lot of active research on noninvasive estimation of BG with fusion of complementary technologies, and voice is one more source of information to gain additional accuracy. As a technical contribution, the article discussed the defects of the currently used protocols and catalogued the existing systems.

      Future Work

      In the authors’ opinion, unless the details of study
      • Motorin V.
      Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.
      are made fully transparent, a solution to a speaker-dependent problem would be:
      • to collect data via recording either a short sentence or the phoneme /a/ with a smartphone;
      • extracting a large number of acoustic features from the speech samples with the openSMILES extractor
        • Eyben F
        • Wollmer M
        • Gross F
        • et al.
        Recent developments in opensmile, the munich open-source multimedia feature extractor.
        (or similar software); and then
      • depending on the size of the database either train a deep neuronal network (several thousand voice samples) or a classical classification algorithm such as the Support Vector Machine or Naïve Bayes (several hundred voice samples), with appropriate problem formulation, e. g.
        • Sánchez P. D.
        • Amigo D.
        • García J.
        • et al.
        Architecture for trajectory-based fishing ship classification with AIS data.
      The generic analysis pipeline
      • Sidorova J.
      • Anisimova M.
      Impact of diabetes mellitus on voice: a methodological commentary.
      with the classical machine learning function (the scripts for data collection with a smartphone,

      Sidorova J, Arlos P, Vendrell J, et al., “Collection and Analysis of Voice Data for Medical Research”, manuscript in preparation.

      feature extraction, and training/testing of the classification function) are available from the first author on request.
      Many factors can affect the voice, for example, fatigue, alcohol and other drugs, stress, circadian rhythm, temporary mucus, infection and so on. This phenomenon, when the distributions shift due to a global change, is referred to as “context drift.” There are diverse ways to handle this variability and the fact that a drift has happened is detectable. Also, to the extent possible, the vocal biomarker should be built with the features that are robust to context drift.

      ACKNOWLEDGMENTS

      The corresponding author is pleased to acknowledge being part of the “Scalable resource-efficient systems for big data analytics”funded by the Knowledge Foundation, Sweden  (20140032).

      AUTHOR DISCLOSURE STATEMENT

      The authors declare no conflict of interest.

      REFERENCES

        • Muaremi A.
        • Arnrich B.
        • Tröster G.
        Towards measuring stress with smartphones and wearable devices during workday and sleep.
        BioNanoScience. 2013; 3: 172-183
        • Van Segbroeck M.
        • Travadi R.
        • Vaz C.
        • et al.
        Classification of Cognitive Load From Speech Using an I-Vector Framework.
        Interspeech, Singapore2014: 751-755
        • Aguiar A.C.
        • Kaiseler M.
        • Meinedo H.
        • et al.
        Speech Stress Assessment Using Physiological and Psychological Measures.
        Ubicomp, Zurich, Switzerland2013: 921-930
        • Räsänen O.
        • Pohjalainen J.
        Random Subset Feature Selection in Automatic Recognition of Developmental Disorders, Affective States, and Level of Conflict From Speech.
        Interspeech, Lyon, France2013: 210-214
        • Bone D.
        • Chaspari T.
        • Audhkhasi K.
        • et al.
        Classifying Language-Related Developmental Disorders From Speech Cues: The Promise and the Potential Confounds.
        Interspeech, Lyon, France2013: 182-186
        • Li M.
        Automatic Recognition of Speaker Physical Load Using Posterior Probability Based Features From Acoustic and Phonetic Tokens.
        Interspeech, Singapore2014: 437-441
        • Bayestehtashk A.
        • Asgari M.
        • Shafran I.
        • et al.
        Fully automated assessment of the severity of Parkinson's disease from speech.
        Comput Speech Lang. 2015; 29: 172-185
        • Bocklet T.
        • Steidl S.
        • Nöth E.
        • et al.
        Automatic Evaluation of Parkinson's Speech-Acoustic, Prosodic and Voice Related Cues.
        Interspeech, Lyon, France2013: 1149-1153
        • Orozco-Arroyave J.R.
        • Hönig F.
        • Arias-Londoño J.D.
        • et al.
        Automatic detection of Parkinson's disease in running speech spoken in three different languages.
        J Acoust Soc Am. 2016; 139: 481-500
        • Kim J.
        • Nasir M.
        • Gupta R.
        • et al.
        Automatic Estimation of Parkinson's Disease Severity From Diverse Speech Tasks.
        Interspeech, Dresden, Germany2015: 914-918
        • Lopez-de-Ipiña K.
        • Alonso J.B.
        • Solé-Casals J.
        • et al.
        On automatic diagnosis of Alzheimer's disease based on spontaneous speech analysis and emotional temperature.
        Cogn Comput. 2015; 7: 44-55
        • Gajšek R.
        • Mihelic F.
        • Dobrišek S.
        Speaker state recognition using an HMM-based feature extraction method.
        Comput Speech Lang. 2013; 27: 135-150
        • Suendermann-Oeft D.
        • Ramanarayanan V.
        • Teckenbrock M.
        • et al.
        HALEF: an open-source standard-compliant telephony-based modular spoken dialog system: a review and an outlook.
        in: Geunbae Lee G. Kook Kim H. Jeong M. Kim J.-H. Natural Language Dialog Systems and Intelligent Assistants. Springer International Publishing, Switzerland2015: 53-61 (ch. 5)
        • Sidorova J.
        • Carlsson S.
        • Rosander O.
        • et al.
        Towards disorder-independent automatic assessment of emotional competence in neurological patients with a classical emotion recognition system: application in foreign accent syndrome.
        IEEE Trans Affect Comput. 2019;
        • Colburn W.
        • DeGruttola V.G.
        • DeMets D.L.
        • et al.
        Biomarkers and surrogate endpoints: preferred definitions and conceptual framework. Biomarkers Definitions Working Group..
        Clin Pharmacol Ther. 2001; 69: 89-95
        • Sidorova J.
        • Badia T.
        Syntactic learning for ESEDA.1, tool for enhanced speech emotion detection and analysis.
        in: Proceedings of Internet Technology and Secured Transactions Conference, UK, London2009 (1-6)
        • Lin T.
        • Gal A.
        • Mayzel Y.
        • et al.
        Non-invasive glucose monitoring: a review of challenges and recent advances.
        Curr Trends Biomed Eng Biosci. 2017; 6: 1-8
        • Rodbard D.
        Continuos glucose monitoring: a review of successes, challenges, and opportunities.
        Diab Technol Ther. 2016; 18 (S2-3-13)
        • Ravi R.
        • Gunjawate D.
        Effect of diabetes mellitus on voice: a systematic review.
        Pract Diab. 2019; 36: 177-180
        • Chitkara D.
        • Sharma R.K.
        Voice based detection of type 2 diabetes mellitus.
        in: International Conference on Advances in Electrical, Electronics, Information, Communication and Bio-Informatics, Chennai, India. 2016: 83-87
        • Pyniopodjanard S.
        • Suppakitjanusant P.
        • Lomprew P.
        • et al.
        Instrumental acoustic voice characteristics in adults with type 2 diabetes.
        J Voice. 2019; (In press)
        • Hamdan A.
        • Jabbour J.
        • Nassar J.
        • et al.
        Vocal characteristics in patients with type 2 diabetes mellitus.
        Eur Arch Otorhinolaryngol. 2012; 269: 1489-1495
        • Czupryniak. L.
        • Sielska-Badurek E.
        • Niebisz A.
        • et al.
        378-P: human voice is modulated by hypoglycemia and hyperglycemia in type 1 diabetes.
        in: Poster Presentation. American Diabetes Association, San Francisco, California2019
        • Tschope C.
        • Duckhorn F.
        • Wollf M.
        • et al.
        Estimating blood sugar from voice samples: a preliminary study.
        in: International Conference on Computational Science and Computational Intelligence, Las Vegas, USA. 2015: 804-805
        • Motorin V.
        Scientific solutions for the parameter’s automation in biochemical and biomechanical processes of the operational estimation of blood glucose from human voice.
        Theory Pract Modern Sci. 2016; 7 (in Russian): 214-226
      1. Y. Ulanovsky, A. Frolov, A. Kozlova, et al., “Device for blood glucose level determination”, Patent WO2014072823, 2014.

      2. Y. Ulanovsky, A. Frolov, A. Kozlova, “Method of non-invasive determination of glucose concentration in blood and device for the implementation of thereof”, Patent WO2014/049438.

      3. P. R. Michaelis, “Detection of extreme hypoglycemia and hyperglycemia based on automatic analysis of speech patterns”, US patent US 7, 925,508 B1, 2011.

      4. J. Rasmusson, P. Karlsson, M. Svensson, et al., “Method and device for blood glucose monitoring”, Patient EP 3 574 830 A1, 2019.

        • Weiniger K.
        • Jacobson A.M.
        • Draelos M.T.
        • et al.
        Blood glucose estimation and symptoms during hyperglycemia and hypoglycemia in patients with insulin-dependent diabetes mellitus.
        Am J Med. 1995; 98: 22-31
        • Sidorova J.
        Speech emotion recognition with TGI+.2 classifier.
        in: Proceedings of European Association for Computational Linguistics, Student Session, Athens, Greece. 2009: 54-60
        • Sidorova J.
        • Badia T.
        ESEDA: tool for enhanced speech emotion detection and analysis.
        in: The 4th International Conference on Automated Solutions for Cross Media Content and Multi-Channel Distribution, Italy, Florence. 2008: 17-19
        • Parkes J.L.
        • Slatin S.L.
        • Pardo S.
        • et al.
        A new consensus error grid to evaluate the clinical significance of inaccuracies in the measurement of blood glucose.
        Diabetes Care. 2000; 23: 1143-1148
      5. The converter among blood glucose units in different system. Available at:http://www.unit-conversion.info/blood-sugar.html. Accessed January 22, 2020.

      6. C. Cobelli, S. Del Favero, A. Facchinetti, et al., “Retrospective retrofitting method to generate a continuous glucose concentration profile by exploiting continuous glucose monitoring sensor data and blood glucose measurements”, patent US 2019/0223807.

        • Pleus S.
        • Schoemaker M.
        • Morgenstern K.
        • et al.
        Rate-of-change dependence of the performance of two CGM systems during induced glucose swings.
        J Diab Sci Technol. 2015; 9: 801-807
        • Polonsky W.
        • Davis C.
        • Jacobson A.
        • et al.
        Correlates of hypoglycemic fear in type I and type II diabetes mellitus.
        Health Psychol. 1992; 11: 199-202
        • Nogueiras A.
        • Moreno A.
        • Bonafonte A.
        • et al.
        Speech emotion recognition using hidden Markov models.
        in: Proceedings of Eurospeech, Aalborg, Denmark. 2001
        • Sidorova J.
        • Anisimova M.
        Impact of diabetes mellitus on voice: a methodological commentary.
        J Voice. 2020;
      7. Sidorova J, Arlos P, Vendrell J, et al., “Collection and Analysis of Voice Data for Medical Research”, manuscript in preparation.

        • Padila W.R.
        • García J.
        • Molina J.M.
        Knowledge extraction and improved data fusion for sales prediction in local agricultural markets.
        Sensors. 2019; 19: 286
        • Sánchez P. D.
        • Amigo D.
        • García J.
        • et al.
        Architecture for trajectory-based fishing ship classification with AIS data.
        Sensors. 2020; 20: 3782
        • Bolla A.S.
        • Priefer R.
        Diabetes and metabolic syndrome.
        Clin Res Rev. 2020; 14 (739e751)
        • Eyben F
        • Wollmer M
        • Gross F
        • et al.
        Recent developments in opensmile, the munich open-source multimedia feature extractor.
        MM’ 13, Barcelona, Spain. 2013; : 835-838
        • Sidorova J
        Optimization techniques for speech emotion recognition.
        PhD thesis, Universidad Pompeu Fabra. 2009;
        • Sidorova J
        • Caltabiano G
        • Drougov A
        • et al.
        DUPROSY: Dual probabilistic system for biochemical activity prediction.
        Procs. of the 8th International Conference on Computing Technology and Information Management. 2012; : 800-803
        • Sidorova J
        • McDonough J
        • Badia T
        Automatic Recognition of Emotive Voice and Speech.
        Emotions in the Human Voice, Culture and Perception. 3. Plural Publishing, USA2007: 217-242
        • Sidorova J
        Speech emotion recognition, DEA Report. Universidad Pompeu Fabra, 2007