Summary
Objective
This study aimed to analyze the effects of microphone and audio compression variables
on voice and speech parameters acquisition.
Method
Acoustic measures were recorded and compared using a high-quality reference microphone
and three testing microphones. The tested microphones displayed differences in specifications
and acoustic properties. Furthermore, the impact of the audio compression was assessed
by resampling the original uncompressed audio files into the MPEG-1/2 Audio Layer
3 (mp3) format at three different compression rates (128 kbps, 64 kbps, 32 kbps).
Eight speakers were recruited in each recording session and asked to produce four
sustained vowels: two [a] segments and two [ɛ] segments. The audio was captured simultaneously
by the reference and tested microphones. The recordings were synchronized and analyzed
using the Praat software.
Results
From a set of eight acoustic parameters assessed (f0, F1, F2, jitter%, shimmer%, HNR, H1-H2, and CPP), three (f0, F2, and jitter%) were suggested as resistant regarding the microphone and audio
compression variables. In contrast, some parameters seemed to be significantly affected
by both factors: HNR, H1-H2, and CPP; while shimmer% was found sensitive only concerning
the latter factor. Moreover, higher compression rates appeared to yield more frequent
acoustic distortions than lower rates.
Conclusion
Overall, the outcomes suggest that acoustic parameters are influenced by both the
microphone selection and the audio compression usage, which may reflect the practical
implications of these components on the acoustic analysis reliability.
Key Words
To read this article in full you will need to make a payment
Purchase one-time access:
Academic & Personal: 24 hour online accessCorporate R&D Professionals: 24 hour online accessOne-time access price info
- For academic or personal research use, select 'Academic and Personal'
- For corporate R&D use, select 'Corporate R&D Professionals'
Subscribe:
Subscribe to Journal of VoiceAlready a print subscriber? Claim online access
Already an online subscriber? Sign in
Register: Create an account
Institutional Access: Sign in to ScienceDirect
REFERENCES
- Aspectos técnicos na coleta de dados linguísticos orais.Raquel meister ko. freitag (organizadora). Metodologia de coleta e manipulação de dados em sociolinguística. Editora Edgard Blücher, São Paulo2014: 19-26
- recommended protocols for instrumental assessment of voice: American Speech-Language-Hearing Association Expert Panel to develop a protocol for instrumental assessment of vocal function.Am J Speech Lang Pathol. 2018; 27: 887-905https://doi.org/10.1044/2018_AJSLP-17-0009
- Mobile communication devices, ambient noise, and acoustic voice measures.J Voice. 2017; 31
- Voz o livro do especialista I.Revinter, São Paulo2001
- Factors affecting the quality of sound recording for speech and voice analysis.Int J Speech Lang Pathol. 2009; 11 (431-437)
- Guidelines for selecting microphones for human voice production research.Am J Speech Lang Pathol. 2010; 19: 356-368
- Some technical considerations in voice perturbation measurements.J Speech Hear Res. 1987; 30: 252-260
- The effects of mp3 compression on acoustic measurements of fundamental frequency and pitch range.Speech Prosody. 2016; 1: 523-527
- Acoustic analysis of pathological voices compressed with mpeg system.J Voice. 2003; 17: 126-139
- Influence of smartphones and software on acoustic voice measures.Int J Telerehabilitation. 2016; 8.2: 9-14
- Praat: doing phonetics by computer [Computer program].2019 (Version 6.0.50, retrieved 31 March 2019 from)
P Barbosa. Praat script: VoiceQualityAnalysis.psc. Copyright (C) 2006 Barbosa, P. A. ([email protected])
- Acoustic measurement of overall voice quality: a meta-analysis.J Acoust Soc Am. 2009; 126: 2619-2634
- Forensic speaker identification.Taylor & Francis, London2002
- Effect size estimates: current use, calculations, and interpretation.J Exp Psychol. 2012; 141
- The need to report effect size estimates revisited. An overview of some recommended measures of effect size.Trends Sport Sci. 2014; 1: 19-25
- Smartphones offer new opportunities in clinical voice research.J Voice. 2017; 31 (111.e1–111.e7)
- Assessing voice health using smartphones: bias and random error of acoustic voice parameters captured by different smartphone types.Int J Lang Commun Disord. 2019; 54 (MARCH/APRIL): 292-305
- The effect of noise on computer-aided measures of voice: a comparison of CSpeechSP and the multi-dimensional voice program software using the CSL 4300B Module and multi-speech for windows.J Voice. 2003; 17: 12-20
- Exploring the feasibility of smartphone microphone for measurement of acoustic voice parameters and voice pathology screening.Eur Arch Otorhinolaryngol. 2015; 272: 3391-3399
- Functional dysphonia: strategies to improve patient outcomes.Patient Relat Outcome Meas. 2015; 6: 243-253
- Smartphone application for the analysis of prosodic features in running speech with a focus on bipolar disorders: system performance evaluation and case study.Sensors (Switzerland). 2015; 15.11: 28070-28087
- Evaluating iPhone recordings for acoustic voice assessment.Folia Phoniatr Logop. 2012; 64.3: 122-130
- What is vocology?.Logoped Phoniatr Vocol. 1996; 21: 5-6
Article info
Publication history
Published online: January 13, 2021
Identification
Copyright
© 2021 Published by Elsevier Inc. on behalf of The Voice Foundation.