Abstract
Keywords
Article info
Publication history
Publication stage
In Press Corrected Proof
Footnotes
Equal contribution, listed in alphabetical order.