Summary
Objective
Methods
Results
Key Words
Purchase one-time access:
Academic & Personal: 24 hour online accessCorporate R&D Professionals: 24 hour online accessOne-time access price info
- For academic or personal research use, select 'Academic and Personal'
- For corporate R&D use, select 'Corporate R&D Professionals'
Subscribe:
Subscribe to Journal of VoiceReferences
- A cross-entropy-guided measure (CEGM) for assessing speech recognition performance and optimizing DNN-based speech enhancement.IEEE ACM Trans Audio Speech Lang Process. 2021; 29: 106-117
- Multi-level single channel speech enhancement using a unified framework for estimating magnitude and phase spectra.IEEE ACM Trans Audio Speech Lang Process. 2020; 28: 1315-1327
- Binaural codebook-based speech enhancement with atomic speech presence probability.IEEE ACM Trans Audio Speech Lang Process. 2019; 27: 2150-2161
- The application of deep neural network in speech enhancement processing.2018 5th International Conference on Information Science and Control Engineering (ICISCE). 2018; : 1263-1266
- Joint optimization of modified ideal radio mask and deep neural networks for monaural speech enhancement.2017 IEEE 9th International Conference on Communication Software and Networks (ICCSN). 2017; : 1070-1074
- Single-channel speech enhancement using learnable loss mixup.Proc Interspeech 2021. 2021; : 2696-2700https://doi.org/10.21437/Interspeech.2021-859
- Glottal source modeling in dysphonic speech.Speech Commun. 2019; 113: 136-145https://doi.org/10.1016/j.specom.2019.06.001
- Artificial neural network based voice pathology detection using time-domain features.Appl Soft Comput. 2021; 109: 1-11https://doi.org/10.1016/j.asoc.2021.107016
- A pilot study of pink noise therapy for improving speech in Parkinson’s disease.J Acoust Soc Am. 2017; 141: 2373-2383https://doi.org/10.1121/1.4979417
- Relief-based feature selection: Introduction and review.J Biomed Inf. 2018; 85: 189-203https://doi.org/10.1016/j.jbi.2018.07.014
- Streaming feature selection algorithms for big data: a survey.Appl Comput Inf. 2019; https://doi.org/10.1016/j.aci.2019.01.001
- A survey on feature selection methods.Comput Electr Eng. 2014; 1: 16-28
- MCRA noise estimation for KLT-VRE-based speech enhancement.Int J Speech Technol. 2013; 16: 333-339
- A multi-target SNR-progressive learning approach to regression based speech enhancement.IEEE/ACM Trans Audio Speech Lang Process. 2020; 28: 1608-1619
- Gold section search algorithm for maximizing an object-oriented neural network-based cost function.IEEE Trans Syst Man Cybern Part C (Appl Rev). 1999; 29: 234-239
- A new thresholding method based on the gold section search for image segmentation.IEEE Trans Image Process. 2011; 20: 1717-1726
- Application of golden section search for optimization of fractal antenna design.Progr Electromagnet Res. 2012; 126: 355-374
- Surgical effects of type-I thyroplasty and fat injection laryngoplasty on voice recovery.Auris Nasus Larynx. 2021; 48: 302-309
- Communication, functional disorders and lifestyle changes after total laryngectomy.Clin Otolaryngol. 1994; 19: 295-300
- Speaker verification using neural embedding of phonetic posterior features.IEEE International Conference on Acoustics, Speech, and Signal Processing. IEEE, 2018: 4874-4878
- Robust speaker recognition in noisy environments using convolutional neural networks.IEEE Signal Process Lett. 2018; 25: 1353-1357
- Robust speaker recognition using attention-based deep neural networks.IEEE International Conference on Acoustics, Speech, and Signal Processing. IEEE, 2019
- Speaker recognition in adverse conditions using deep neural networks trained on synthetic noisy speech.IEEE J Select Topics Signal Process. 2019; 13: 364-374
- Multiple vowels repair based on pitch extraction and line spectrum pair feature for voice disorder.IEEE J Biomed Health Inf. 2020; 24: 1940-1951
- An improved time domain pitch detection algorithm for pathological voice.Am J Appl Sci. 2012; 9: 93-102
- PVR-AFM: a pathological voice repair system based on non-linear structure.J Voice. 2021; https://doi.org/10.1016/j.jvoice.2021.05.010
- Methods for formant extraction in speech of patients after total laryngectomy.IEEE J Biomed Health Inf. 2006; 1: 107-112
- Correlation between vocal function and quantitative videofluoroscopic analysis of swallowing in patients with vocal cord paralysis.Ann Otol Rhinol Laryngol. 2007; 116: 93-99
- Vocal fold polyps and their impact on the glottal airflow: an experimental study.Ann Otol Rhinol Laryngol. 2005; 114: 835-841
- Visual assessment of laryngeal airflow in speech pathology.Int J Lang Commun Disord. 2000; 35: 401-415
- Enhancement of speech corrupted by acoustic noise.Proc Int Conf Acoust Speech Signal Process (ICASS). 1979; 4: 208-211
- All-pole modeling of degraded speech.IEEE Trans Acoust Speech Signal Process. 1978; 26: 197-210
- Statistical-model-based speech enhancement systems.Proc IEEE. 1992; 80: 1526-1555
- Speech enhancement from noise: a regenerative approach.Speech Commun. 1991; 10: 45-57
- Noise reduction using connectionist models.Proc IEEE Int Conf Acoust Speech Signal Process (ICASSP). 1988; : 553-556
- Speech enhancement with missing data techniques using recurrent neural networks.Proc IEEE Int Conf Acoust Speech Signal Process (ICASSP). 2004; : 733-736
- Speech enhancement based on deep denoising autoencoder.Interspeech. 2013; : 436-440
- Recurrent deep stacking networks for supervised speech separation.IEEE Int Conf Acoust Speech Signal Process. 2017; : 71-75
- A perceptual weighting filter loss for DNN training in speech enhancement.2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA). 2019; : 229-233https://doi.org/10.1109/WASPAA.2019.8937189
- Learning latent representations for speech generation and transformation.Interspeech. 2017; : 1273-1277
- Modeling and transforming speech using variational autoencoders.Interspeech. 2016; : 1770-1774
Bando Y, Mimura M, Itoyama K, Yoshii K, Kawahara T. Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, AB, Canada, 2018, pp. 716–720.
- English Multi-Peaker Corpus for CSTR Voice Cloning Toolkit[J]. University of Edinburgh. The Centre for Speech Technology Research (CSTR), 2017
- Noisex-92: A database and an experiment to study the effect of additive noise on speech recognition systems[J].Speech communication. 1993; 12: 247-253
Barry WJ, Ptzer M. Saarbrucken voice database. Institute of Phonetics, Universitt des Saarlandes; 2007. http://www.stimmdatenbank.coli.uni-saarland.de.
Kingma DP, Ba J. Adam: a method for stochastic optimization. arXiv:141269802014.