The purpose of this study was to examine several factors of vocal quality that might be affected by changes in vocal fold vibratory patterns. Four voice types were examined: modal, vocal fry, falsetto, and breathy. Three categories of analysis techniques were developed to extract source‐related features from speech and electroglottographic (EGG) signals. Four factors were found to be important for characterizing the glottal excitations for the four voice types: the glottal pulse width, the glottal pulse skewness, the abruptness of glottal closure, and the turbulent noise component. The significance of these factors for voice synthesis was studied and a new voice source model that accounted for certain physiological aspects of vocal fold motion was developed and tested using speech synthesis. Perceptual listening tests were conducted to evaluate the auditory effects of the source model parameters upon synthesized speech. The effects of the spectral slope of the source excitation, the shape of the glottal excitation pulse, and the characteristics of the turbulent noise source were considered. Applications for these research results include synthesis of natural sounding speech, synthesis and modeling of vocal disorders, and the development of speaker independent (or adaptive) speech recognition systems.
Skip Nav Destination
Article navigation
November 1991
November 01 1991
Vocal quality factors: Analysis, synthesis, and perception
D. G. Childers;
D. G. Childers
Department of Electrical Engineering, University of Florida, Gainesville, Florida 32611‐2024
Search for other works by this author on:
C. K. Lee
C. K. Lee
Department of Electrical Engineering, Tatung Institute of Technology, Taipei, Taiwan, Republic of China
Search for other works by this author on:
J. Acoust. Soc. Am. 90, 2394–2410 (1991)
Article history
Received:
September 28 1990
Accepted:
July 25 1991
Citation
D. G. Childers, C. K. Lee; Vocal quality factors: Analysis, synthesis, and perception. J. Acoust. Soc. Am. 1 November 1991; 90 (5): 2394–2410. https://doi.org/10.1121/1.402044
Download citation file:
Sign in
Don't already have an account? Register
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
Pay-Per-View Access
$40.00
Citing articles via
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Using soundscape simulation to evaluate compositions for a public space sound installation
Valérian Fraisse, Nadine Schütz, et al.
Source and propagation modelling scenarios for environmental impact assessment: Model verification
Michael A. Ainslie, Robert M. Laws, et al.