CONTENTS
BEING BIOLOGICAL
SIMULACRA
SPEECH SYNTHESIS
VOCAL TRACTS
ARTICULATORS
SPEECH PRODUCTION
MCGURK
SPEECHREADING
FACIAL ANIMATION
AVATARS
BACKGROUND
DIRECTORY
BIBLIOGRAPHY

Talking Heads:
Bibliography

     Articulatory synthesis | Avatars | Facial animation | McGurk
Speech - general | Speech synthesis | Speech production | Speechreading


 
Articulatory synthesis and inversion:

Atal, B.S., Chang, J.J., Mathews, M.V., & Tukey, J.W. (1978). Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer-sorting technique. Journal of the Acoustical Society of America, 63, 1535-1555.

Atal, B., & Rioul, O. (1989). Neural networks for estimating articulatory positions from speech. Journal of the Acoustical Society of America, 86, 123-131.

Badin, P. & Fant, G. (1984). Notes on vocal tract computation. STL-QPR No. 2-3, 53-107.

Båvegård, M. (1996). Towards an articulatory speech synthesizer: Model development and simulations. TMH-QPSR, Jan. 1996, 1-15.

Boë, L.-J, Perrier, P., & Bailly, G. (1992). The geometric vocal tract variables controlled for vowel production: proposals for constraining acoustic-to-articulatory inversion. Journal of Phonetics, 20, 27-38.

Butler, S. & Wakita, H. (1987). Articulatory constraints on vocal tract area functions and their acoustic implications. Speech. Technol. Lab. Res. Rep., 1, 1-7.

Coker. C. H. (1968). Speech synthesis with a parametric articulatory model. Proc. Speech. Symp., Kyoto, Japan, paper A-4.

Coker. C. H. (1976). A model for articulatory dynamics and control. Proceedings of the IEEE, 64/4, 452-460.

Coker. C. & Fujimura, O. (1966). Model for the specification of the vocal tract area function.. Journal of the Acoustical Society of America, 40, 1271.

Coker. C. H., Umeda, N., & Browman, C.P. (1973). Automatic synthesis from ordinary English text. IEEE Trans. Audio Electroacoust., AU-21, 293-297.

Cranen, B. & Schroeter, J. (1996). Physiologically motivated modelling of the voice source in articulatory analysis/synthesis. Speech Communication, 19, 1-19.

Fant, G. (1960). Acoustic Theory of Speech Production. Mouton, The Hague.

Fant, G. (1986). Glottal flow: Models and interaction. Journal of Phonetics, 14, 393-399.

Flanagan, J.L., Ishizaka, K., & Shipley, K. L. (1975). Synthesis of speech from a dynamic model of the vocal cords and vocal tract. Bell Systems Technical Journal, 54, 485-506.

Gabioud, B. (1994). Articulatory models in speech synthesis. In E. Keller (Ed.). (1994). Fundamentals of Speech Synthesis and Speech Recognition. John Wiley & Sons, 215-230.

Gupta. S. & Schroeter, J. (1993). Pitch-synchronous frame-by-frame and segment-based articulatory analysis by synthesis. Journal of the Acoustical Society of America, 94, 2517-2530.

Harshman, R., Ladefoged, P., & Goldstein, L. (1977). Factor analysis of tongue shapes. Journal of the Acoustical Society of America, 62, 693-707.

Heinz, J. M. & Stevens, K. N., On the derivation of area functions and acoustic spectra from cineradiographic films of speech. (1964). Journal of the Acoustical Society of America, 36, 1037-1038.

Henke, W. L. (1966). Dynamic Articulatory Model of Speech Production Using Computer Simulation. Unpublished doctoral dissertation, MIT, Cambridge, MA.

Hirayama, M., Vatikiotis-Bateson, E., & Kawato, M. (1993). Physiologically based speech synthesis using neural networks, IEICE Transactions, E76-A, 1898-1910.

Hogden, J., Löfqvist, A., Gracco., V., Zlokarnik, I., Rubin, P., & Saltzman, E. (1996). Accurate recovery of articulator positions from acoustics: New conclusions based on human data. Journal of the Acoustical Society of America, 100 (3), 1819-1834.

Ishizaka, K. & Flanagan, J. (1972). Synthesis of voiced sounds from a two-mass model of the vocal cords. Bell Syst. Techn. J., 51, 1233-1268.

Ladefoged, P., Harshman, R., Goldstein, L., & Rice, L. (1978). Generating vocal tract shapes from formant frequencies. Journal of the Acoustical Society of America, 64, 1027-1035.

Lin, Q. (1990). Speech Production Theory and Articulatory Speech Synthesis. Ph.D. thesis, Dept. of Speech Communication and Music Acoustics, Royal Institute of Technology, Stockholm.

Lin, Q. & Fant, G. (1990). A new algorithm for speech synthesis based on vocal tract modeling. STL-QPSR 2-3, 45-52.

Maeda, S. (1979). An articulatory model of the tongue based on a statistical analysis. Journal of the Acoustical Society of America, 65, S22.

Maeda, S. (1988). Improved articulatory model. Journal of the Acoustical Society of America, 84, Sup. 1, S146.

Maeda, S. (1990). Compensatory articulation during speech: evidence from the analysis and synthesis of vocal-tract shapes using an articulatory model. In W. J. Hardcastle and A. Marchal (Eds.), Speech Production and Speech Modelling, Kluwer Academic, Dordrecht, 131-149.

McGowan, R. (1994). Recovering articulator movement from formant frequency trajectories using task dynamics and a genetic algorithm: Preliminary tests. Speech Communication, 14, 19-48.

Mermelstein, P. (1973). Articulatory model for the study of speech production. Journal of the Acoustical Society of America, 53, 1070-1082.

Payan, Y. & Perrier, P. (1997). Synthesis of V-V sequences with a 2D biomechanical tongue model controlled by the Equilibrium Point Hypothesis. Speech Communication, 22, 185-205.

Perrier, P., Boë, L.-J., and Sock, R. (1992). Vocal tract area functions estimation from midsagittal dimensions with CT scans and a vocal tract cast: Modelling the transition with two sets of coefficients. Journal of Speech and Hearing Research, 35, 53-67.

Perrier, P. & Ostry, D. (1994). Dynamic modelling and control of speech articulators: Application to vowel reduction. In E. Keller (Ed.). (1994). Fundamentals of Speech Synthesis and Speech Recognition. John Wiley & Sons, 231-251.

Rahim, M., Goodyear, C., Kleijn, W., Schroeter, J., & Sondhi, M. (1993). On the use of neural networks in articulatory speech synthesis. Journal of the Acoustical Society of America, 93, 1109-1121.

Rubin, P. E., Baer, T., & Mermelstein, P. (1981) An articulatory synthesizer for perceptual research, Journal of the Acoustical Society of America, 70, 321-328.

Rubin, P., Saltzman, E., Goldstein, L., McGowan, R., Tiede, M., & Browman, C. (1996). CASY and extensions to the task-dynamic model. Proceedings of the 1st ESCA Tutorial and Research Workshop on Speech Producing Modeling - 4th Speech Production Seminar, 125-128.

Sanguineti, V. & Morasso, P. (1996) Articulatory speech synthesis by field computation. In S. G. Pandalai, editor, Recent Research Developments in Biological Cybernetics. Research Signpost, Trivandrum, India.

Schoentgen, J., & Ciocea, S. (1997). Kinematic formant-to-area mapping. Speech Communication, 21, 227-244.

Schroeter, J. & Sondhi, M. (1992). Speech coding based on physiological models of speech production. In S. Furui & M. Sondhi (Eds.), Advances in Speech Signal Processing. Dekker, New York, 231-267.

Schroeter, J. & Sondhi, M. (1994). Techniques for estimating vocal-tract shapes from the speech signal. IEEE Trans. Speech Audio Process., 2, 133-150.

Shirai, K. (1993). Estimation and generation of the articulatory motion using neural networks. Speech Communication, 13, 45-51.

Shirai, K. & Kobayashi, T. (1986). Estimating articulatory motion from speech wave. Speech Communication, 5, 159-170.

Shirai, K. & Honda, M. (1977). Estimation of articulatory motion. In M. Sawashima and F. S. Cooper (Eds.), Dynamic Aspects of Speech Production, University of Tokyo Press, Japan, 279-302.

Sondhi, M. M. & Schroeter, J. (1986). A nonlinear articulatory speech synthesizer using both time- and frequency-domain elements. Proc. ICASSP-Tokyo, 1999-2002.

Sondhi, M. M. & Schroeter, J. (1986). A hybrid time-frequency domain articulatory speech synthesizer. IEEE Trans. Acoust. Speech Signal Process., ASSP-35, 955-967.

Titze, I. R. (1989). A four parameter model of the glottis and vocal fold contact area. Speech Communication, 8, 191-201.

Wakita, H. (1973). Direct estimation of the vocal tract shape by inverse filtering of acoustic speech waveforms. IEEE Trans. Audio Electroacoust., AU-21(5), 417-427.
 

Bibliography  
   Articulatory sel of the glottis and vocal fold contact area. Speech Communication, 8, 191-201.

Wakita, H. (1973). Direct estimation of the vocal tract shape by inverse filtering of acoustic speech waveforms. IEEE Trans. Audio Electroacoust., AU-21(5), 417-427.
 

Bibliography  
   Articulatory sel of the glottis and vocal fold contact area. Speech Communication, 8, 191-201.

Wakita, H. (1973). Direct estimation of the vocal tract shape by inverse filtering of acoustic speech waveforms. IEEE Trans. Audio Electroacoust., AU-21(5), 417-427.
 

Bibliography