CONTENTS
BEING BIOLOGICAL
SIMULACRA
SPEECH SYNTHESIS
VOCAL TRACTS
ARTICULATORS
SPEECH PRODUCTION
MCGURK
SPEECHREADING
FACIAL ANIMATION
AVATARS
BACKGROUND
DIRECTORY
BIBLIOGRAPHY

Talking Heads:
Bibliography

     Articulatory synthesis | Avatars | Facial animation | McGurk
Speech - general | Speech synthesis | Speech production | Speechreading


 
Speech production and modeling:

Abbs, J.H. & Gilbert, B.N. (1973). A strain gage transduction system for lip and jaw motion in two dimensions: design criteria and calibration data. Journal of Speech and Hearing Research, 16, 248-256.

Abbs, J.H. & Gracco V.L. (1983). Sensorimotor actions in the control of multimovement speech gestures. Trends in Neuroscience, 6, 391-395.

Baer, T., Gore, J.C., Gracco, L.C., & Nye, P.W. (1991). Analysis of vocal tract shape and dimensions using magnetic resonance imaging. Journal of the Acoustical Society of America, 92, 3078-3096.

Basu, S., Oliver, N., & Pentland, A. (1998). 3D modeling and tracking of human lips. Proc. Internat. Conf. on Computer Vision, Bombay, 4-7 Jan. 1998 (IEEE Computer Society Press, Los Alamitos), 337-343.

Bell-Berti, F. & Raphael, L. J. (Eds.), (1995). Producing Speech: Contemporary Issues. For Katherine Safford Harris. AIP Press, New York.

Borden, G. J. & Harris, K. S. (1980). Speech Science Primer. Williams & Wilkins, Baltimore.

Browman, C. P. & Goldstein, L. (1986). Towards an articulatory phonology. Phonology Yearbook, 3, 219-252.

Browman, C. P. & Goldstein, L. (1989). Articulatory gestures as phonological units. Phonology, 6, 201-251.

Browman, C. P. & Goldstein, L. (1990). Gestural specification using dynamically-defined articulatory structures. Journal of Phonetics, 18, 299-320.

Browman, C. P. & Goldstein, L. (1990). Representation and reality: Physical systems and phonological structure. Journal of Phonetics, 18, 411-424.

Browman, C. P. & Goldstein, L. (1990). Tiers in articulatory phonology, with some implications for casual speech. In T. Kingston & M. E. Beckman (Eds.), Papers in Laboratory Phonology I: Between the Grammar and Physics of Speech (pp. 341-376). Cambridge University Press.

Browman, C. P. & Goldstein, L. (1992). Articulatory phonology: An overview. Phonetica, 49, 155-180.

Browman, C. P. Goldstein, L., Kelso, J .A. S., Rubin, P., & Saltzman, E. (1984). Articulatory synthesis from underlying dynamics. Journal of the Acoustical Society, 75, S22-S23 (A).

Butterworth, B. (ed.). (1980). Language Production. Academic Press, New York.

Clements, G. N. (1992). Phonological primes: Features or gestures? Phonetica, 49, 181-193.

Fant, G. (1970). Acoustic Theory of Speech Production. Mouton, The Hague.

Fant, G. (1980). The relations between area functions and the acoustic signal. Phonetica, 37, 55-86.

Fowler, C. A., Rubin, P., Remez, R. E., & Turvey, M. T. (1980). Implications for speech production of a general theory of action. In B. Butterworth (Ed.), Language production. Academic Press, New York.

Fowler, C. A. & Saltzman, E. (1980). Coordination and coarticulation in speech production. Journal of Phonetics, 36, 171-195.

Fromkin, V. A. (Ed.). (1980). Errors in Linguistic Performance: Slips of the Tongue, Ear, Pen and Hand. Academic Press, New York.

Fujimura, O. (1990). Articulatory perspectives of speech organization. In W. J. Hardcastle and A. Marchal (Eds.), Speech Production and Speech Modelling. Kluwer Academic Press, Dordrecht, Netherlands, 323-342.

Fujimura, O. & Erickson, D. (1997). Acoustic phonetics. In W. J. Hardcastle and J. Laver (Eds.), The Handbook of Phonetic Sciences. Blackwell, Cambridge, MA, 65-115.

Giles, S. & Moll, K. (1975). Cineflourographic study of selected allophones of English /l/. Phonetica, 31, 206-227.

Goodell, E. W. & Studdert-Kennedy, M. (1993). Acoustic evidence for the development of gestural coordination in the speech of 2-year-olds: A longitudinal study. Journal of Speech and Hearing Research, 36, 707-727.

Gracco, V. L. (in press). A neuromotor perspective on speech production. To appear in Speech Motor Production and Fluency Disorders.

Gribble, P. L., Laboissiere, R. & Ostry, D. J. (1997). Control of Human Arm and Jaw Motion: Issues Related to Musculo-Skeletal Geometry. In P.G. Morasso & V. Sanguineti (eds.), Self-organization, Computational Maps and Motor control. Elsevier-North Holland, Advances in Psychology Series, vol. 119.

Guenther, F. H. (1995). Speech sound acquisition, coarticulation and rate effects in a neural network model of speech production. Psychological Review, 102, 594-621.

Guiard-Marigny, T. & Ostry, D.J. (1997). A system for three-dimensional visualization of human jaw motion in speech. Journal of Speech, Language, and Hearing Research, 40, 1118-1121.

Hardcastle, W. J. & Marchal, A. (Eds.). (1990). Speech Production and Speech Modelling, Kluwer Academic, Dordrecht.

Hashimoto, K. & Suga, S. (1986). Estimation of the muscular tensions of the human tongue by using a three-dimensional model of the tongue. Journal of the Acoustical Society of Japan, 7(1), 39-46.

Honda, K., Hirai, H., & Dang, J. (1994). A physiological model of speech production and the implication of tongue-larynx interaction. In International Congress on Spoken Language Processing, Yokohama, Japan, 175-178.

Jordan, M. I. (1991). Serial order: A parallel distributed processing approach. In J. L. Elman & D. Rumelhart (Eds.), Advances in Connectionist Theory: Speech. Erlbaum and Associates, Hillsdale, NJ, 214-249.

Kelso, J. A. S., Saltzman, E. L., & Tuller, B. (1986). The dynamical perspective on speech production: Data and theory. Journal of Phonetics, 14, 29-59.

Kent, R. D., Adams, S. G., & Turner, G. S. (1996). Models of speech production. In N. J. Lass (Ed.), Principles of Experimental Phonetics. Mosby, St. Louis, 3-45.

Kent, R. D., Atal, B. S., & Miller, J. L. (Eds.) (1991). Papers in Speech Communication: Speech Production. Acoustical Society of America, Woodbury, New York.

Kent, R. D. & Minifie, F. D. (1977). Coarticulation in recent speech production models. Journal of Phonetics, 5, 115-133.

Kiritani, S., Itoh, S. K., & Fujimura, O. (1975). Tongue-pellet tracking by a computer controlled X-ray microbeam system. Journal of the Acoustical Society of America, 57, 1516-1520.

Kiritani, S., Miyawaki, K., & Fujimura, O. (1976). A computational model of the tongue. Res. Instit. Logoped. Phoniatr. Annu. Bull., 10, 243-252.

Kuehn, D. P. & Moll, K. (1976). A cineradiographic study of VC and CV articulatory velocities. Journal of Phonetics, 4, 303-320.

Laboissière, R., Ostry, D.J., and Feldman, A.G. (1996). The control of multi-muscle systems: Human jaw and hyoid movements. Biological Cybernetics, 74, 373-384.

Ladegofed, P. (1971). Preliminaries to Linguistic Phonetics. The University of Chicago Press, Chicago.

Ladegofed, P. (1975). A Course in Phonetics. Harcourt Brace Jovanovich, New York.

Lashley, K. (1951). The problem of serial order in behavior. In L. A. Jeffries (Ed.), Cerebral Mechanisms in Behvavior. Wiley, New York, 506-528.

Levelt, W. J. M. (1989). Speaking: From Intention to Articulation. MIT Press, Cambridge, MA.

Lindblom, B. (1996). Role of articulation in speech perception: Clues from production. Journal of the Acoustical Society of America, 99. 1683-1692.

Lindblom, B. E. F. & Sundberg, J. E. F. (1971). Acoustical consequences of lip, tongue, jaw and larynx movement. Journal of the Acoustical Society of America, 50, 1166-1179.

Löfqvist, A. (1990). Speech as audible gestures. In W. J. Hardcastle & A. Marchal (Eds.), Speech Production and Speech Modelling. Kluwer Academic Press, Dordrecht, Netherlands, 289-322.

Löfqvist, A. (1997). Theories and models of speech production. In W. J. Hardcastle & J. Laver (Eds.), The Handbook of Phonetic Sciences. Blackwell, Cambridge, MA, 404-426.

Löfqvist, A. & Gracco (1997). Lip and jaw kinematics in bilabial stop consonant production. Journal of Speech, Language, and Hearing Research, 40, 877-893.

Löfqvist, A. & Gracco (1998). Control of lip closure in bilabial stop consonant production. In M. Cannito, K, Yorkston, & D. Beukelman (Eds.), Motor Speech Disorders: Nature, Assessment and Management. Baltimore, Paul H. Brookes Publishing, Co., 47-65.

Lucero, J., Munhall, K. G., Gracco, V. L., & Ramsay, J. O. (1997). On the registration of time and the patterning of speech movements. Journal of Speech and Hearing Research, 40, 1111-1117.

Macchi, M. (1988). Labial articulation patterns associated with segmental features and syllable structure in English. Phonetica, 45, 109-121.

MacNeilage, P. (1970). Motor control of serial ordering of speech. Psychological Review, 77, 182-196.

MacNeilage, P. (Ed.) (1971). The Production of Speech. Springer-Verlag, New York.

MacNeilage, P. F., Studdert-Kennedy, M. G., and Lindblom, B. (1985). Planning and production of speech: An overview Journal of the Acoustical Society of America, 15, 15-21.

Maeda, S. & Honda, K. (1994). From EMG to formant patterns of vowels: The implication of vowel spaces. Phonetica, 51(1-3), 17-29.

Meyer, D. E. & Gordon, P. C. (1985). Speech production: Motor programming of phonetic features. Journal of Memory and Language, 24, 3-26.

Miyawaki, K. (1974). A study of the musculature of the human tongue. Annual Bull. Res. Instit. Logoped. Phoniatr., University of Tokyo, 8, 23-50.

Morasso, P. & Sanguineti, V. (1992). Equilibrium point and self-organization. Behavioral and Brain Sciences, 15:781-782.

Morrish, K., Stone, M., Shawker, T., & Sonies, B. (1985). Distinguishability of tongue shape during vowel production. Journal of Phonetics, 13, 189-203.

Munhall, K. G. & Löfqvist, A. (1992). Gestural aggregation in speech: Laryngeal gestures. Journal of Phonetics, 20, 111-126.

Munhall, K. G., Vatikiotis-Bateson, E., & Kawato, M. (submitted). Coarticulation and physical models of the vocal tract. Labphon 5. Cambridge Univ. Press., Cambridge.

Munhall, K. G., Vatikiotis-Bateson, E., & Tohkura, Y. (1995). X-ray film database for speech research. Journal of the Acoustical Society of America, 98(2), 1222-1224.

Nadler, R. D., Abbs, J. H., & Fujimura, O. (1987). Speech movement research using the new x-ray microbeam system. Speech Motor Control Laboratories Preprints, 181-184.

Öhman, S. (1966). Coarticulation in VCV utterances: Spectrographic measurements. Journal of the Acoustical Society of America, 39, 151-168.

Öhman, S. (1966). Numerical model of coarticulation. Journal of the Acoustical Society of America, 41, 310-320.

Ostry, D.J., Gribble, P.L., & Gracco, V.L. (1996). Coarticulation of jaw movements in speech production: Is context sensitivity centrally planned? Journal of Neuroscience, 16, 1570-1579.

Ostry, D.J., Gribble, P.L., Levin, M.F., & Feldman, A.G. (1997). Phasic and tonic stretch reflexes in muscles with few spindles: Human jaw opener muscles. Experimental Brain Research, 116(2), 299-308.

Ostry, D. J. & Munhall, K. G. (1994). Control of jaw orientation and position in mastication and speech. Journal of Neurophysiology, 71, 1515-1532.

Ostry, D.J., Vatikiotis-Bateson, E., & Gribble, P.L. (1997). An examination of the degrees of freedom of human jaw motion in speech and mastication. Journal of Speech and Hearing Research, 40, 1341-1351.

Perkell, J. S. (1969). Physiology of Speech Production: Results and Implications of a Quantitative Cinreradiographic Study. MIT Press, Cambridge, MA.

Perkell, J. S. (1986). Coarticulation strategies: preliminary implications of a detailed analysis of lower lip protrusion movement. Speech Communication, 5, 47-68.

Perkell, J. S. (1997). Articulatory processes. In W. J. Hardcastle and J. Laver (Eds.), The Handbook of Phonetic Sciences. Blackwell, Cambridge, MA, 333-370.

Perkell, J., Cohen, M., Svirsky, M., Mathies, M., Garabieta, I., & Jackson, M. (1992). Electromagnetic midsagittal articulometer systems for transducing speech articulatory movements. Journal of the Acoustical Society of America, 92, 3078-3096.

Perrier, P., Ostry, D.J., & Laboissiere, R. (1996). The equilibrium-point hypothesis and its application to speech motor control. Journal of Speech and Hearing Research, 39, 365-377.

Revéret, L. (1997). From raw images of the lips to articulatory parameters : A viseme-based prediction. Proceedings of the Fifth EUROSPEECH Conference, Rhodes, Greece, Sept. 22-25, 1997, vol. 4, 2011-2014.

Revéret, L. & Benoît, C. (1997). Lip paramaters extraction based on projection of raw images onto reference shapes. Proceedings of the First IEEE Workshop on Multimedia Signal Processing, Princeton, NJ, USA, June 23-25, 1997.

Revéret, L., F. Garcia, F., Benoît, C., & Vatikiotis-Bateson, E. (1997). An hybrid approach to orientation-free liptracking. Proceedings of the First ESCA Workshop on Audio-Visual Speech Processing, Rhodes, Greece, Sept. 26-27, 1997, p. 117-120.

Rubin, P. E. & Vatikiotis-Bateson, E. (1998). Measuring and modeling speech production. In S. L. Hopp, M. J. Owren, & C. S. Evans (Eds.), Animal Acoustic Communication: Recent Technical Advances, Springer-Verlag, New York, 251-290.

Saltzman, E. (1986). Task dynamic coordination of the speech articulators: A preliminary model. In H. Heuer & C. Fromm (Eds.), Experimental Brain Research Series 15 (pp. 129-144). New York: Springer-Verlag.

Saltzman, E. (1995). Dynamics and coordinate systems in skilled sensorimotor activity. In Port, R. and Van Gelder, T. (Eds.), Mind as motion. Cambridge, MA: MIT Press, 149-173.

Saltzman, E. & Kelso, J. A. S. (1987). Skilled actions: A task dynamic approach. Psychological Review, 94, 84-106.

Saltzman, E., Löfqvist, A., Kay, B., Kinsella-Shaw, J., & Rubin, P. (in press). Dynamics of intergestural timing: A perturbation study of lip-larynx coordination. Experimental Brain Research.

Saltzman, E., Löfqvist, A., Kinsella-Shaw, J., Kay, B., & Rubin, P. (1995). On the dynamics of temporal patterning in speech. In F. Bell-Berti and L. J. Raphael (Eds.), Producing Speech: Contemporary Issues. For Katherine Safford Harris. AIP Press, New York, 469-487.

Saltzman, E. L. & Munhall, K. G. (1989) A dynamical approach to gestural patterning in speech production. Ecological Psychology, 1, 333-382.

Saltzman, E. L., Rubin, P. E., Goldstein, L., & Browman. C. P. (1987). Task-dynamic modeling of interarticulator coordination. Journal of the Acoustical Society of America, 82, S15.

Sanguineti, V., Laboissière, R., & Ostry, D. J. (1998). A dynamic biomechanical model for the neural control of speech production. Journal of the Acoustical Society of America, 103, 1615-1627.

Sanguineti, V., Laboissière, R., & Payan, Y. (1997). A control model of human tongue movements in speech. Biological Cybernetics, 77(1), 11-22.

Sawashima, M. & Cooper, F. S. (Eds.). (1977). Dynamic Aspects of Speech Production, University of Tokyo Press, Japan.

Smith, A. (1992). The control of orofacial movements in speech. Crit. Rev. Oral Biol. Med., 3(3), 233-267.

Stevens, K. N. (1989). On the quantal nature of speech. Journal of Phonetics, 17, 3-46.

Stone, M. & Vatikiotis-Bateson, E. (1995). Coarticulatory effects on tongue, jaw, and palate behavior. Journal of Phonetics, 23, 81-100.

Studdert-Kennedy, M. (1998). The particulate origins of language generativity: from syllable to gesture. In J. Hurford, M. Studdert-Kennedy & C. Knight (Eds.), Approaches to the Evolution of Language, 1998. Cambridge University Press, New York, 202-221.

Surprenant, A. M. & Goldstein, L. (1998). The perception of speech gestures. Journal of the Acoustical Society of America, 104, 518-529.

Sussman, H. M., MacNeilage, P. F., & Hanson, R. J. (1973). Labial and mandibular dynamics during the production of bilabial consonants: Preliminary observations. Journal of Speech and Hearing Research, 16, 397-420.

Titze, I. R. (1984). Parameterization of the glottal area, glottal flow, and vocal fold contact area. Journal of the Acoustical Society of America, 75, 570-580.

Tohkura, Y., Vatikiotis-Bateson, E., & Sagisaka, Y. (1992). Speech perception, production and lingus

Sussman, H. M., MacNeilage, P. F., & Hanson, R. J. (1973). Labial and mandibular dynamics during the production of bilabial consonants: Preliminary observations. Journal of Speech and Hearing Research, 16, 397-420.

Titze, I. R. (1984). Parameterization of the glottal area, glottal flow, and vocal fold contact area. Journal of the Acoustical Society of America, 75, 570-580.

Tohkura, Y., Vatikiotis-Bateson, E., & Sagisaka, Y. (1992). Speech perception, production and lingus

Sussman, H. M., MacNeilage, P. F., & Hanson, R. J. (1973). Labial and mandibular dynamics during the production of bilabial consonants: Preliminary observations. Journal of Speech and Hearing Research, 16, 397-420.

Titze, I. R. (1984). Parameterization of the glottal area, glottal flow, and vocal fold contact area. Journal of the Acoustical Society of America, 75, 570-580.

Tohkura, Y., Vatikiotis-Bateson, E., & Sagisaka, Y. (1992). Speech perception, production and lingus

Sussman, H. M., MacNeilage, P. F., & Hanson, R. J. (1973). Labial and mandibular dynamics during the production of bilabial consonants: Preliminary observations. Journal of Speech and Hearing Research, 16, 397-420.

Titze, I. R. (1984). Parameterization of the glottal area, glottal flow, and vocal fold contact area. Journal of the Acoustical Society of America, 75, 570-580.

Tohkura, Y., Vatikiotis-Bateson, E., & Sagisaka, Y. (1992). Speech perception, production and lingus

Sussman, H. M., MacNeilage, P. F., & Hanson, R. J. (1973). Labial and mandibular dynamics during the production of bilabial consonants: Preliminary observations. Journal of Speech and Hearing Research, 16, 397-420.

Titze, I. R. (1984). Parameterization of the glottal area, glottal flow, and vocal fold contact area. Journal of the Acoustical Society of America, 75, 570-580.

Tohkura, Y., Vatikiotis-Bateson, E., & Sagisaka, Y. (1992). Speech perception, production and lingus

Sussman, H. M., MacNeilage, P. F., & Hanson, R. J. (1973). Labial and mandibular dynamics during the production of bilabial consonants: Preliminary observations. Journal of Speech and Hearing Research, 16, 397-420.

Titze, I. R. (1984). Parameterization of the glottal area, glottal flow, and vocal fold contact area. Journal of the Acoustical Society of America, 75, 570-580.

Tohkura, Y., Vatikiotis-Bateson, E., & Sagisaka, Y. (1992). Speech perception, production and lingus

Sussman, H. M., MacNeilage, P. F., & Hanson, R. J. (1973). Labial and mandibular dynamics during the production of bilabial consonants: Preliminary observations. Journal of Speech and Hearing Research, 16, 397-420.

Titze, I. R. (1984). Parameterization of the glottal area, glottal flow, and vocal fold contact area. Journal of the Acoustical Society of America, 75, 570-580.

Tohkura, Y., Vatikiotis-Bateson, E., & Sagisaka, Y. (1992). Speech perception, production and lingus

Sussman, H. M., MacNeilage, P. F., & Hanson, R. J. (1973). Labial and mandibular dynamics during the production of bilabial consonants: Preliminary observations. Journal of Speech and Hearing Research, 16, 397-420.

Titze, I. R. (198