Characterizing Motherese: On the Computational Structure of Child-Directed Language


We report a quantitative analysis of the cross-utterance coordination observed in child-directed language, where successive utterances often overlap in a manner that makes their constituent structure more prominent, and describe the application of a recently published unsupervised algorithm for grammar induction to the largest available corpus of such language, producing a grammar capable of accepting and generating novel wellformed sentences. We also introduce a new corpus-based method for assessing the precision and recall of an automatically acquired generative grammar without recourse to human judgment. The present work sets the stage for the eventual development of more powerful unsupervised algorithms for language acquisition, which would make use of the coordination structures present in natural child-directed speech.



    Upload a copy of this work     Papers currently archived: 76,442

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

  • Only published works are available at libraries.

Similar books and articles

Which came first: Infants learning language or motherese?Heather Bortfeld - 2004 - Behavioral and Brain Sciences 27 (4):505-506.
Finitely constrained classes of homogeneous directed graphs.Brenda J. Latka - 1994 - Journal of Symbolic Logic 59 (1):124-139.
Concepts, structures, and meanings.Grant R. Gillett - 1987 - Inquiry: An Interdisciplinary Journal of Philosophy 30 (March):101-112.


Added to PP

42 (#280,645)

6 months
1 (#454,876)

Historical graph of downloads
How can I increase my downloads?