Bootstrapping the lexicon: a computational model of infant speech segmentation

Cognition 83 (2):167-206 (2002)
  Copy   BIBTEX


Prelinguistic infants must find a way to isolate meaningful chunks from the continuous streams of speech that they hear. BootLex, a new model which uses distributional cues to build a lexicon, demonstrates how much can be accomplished using this single source of information. This conceptually simple probabilistic algorithm achieves significant segmentation results on various kinds of language corpora - English, Japanese, and Spanish; child- and adult-directed speech, and written texts; and several variations in coding structure - and reveals which statistical characteristics of the input have an influence on segmentation performance. BootLex is then compared, quantitatively and qualitatively, with three other groups of computational models of the same infant segmentation process, paying particular attention to functional characteristics of the models and their similarity to human cognition. Commonalities and contrasts among the models are discussed, as well as their implications both for theories of the cognitive problem of segmentation itself, and for the general enterprise of computational cognitive modeling



    Upload a copy of this work     Papers currently archived: 83,802

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

The feasibility of segmentation of protolanguage.István Zachar - 2011 - Interaction Studies 12 (1):1-35.
Which came first: Infants learning language or motherese?Heather Bortfeld - 2004 - Behavioral and Brain Sciences 27 (4):505-506.


Added to PP

29 (#430,529)

6 months
3 (#243,044)

Historical graph of downloads
How can I increase my downloads?