David Bourget (Western Ontario)
David Chalmers (ANU, NYU)
Rafael De Clercq
Jack Alan Reynolds
Learn more about PhilPapers
Cognitive Science 35 (1):119-155 (2011)
This paper reconsiders the diphone-based word segmentation model of Cairns, Shillcock, Chater, and Levy (1997) and Hockema (2006), previously thought to be unlearnable. A statistically principled learning model is developed using Bayes’ theorem and reasonable assumptions about infants’ implicit knowledge. The ability to recover phrase-medial word boundaries is tested using phonetic corpora derived from spontaneous interactions with children and adults. The (unsupervised and semi-supervised) learning models are shown to exhibit several crucial properties. First, only a small amount of language exposure is required to achieve the model’s ceiling performance, equivalent to between 1 day and 1 month of caregiver input. Second, the models are robust to variation, both in the free parameter and the input representation. Finally, both the learning and baseline models exhibit undersegmentation, argued to have significant ramifications for speech processing as a whole
|Keywords||Word segmentation Language acquisition Computational model Bayesian Unsupervised learning|
|Categories||categorize this paper)|
Setup an account with your affiliations in order to access resources via your University's proxy server
Configure custom proxy (use this if your affiliation does not provide a proxy)
|Through your library|
References found in this work BETA
Adam Albright & Bruce Hayes (2003). Rules Vs. Analogy in English Past Tenses: A Computational/Experimental Study. Cognition 90 (2):119-161.
Eleanor Olds Batchelder (2002). Bootstrapping the Lexicon: A Computational Model of Infant Speech Segmentation. Cognition 83 (2):167-206.
Michael R. Brent & Timothy A. Cartwright (1996). Distributional Regularity and Phonotactic Constraints Are Useful for Segmentation. Cognition 61 (1-2):93-125.
Michael R. Brent & Jeffrey Mark Siskind (2001). The Role of Exposure to Isolated Words in Early Vocabulary Development. Cognition 81 (2):B33-B44.
Jeffrey L. Elman (1990). Finding Structure in Time. Cognitive Science 14 (2):179-211.
Citations of this work BETA
Andrew Martin, Sharon Peperkamp & Emmanuel Dupoux (2013). Learning Phonemes With a Proto-Lexicon. Cognitive Science 37 (1):103-124.
Similar books and articles
Heather Bortfeld (2004). Which Came First: Infants Learning Language or Motherese? Behavioral and Brain Sciences 27 (4):505-506.
Marc Ettlinger, Amy S. Finn & Carla L. Hudson Kam (2011). The Effect of Sonority on Word Segmentation: Evidence for the Use of a Phonological Universal. Cognitive Science 36 (4):655-673.
Erik D. Thiessen (2010). Effects of Visual Information on Adults' and Infants' Auditory Statistical Learning. Cognitive Science 34 (6):1093-1106.
Bryan R. Gibson, Timothy T. Rogers & Xiaojin Zhu (2013). Human Semi-Supervised Learning. Topics in Cognitive Science 5 (1):132-172.
Erik D. Thiessen & Philip I. Pavlik (2013). iMinerva: A Mathematical Model of Distributional Statistical Learning. Cognitive Science 37 (2):310-343.
Axel Cleeremans (1993). Mechanisms of Implicit Learning: Connectionist Models of Sequence Processing. MIT Press.
Sean Fulop & Nick Chater (2013). Editors' Introduction: Why Formal Learning Theory Matters for Cognitive Science. Topics in Cognitive Science 5 (1):3-12.
Jukka Corander & Pekka Marttinen (2006). Bayesian Model Learning Based on Predictive Entropy. Journal of Logic, Language and Information 15 (1-2):5-20.
Archana Balyan, S. S. Agrawal & Amita Dev (2012). Automatic Phonetic Segmentation of Hindi Speech Using Hidden Markov Model. AI and Society 27 (4):543-549.
Keith S. Apfelbaum & Bob McMurray (2011). Using Variability to Guide Dimensional Weighting: Associative Mechanisms in Early Word Learning. Cognitive Science 35 (6):1105-1138.
Pierre Barbaroux & Gilles Enée (2005). Spontaneous Coordination and Evolutionary Learning Processes in an Agent-Based Model. Mind and Society 4 (2):179-195.
Scott Moss & Bruce Edmonds (1994). Modelling Learning as Modelling. Philosophical Explorations.
Added to index2010-12-10
Total downloads3 ( #303,907 of 1,099,862 )
Recent downloads (6 months)1 ( #303,846 of 1,099,862 )
How can I increase my downloads?