Learning words from sights and sounds: a computational model

Cognitive Science 26 (1):113-146 (2002)

Abstract

This paper presents an implemented computational model of word acquisition that learns directly from raw multimodal sensory input. Set in an information-theoretic framework, the model acquires a lexicon by finding and statistically modeling consistent cross‐modal structure. The model has been implemented in a system using novel speech processing, computer vision, and machine learning algorithms. In evaluations, the model successfully performed speech segmentation, word discovery, and visual categorization from spontaneous infant‐directed speech paired with video images of single objects. These results demonstrate the possibility of using state‐of‐the‐art techniques from sensory pattern recognition and machine learning to implement cognitive models that can process raw sensor data without the need for human transcription or labeling.

Similar books and articles

Complexity in Language Acquisition. Alexander Clark & Shalom Lappin - 2013 - Topics in Cognitive Science 5 (1):89-110.
Taking Semantics and Embodiment into Account. J. Stewart - 2013 - Constructivist Foundations 9 (1):139-141.

Analytics

Added to PP
2013-11-21
