Skip to main content
Log in

Where did language come from? Connecting sign, song, and speech in hominin evolution

  • Published:
Biology & Philosophy Aims and scope Submit manuscript

Abstract

Recently theorists have developed competing accounts of the origins and nature of protolanguage and the subsequent evolution of language. Debate over these accounts is lively. Participants ask: Is music a direct precursor of language? Were the first languages gestural? Or is language continuous with primate vocalizations, such as the alarm calls of vervets? In this article I survey the leading hypotheses and lines of evidence, favouring a largely gestural conception of protolanguage. However, the “sticking point” of gestural accounts, to use Robbins Burling’s phrase, is the need to explain how language shifted to a largely vocal medium. So with a critical eye I consider Michael Corballis’s most recent expression of his ideas about this transition (2017’s The Truth About Language: What It Is And Where It Came From). Corballis’s view is an excellent foil to mine and I present it as such. Contrary to Corballis’s account, and developing Burling’s conjecture that musicality played some role, I argue that the foundations of an evolving musicality (i.e., evolving largely independently of language) provided the means and medium for the shift from gestural to vocal dominance in language. In other words, I suggest that an independently evolving musicality prepared ancient hominins, morphologically and cognitively, for intentional articulate vocal production, enabling the evolution of speech.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1

Extracted from Perniss and Vigliocco (2014), reproducible under the terms of the Creative Commons Attribution License 3.0

Fig. 2

Image by Selket, available at https://commons.wikimedia.org/w/index.php?curid=1679336. Accessed 20 July 2017. Reproducible under the terms of the Creative Commons Attribution-ShareAlike License 3.0

Fig. 3

Extracted from Capek et al. (2008), reproduced here with permission of MIT Press

Similar content being viewed by others

Notes

  1. Terms like “protolanguage” and “language-ready” are a convenience, not meant to imply an element of purpose or teleology.

  2. Indeed, it is likely that language evolution took hold in the kinesic and oral-aural modalities together (see Kendon 2016) to some extent—and in coevolutionary tandem.

  3. Corballis (2017); for earlier versions see Corballis (2002, 2009).

  4. Fitch puts it like this: ‘To the extent that there was a lexicon, it was a simple list of tunes or “riffs”—complex, multi-unit phrases linked to whole, context-bound events’ (Fitch 2010, p. 476).

  5. Largely, but not exclusively, voluntary; recent evidence suggests great ape vocalization may be a little more flexible than previously thought. For discussion, see Kendon (2016); Irvine (2016).

  6. I thus resist the widespread (e.g. Kendon 2016) ‘gesture-first’ versus ‘voice-first’ nomenclature.

  7. Later work by Bickerton (e.g. 2009) explicitly allows for a role for gestures in establishing referential displacement. He envisions a scenario in which an individual has located a carcass somewhere in the distance and needs to recruit others to go and help exploit that carcass: ‘waving and screaming and pointing “Thataway!”, the elephant noise or the hippo noise or whatever it was could have only one meaning: dead megabeast, food for the taking only a short march away’ (2009, p. 161).

  8. Also laughter, sobbing, and so on; these reactive, affective vocalizations may be continuous with primate vocalization.

  9. For example: ‘We can’t tell, just by the sound of the words, what kind of motion is indicated by walk, run, swim, fly, or crawl. But in sign languages, some iconicity has often been retained, and the meanings of the signs for these motion-types can be more successfully guessed at’ (Hurford 2014, p. 107). Perniss and Vigliocco (2014) argue that iconicity played a key role in language evolution in establishing displacement (i.e., reference to that which is beyond the immediate here-and-now) and continues to play a key role in ontogeny in supporting the development of word-referentiality, crucial to vocabulary acquisition and meaningful communication.

  10. According to Thompson et al. (2012), children learning BSL tend to acquire iconic sign production and comprehension before that of non-iconic signs.

  11. The following is a typical articulation of this: ‘In a group of people just beginning to signal meanings to each other, many more meanings could be guessed from manual and facial gestures than from attempts to express them vocally. It would be easier to get a gestural language off the ground in the first place than a speech-based one’ (Hurford 2014, p. 107). Indeed we have seen the birth of Nicaraguan Sign Language, a newly established, full language, built from the idiosyncratic homesign gestures of deaf children (Senghas 1995; Senghas et al. 2004; see Begby 2017 for philosophical implications).

  12. ‘A transition is variation limited when the available genetic variation in the given lineage does not offer even a partial solution to the problem at hand, and it takes considerable time (in evolutionary terms) for the necessary variation to arise. By contrast, a transition is selection limited if the necessary genetic prerequisites of a possible transition are present, but the given transition is not selected for as this would require a specific ecological or social context’ (Számadó and Szathmáry 2006, p. 555).

  13. For more on great ape gestural communication in the context of language evolution see e.g. Moore (this issue), Sterelny (this issue).

  14. The evidence in its favour is not decisive, of course. However taken together it makes for a very compelling package, in my view.

  15. Such pantomimic additions to speech enhance comprehension when congruent, and stifle it when incongruent. It appears that the dynamic influence of gesture on speech and vice versa is obligatory (Kelly et al. 2010).

  16. Corballis co-opts Max Müller’s famous use of “Rubicon”—the divide between humans and other species with respect to language: ‘the one great barrier between brute and man is Language. Man speaks, and no brute has ever uttered a word. Language is our Rubicon, and no brute will dare cross it’ (Müller 1861, p. 340).

  17. For a short BBC video clip demonstrating the McGurk effect, see https://www.youtube.com/watch?v=G-lN8vWm3m0. Accessed 20 July 2017.

  18. E.g., that the two streams don’t in fact operate independently of each other, as some dual-stream theorists have supposed (see McIntosh and Schenk 2009).

  19. After all, chimps and bonobos vocalize a lot while communicating via gesture. For example: ‘Kanzi the bonobo vocalizes prolifically while communicating through gesture… but this is probably largely emotional and it is the gestures that provide the information’ (Corballis 2017, p. 163).

  20. Saussure, for one, famously thought of arbitrariness as a defining feature of language.

  21. I thank an anonymous referee for pushing this point.

  22. I am grateful to Lauren Reed for bringing this hypothesis to my attention.

  23. Capek et al. find that ‘speech-derived mouthings (DM [disambiguating mouth signs]) generated relatively greater activation in a somewhat circumscribed region of the left middle and posterior portions of the superior temporal cortex, whereas for mouth gestures (EP [echo phonology signs]), which are not speech-derived, there was relatively greater posterior activation in both hemispheres’ (Capek et al. 2008, p. 1231). This suggests that in EP the manual gesture drives the accompanying mouth gesture, providing a plausible cortical correlate. They continue: ‘Although mouth actions can be of many different sorts, DM and EP show systematic differences in terms of their functional cortical correlates; DM resembles speechreading more closely, whereas EP resembles manual-only signs’ (Capek et al. 2008, p. 1231).

  24. This hypothetical example shouldn’t seem too wild. There is evidence that ancient hominins were procuring aquatic prey including crocodiles, presumably a feat requiring coordinated action, 1.95 million years ago (Braun et al. 2010).

  25. Thanks to an anonymous referee for pushing this point.

References

  • Barham L (2013) From hand to handle: the first industrial revolution. Oxford University Press, Oxford

    Book  Google Scholar 

  • Begby E (2017) Language from the ground up: a study of homesign communication. Erkenntnis 82:693–714

    Article  Google Scholar 

  • Behne T, Carpenter M, Tomasello M (2014) Young children create iconic gestures to inform others. Dev Psychol 50(8):2049–2060

    Article  Google Scholar 

  • Bickerton D (1990) Language and species. University of Chicago Press, Chicago

    Google Scholar 

  • Bickerton D (2009) Adam’s tongue: how humans made language, how language made humans. Hill and Wang, New York

    Google Scholar 

  • Boesch C (1993) Aspects of transmission of tool-use in wild chimpanzees. In: Gibson KR, Ingold T (eds) Tools, language, and cognition in human evolution. Cambridge University Press, Cambridge, pp 171–183

    Google Scholar 

  • Boesch C, Boesch-Achermann H (2000) The chimpanzees of the Taï forest: behavioural ecology and evolution. Oxford University Press, Oxford

    Google Scholar 

  • Braun DR, Harris JWK, Levin NE et al (2010) Early hominin diet included diverse terrestrial and aquatic animals 1.95 Ma in East Turkana, Kenya. Proc Natl Acad Sci 107(22):10002–10007

    Article  Google Scholar 

  • Brown S (2000) The “musilanguage” model of music evolution”. In: Wallin NL, Merker B, Brown S (eds) The origins of music. MIT Press, Cambridge, pp 271–300

    Google Scholar 

  • Bunn HT, Pickering TR (2010a) Methodological recommendations for ungulate mortality analyses in paleoanthropology. Quatern Res 74(3):388–394

    Article  Google Scholar 

  • Bunn HT, Pickering TR (2010b) Bovid mortality profiles in paleoecological context falsify hypotheses of endurance running–hunting and passive scavenging by early Pleistocene hominins. Quatern Res 74(3):395–404

    Article  Google Scholar 

  • Burling R (2005) The talking ape. Oxford University Press, New York

    Google Scholar 

  • Capek CM et al (2008) Hand and mouth: cortical correlates of lexical processing in British Sign Language and speechreading English. J Cogn Neurosci 20(7):1220–1234

    Article  Google Scholar 

  • Corballis M (2002) From hand to mouth: the origins of language. Princeton University Press, Princeton

    Google Scholar 

  • Corballis M (2009) The evolution of language. Ann N Y Acad Sci 1156:19–43

    Article  Google Scholar 

  • Corballis M (2017) The truth about language: what it is and where it came from. Auckland University Press, Auckland

    Book  Google Scholar 

  • Crockford C, Wittig R, Mundy R, Zuberbühler K (2012) Wild chimpanzees inform ignorant group members of danger. Curr Biol 22(2):142–146

    Article  Google Scholar 

  • Darwin C (1871) The descent of man, and selection in relation to sex. Appleton & Co., New York

    Book  Google Scholar 

  • Dunbar R (1996) Grooming, gossip and the evolution of language. Harvard University Press, Cambridge

    Google Scholar 

  • Dunbar R (2014) Human evolution. Penguin, London

    Google Scholar 

  • Esteve-Gibert N, Prieto P (2014) Infants temporally coordinate gesture–speech combinations before they produce their first words. Speech Commun 57:301–316

    Article  Google Scholar 

  • Evans N (2009) Dying words: endangered languages and what they have to tell us. Wiley-Blackwell, Oxford

    Book  Google Scholar 

  • Eysenck M, Keane M (2015) Cognitive psychology: a student’s handbook, 7th edn. Psychology Press, New York

    Google Scholar 

  • Fitch W Tecumseh (2010) The evolution of language. Cambridge University Press, Cambridge

    Book  Google Scholar 

  • Fitch WT, Zuberbühler K (2013) Primate precursors to human language: beyond discontinuity. In: Altenmüller E, Schmidt S, Zimmerman E (eds) Evolution of emotional communication. Oxford University Press, Oxford, pp 26–48

    Chapter  Google Scholar 

  • Gallese V, Fadiga L, Fogassi L, Rizzolatti G (1996) Action recognition in the premotor cortex. Brain 119(2):593–609

    Article  Google Scholar 

  • Gamble C, Gowlett J, Dunbar R (2011) The social brain and the shape of the Palaeolithic. Camb Archaeol J 21(1):115–135

    Article  Google Scholar 

  • Gasser M (2004) The origins of arbitrariness in language. Proc Annu Meeting Cogn Sci Soc 26:434–439

    Google Scholar 

  • Geissmann T (2002) Duet-splitting and the evolution of gibbon songs. Biol Rev 77:57–76

    Article  Google Scholar 

  • Gentilucci M (2003) Grasp observation influences speech production. Eur J Neurosci 17(1):179–184

    Article  Google Scholar 

  • Goldin-Meadow S (2003) The resilience of language: what gesture creation in deaf children can tell us about how all children learn language. Psychology Press, New York

    Google Scholar 

  • Goldin-Meadow S, Alibali MW (2013) Gesture’s role in speaking, learning, and creating language. Annu Rev Psychol 64:257–283

    Article  Google Scholar 

  • Goodall J (1986) The chimpanzees of gombe: patterns of behavior. Harvard University Press, Cambridge

    Google Scholar 

  • Gowlett J, Gamble C, Dunbar R (2012) Human evolution and the archaeology of the social brain. Curr Anthropol 53(6):693–722

    Article  Google Scholar 

  • Hewes GW (1973) Primate communication and the gestural origin of language. Curr Anthropol 14:5–24

    Article  Google Scholar 

  • Hickok G (2012) The cortical organization of speech processing: feedback control and predictive coding the context of a dual-stream model. J Commun Disord 45:393–402

    Article  Google Scholar 

  • Hickok G, Poeppel D (2007) The cortical organization of speech processing. Nat Rev Neurosci 8(5):393–402

    Article  Google Scholar 

  • Hobaiter C, Byrne RW (2011) Serial gesturing by wild chimpanzees: its nature and function for communication. Anim Cogn 14:827–838

    Article  Google Scholar 

  • Hobaiter C, Byrne RW (2014) The meaning of chimpanzee gestures. Curr Biol 24:1–5

    Article  Google Scholar 

  • Hostetter AB, Cantero M, Hopkins WD (2001) Differential use of vocal and gestural communication by chimpanzees (Pan troglodytes) in response to the attentional status of a human (Homo sapiens). J Comp Psychol 115(4):337–343

    Article  Google Scholar 

  • Hurford J (2014) Origins of language. Oxford University Press, Oxford

    Google Scholar 

  • Irvine E (2016) Method and evidence: gesture and iconicity in the evolution of language. Mind Lang 31(2):221–247

    Article  Google Scholar 

  • Kalagher H, Yu C (2006) The effects of deictic pointing in word learning. In: Proceedings of the 5th international conference of development and learning. Indiana University Department of Psychological Brain Sciences, Bloomington. ISBN 0-9786456-0-X

  • Kelly SD, Özyürek A, Maris E (2010) Two sides of the same coin: speech and gesture mutually interact to enhance comprehension. Psychol Sci 21(2):260–267

    Article  Google Scholar 

  • Kendon A (2016) Reflections on the ‘gesture-first’ hypothesis of language origins. Psychon Bull Rev. https://doi.org/10.3758/s13423-016-1117-3

    Google Scholar 

  • Killin A (2016) Rethinking music’s status as adaptation versus technology: a niche construction perspective. Ethnomusicol Forum 25(2):210–233

    Article  Google Scholar 

  • Killin A (2017) Plio-Pleistocene foundations of hominin musicality: coevolution of cognition, sociality, and music. Biol Theory 12(4):222–235

    Article  Google Scholar 

  • Kimura D (1993) Neuromotor mechanisms in human communication. Oxford University Press, Oxford

    Book  Google Scholar 

  • Kohler E et al (2002) Hearing sounds, understanding actions: action representation in mirror neurons. Science 297:846–848

    Article  Google Scholar 

  • Kohn M, Mithen S (1999) Handaxes: products of sexual selection. Antiquity 73:518–526

    Article  Google Scholar 

  • Lawson FRS (2014) Is music an adaptation or a technology? Ethnomusicological perspectives from the analysis of Chinese shuochang. Ethnomusicol Forum 23(1):3–26

    Article  Google Scholar 

  • MacLarnon A, Hewitt G (1999) The evolution of human speech: the role of enhanced breathing control. Am J Phys Anthropol 109:341–363

    Article  Google Scholar 

  • Masataka N (1992) Motherese in a signed language. Infant Behav Dev 15(4):453–460

    Article  Google Scholar 

  • McGurk H, MacDonald, J (1976) Hearing lips and seeing voices. Nature 264(5588):746–748

    Article  Google Scholar 

  • McIntosh RD, Schenk T (2009) Two visual streams for perception and action: current trends. Neuropsychologia 47(6):1391–1396

    Article  Google Scholar 

  • Merker B (2012) The vocal learning constellation: imitation, ritual culture, encephalization. In: Bannan N (ed) Music, language and human evolution. Oxford University Press, Oxford, pp 215–260

    Chapter  Google Scholar 

  • Mithen S (2005) The singing Neanderthals. Weidenfeld & Nicolson, Great Britain

    Google Scholar 

  • Moore R (this issue) Social cognition, Stag Hunts, and the evolution of language. Biol Philos. https://doi.org/10.1007/s10539-017-9598-7

  • Morley I (2013) The prehistory of music: human evolution, archaeology, and the origins of musicality. Oxford University Press, Oxford

    Book  Google Scholar 

  • Müller M (1861) Lectures on the science of language (first series). Longman, Green, Longman and Roberts, London

    Google Scholar 

  • Newman A et al (2002) A critical period for right hemisphere recruitment in American Sign Language processing. Nat Neurosci 5:76–80

    Article  Google Scholar 

  • Newman A et al (2010) Prosodic and narrative processing in American Sign Language: an fMRI study. Neuroimage 52:669–676

    Article  Google Scholar 

  • Novick JM, Trueswell JC, Thompson-Schill SL (2010) Broca’s area and language processing: evidence for the cognitive control connection. Lang Linguist Compass 4(10):906–924

    Article  Google Scholar 

  • O’Neill M, Bard K, Linnell M, Fluck M (2005) Maternal gestures with 20-month-old infants in two contexts. Dev Sci 8(4):352–359

    Article  Google Scholar 

  • Perniss P, Vigliocco G (2014) The bridge of iconicity: from a world of experience to the experience of language. Philos Trans R Soc B 369:20130300. https://doi.org/10.1098/rstb.2013.0300

    Article  Google Scholar 

  • Petitto LA, Marentette PF (1991) Babbling in the manual mode: evidence for the ontogeny of language. Science 251:1493–1496

    Article  Google Scholar 

  • Pfenning AR et al (2014) Convergent transcriptional specializations in the brains of humans and song-learning birds. Science 346:1333–1346

    Article  Google Scholar 

  • Pika S, Mitani J (2006) Referential gestural communication in wild chimpanzees (Pan troglodytes). Curr Biol 16(6):R191–R192

    Article  Google Scholar 

  • Pollick A, de Waal F (2007) Ape gestures and language evolution. Proc Natl Acad Sci 104:8184–8189

    Article  Google Scholar 

  • Rizzolatti G, Arbib MA (1998) Language within our grasp. Trends Neurosci 21(5):188–194

    Article  Google Scholar 

  • Rizzolatti G, Craighero L (2004) The mirror-neuron system. Annu Rev Neurosci 27:169–192

    Article  Google Scholar 

  • Rizzolatti G et al (1988) Functional organization of inferior area 6 in the macaque monkey. II. Area F5 and the control of distal movements. Exp Brain Res 71:491–507

    Article  Google Scholar 

  • Russon AE, Andrews K (2011) Pantomime in great apes: evidence and implications. Commun Integr Biol 4(3):315–317

    Article  Google Scholar 

  • Schel AM, Townsend SW, Machanda Z, Zuberbühler K, Slocombe KE (2013) Chimpanzee alarm call production meets key criteria for intentionality. PLoS ONE 8(10):e76674

    Article  Google Scholar 

  • Senghas A (1995) The development of Nicaraguan sign language via the language acquisition process. In: MacLaughlin D, McEwan S (eds) BUCLD 19: proceedings of the 19th annual Boston University conference on language development. Cascadilla Press, Boston

    Google Scholar 

  • Senghas A, Kita S, Özyürek A (2004) Children creating core properties of language: evidence from an emerging sign language in Nicaragua. Science 305:1779–1782

    Article  Google Scholar 

  • Shaw-Williams K (2014) The social trackways theory of the evolution of cognition. Biol Theory 9(1):16–26

    Article  Google Scholar 

  • Shaw-Williams K (2017) The social trackways theory of the evolution of language. Biol Theory 12(4):195–210

    Article  Google Scholar 

  • Slocombe KE, Zuberbühler K (2007) Chimpanzees modify recruitment screams as a function of audience composition. Proc Natl Acad Sci USA 104:17228–17233

    Article  Google Scholar 

  • Sterelny K (2012) Language, gesture, skill: the coevolutionary foundations of language. Philos Trans R Soc B 367:2141–2151

    Article  Google Scholar 

  • Sterelny K (2016) Deacon’s challenge: from calls to words. Topoi 35(1):271–282

    Article  Google Scholar 

  • Sterelny K (2018) Language: from how-possibly to how-probably? In: Joyce R (ed) Routledge handbook of evolution and philosophy. Routledge, Oxon/NY, pp 120–135

    Google Scholar 

  • Sterelny K (this issue) From code to speaker meaning. Biol Philos. https://doi.org/10.1007/s10539-017-9597-8

  • Számadó S, Szathmáry E (2006) Selective scenarios for the emergence of natural language. Trends Ecol Evol 21(10):555–561

    Article  Google Scholar 

  • Thompson RL, Vinson DP, Woll B, Vigliocco G (2012) The road to language learning is iconic: evidence from British Sign Language. Psychol Sci 23(12):1443–1448

    Article  Google Scholar 

  • Tomasello M (2008) Origins of human communication. MIT Press, Cambridge

    Google Scholar 

  • Verhaegen M (2013) The aquatic ape evolves: common misconceptions and unproven assumptions about the so-called aquatic ape hypothesis. Hum Evol 28:237–266

    Google Scholar 

  • Watkins KE, Strafella AP, Paus T (2003) Seeing and hearing speech excites the motor system involved in speech production. Neuropsychologia 41(8):989–994

    Article  Google Scholar 

  • Watson SK, Townsend SW, Schel AM, Wilke C, Wallace EK, Chang L, West V, Slocombe KE (2015) Vocal learning in the functionally referential food grunts of chimpanzees. Curr Biol 25:495–499

    Article  Google Scholar 

  • Woll B (2001) The sign that dares to speak its name: echo phonology in British Sign Language (BSL). In: Boyes Braem P, Sutton-Spence R (eds) The hands are the head of the mouth: the mouth as articulator in sign language. Signum, Hamburg, pp 87–98

    Google Scholar 

  • Woll B (2014) Moving from hand to mouth: echo phonology and the origins of language. Front Psychol 5:662. https://doi.org/10.3389/fpsyg.2014.00662

    Article  Google Scholar 

Download references

Acknowledgements

Thanks are due to Simon Greenhill and Kim Sterelny for compiling this special issue. I am grateful to Michael Corballis, Kim Sterelny, and an anonymous referee for helpful critical comments on previous versions of the manuscript. I thank audiences at the Empirical Philosophy Workshop 2017 at Victoria University of Wellington, the Centre of Excellence for the Dynamics of Language Seminar Series at the Australian National University, and the Australasian Association of Philosophy Conference 2017 at the University of Adelaide for helpful questions and feedback on this and related material. In particular I thank Matt Spike, Kim Shaw-Williams, and Lauren Reed.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Anton Killin.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Killin, A. Where did language come from? Connecting sign, song, and speech in hominin evolution. Biol Philos 32, 759–778 (2017). https://doi.org/10.1007/s10539-017-9607-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10539-017-9607-x

Keywords

Navigation