Balancing Effort and Information Transmission During Language Acquisition: Evidence From Word Order and Case Marking.

Fedzechkina M; Newport EL; Jaeger TF

doi:10.1111/cogs.12346

Balancing Effort and Information Transmission During Language Acquisition: Evidence From Word Order and Case Marking.

Fedzechkina M ¹,

Newport EL ²,

Jaeger TF ³

Affiliations

1. Department of Psychology, University of Pennsylvania.
Authors
Fedzechkina M¹
(1 author)
2. Department of Neurology, Center for Brain Plasticity and Recovery, Georgetown University.
Authors
Newport EL²
(1 author)
3. Department of Brain & Cognitive Sciences, University of Rochester.
Authors
Jaeger TF³
(1 author)

ORCIDs linked to this article

Jaeger TF | 0000-0002-1158-7308

Cognitive Science, 22 Feb 2016, 41(2):416-446
https://doi.org/10.1111/cogs.12346 PMID: 26901374 PMCID: PMC4993698

Free full text in Europe PMC

Abstract

Across languages of the world, some grammatical patterns have been argued to be more common than expected by chance. These are sometimes referred to as (statistical) language universals. One such universal is the correlation between constituent order freedom and the presence of a case system in a language. Here, we explore whether this correlation can be explained by a bias to balance production effort and informativity of cues to grammatical function. Two groups of learners were presented with miniature artificial languages containing optional case marking and either flexible or fixed constituent order. Learners of the flexible order language used case marking significantly more often. This result parallels the typological correlation between constituent order flexibility and the presence of case marking in a language and provides a possible explanation for the historical development of Old English to Modern English, from flexible constituent order with case marking to relatively fixed order without case marking. In addition, learners of the flexible order language conditioned case marking on constituent order, using more case marking with the cross-linguistically less frequent order, again mirroring typological data. These results suggest that some cross-linguistic generalizations originate in functionally motivated biases operating during language learning.

Free full text

Cogn Sci. Author manuscript; available in PMC 2018 Mar 1.

Published in final edited form as:

Cogn Sci. 2017 Mar; 41(2): 416–446.

Published online 2016 Feb 22. https://doi.org/10.1111/cogs.12346

PMCID: PMC4993698

NIHMSID: NIHMS745557

PMID: 26901374

Balancing effort and information transmission during language acquisition: Evidence from word order and case marking

Maryia Fedzechkina,^a,¹ Elissa L. Newport,^b and T. Florian Jaeger^c,^d,^e

Author information Copyright and License information Disclaimer

The publisher's final edited version of this article is available free at Cogn Sci

See other articles in PMC that cite the published article.

Go to:

Abstract

Across languages of the world, some grammatical patterns have been argued to be more common than expected by chance. These are sometimes referred to as (statistical) language universals. One such universal is the correlation between constituent order freedom and the presence of a case system in a language. Here we explore whether this correlation can be explained by a bias to balance production effort and informativity of cues to grammatical function. Two groups of learners were presented with miniature artificial languages containing optional case marking and either flexible or fixed constituent order. Learners of the flexible order language used case marking significantly more often. This result parallels the typological correlation between constituent order flexibility and the presence of case marking in a language and provides a possible explanation for the historical development of Old English to Modern English, from flexible constituent order with case marking to relatively fixed order without case marking. Additionally, learners of the flexible order language conditioned case marking on constituent order, using more case marking with the cross-linguistically less frequent order, again mirroring typological data. These results suggest that some cross-linguistic generalizations originate in functionally motivated biases operating during language learning.

Keywords: language acquisition, learning biases, language universals, efficient information transfer, communicative pressures

Go to:

1. Introduction

As observed by linguist Joseph Greenberg (Greenberg, 1963), certain grammatical properties tend to co-occur across languages. These statistical tendencies that occur with higher than chance frequency cross-linguistically often take the form of implicational statements (implicational language universals^¹): If a language has property A, then it will most likely have property B. The explanation for such (hypothesized) universals has been heavily debated in linguistics and cognitive science. Most theories agree that linguistic universals originate at the level of the individual and suggest that language structures are shaped by biases and limitations on human cognitive systems (Chomsky, 1965; Christiansen & Chater, 2008; Deacon, 1997; Pinker, 1984, but see Dunn et al., 2011; Evans & Levinson, 2009 for arguments that language universals originate outside of the cognitive system). Views differ, however, as to whether these biases are specific to language (Chomsky, 1965; Fodor, 2001; Pinker, 1984) or rather arise from constraints shared by (at least some) other cognitive systems such as memory or perception (e.g., Bates & MacWhinney, 1982; Christiansen & Chater, 2008). Under the latter view, cognitively non-arbitrary pressures on language acquisition and use shape the historical development of languages over time, providing an explanation for cross-linguistic generalizations (Bever, 1970; Bybee, 2007; Christiansen & Chater, 2008; Deacon, 1997; Givón, 1991; Hawkins, 2004; Newport, 1981, 1990; Slobin, 1973; Zipf, 1949, for a review see Jaeger & Tily, 2010).

One example of a linguistic universal that has been of long-standing interest in linguistic typology is the inverse correlation between case and constituent order as means of encoding grammatical function assignment^². Typological and historical studies as well as computational simulations have supported the hypothesized trade-off between these two cues. For example, typological studies have found that languages with flexible constituent order often use morphological means, such as case, to mark grammatical function assignment (e.g., German, Japanese, and Russian), whereas languages with fixed constituent order (such as English and Mandarin) often lack productive case marking (Baugh & Cable, 1993; Blake, 2001; Sinnemäki, 2008; for a related discussion see also Siewierska, 1998)^³. Historical studies have found that case marking and constituent order flexibility trade off over time, such as in the historical development from Old English (a language with flexible constituent order and rich case marking) to Modern English (a language with fixed constituent order and a rudimentary case system) (Marchand, 1951; Sapir, 1921; Traugott, 1972; see also Tily, 2010).

However, typological and historical correlations should be interpreted with caution. First, any approach relying solely on typological or historical data suffers from sparsity of independent data points: Most or even all languages are more or less directly genetically related. Language contact (sometimes extending over centuries) can further diminish independence. This drastically reduces the true effective sample size available for statistical tests of hypothesized universals (for related discussion, see Cysouw, 2010; Dryer, 1989; Jaeger, Graff, Croft, & Pontillo, 2011; Rafferty, Griffiths, & Klein, 2014). Indeed, recent more statistically advanced analyses that adequately discount dependencies between languages have called into question the validity of many implicational universals (Dunn, Greenhill, Levinson, & Gray, 2011; but see Croft, Bhattacharya, Kleinschmidt, Smith, & Jaeger, 2011 for discussion). Specifically with regard to the correlation between case and constituent order freedom, some studies have found stronger correlations between constituent order flexibility and other morphological means (e.g., agreement, Siewierska, 1998). These studies have left open the question of whether the observed correlation between constituent order flexibility and case marking holds once these other factors are controlled for. Similarly, the historical link between the loss of case marking and the loss of word order flexibility has been called into question (Detges, 2009; Pintzuk, 2002).

More importantly, while typological studies can uncover a correlation between biases in individual users and patterns in cross-linguistic diversity, they cannot address questions about the underlying causal relationship between the two. Thus, despite the long-standing interest in the trade-off between constituent order and case marking, it is still unclear whether this phenomenon indeed originates in biases of individual language users.

Here we draw on an emerging complement to typological data, which allows us to address questions about the causes of linguistic universals. The approach we employ is miniature language learning. In this paradigm child or adult learners are exposed to experimentally designed languages that are small enough to be learned in the lab. In contrast to a typological approach, a miniature language learning paradigm does not suffer from data sparsity: Within the limitations of the paradigm, novel data can be generated (for a discussion of these limitations, see Fedzechkina, Newport, & Jaeger, in press). Perhaps more importantly, a miniature language learning approach also allows us to directly test hypotheses about the underlying causes of linguistic universals by studying patterns in the acquisition and use of novel miniature languages.

Recent findings from this paradigm provide behavioral evidence for language universals. In a number of such experiments, typologically frequent patterns are learned more easily or are preferred over cross-linguistically less frequent alternatives, thereby providing evidence for the hypothesis that patterns in cross-linguistic diversity originate as biases in language users. For example, Culbertson, Smolensky, and Legendre (2012) behaviorally replicate Greenberg’s universal #18. In a miniature language learning experiment, they find that adult learners preferentially acquire languages in which adjective-noun and numeral-noun orders are harmonic (both noun-first or both noun-last); and Culbertson and Newport (2015) show that child learners do this even more strongly. A small but growing number of studies have observed similar parallels between typological data and biases operating during acquisition of miniature languages (e.g., for phonology: Finley & Badecker, 2008, Wilson, 2006; word formation: Newport & Aslin, 2004; morphology: Fedzechkina, Jaeger, & Newport, 2012; Hudson Kam & Newport, 2005, 2009; Hupp, Sloutsky, & Culicover, 2009; St Clair, Monaghan, & Ramscar, 2009; syntax: Culbertson et al., 2012; Culbertson & Newport, 2015; Morgan, Meier, & Newport, 1987; Tily, Frank, & Jaeger, 2011).

Only a few artificial language learning studies have directly addressed the role of functionally motivated learning biases in shaping language structures. In previous work, we have studied one instance in which we hypothesized that grammatical properties associated with efficient communication originate at least in part in acquisition. Fedzechkina et al. (2012) presented adult participants with miniature languages that had flexible constituent order and optional case marking on either the object or the subject of the sentence, which also varied in animacy. Natural languages tend to use case marking efficiently – i.e., they typically condition case marking on semantic properties of the referent, such as animacy, and employ overt case marking when these semantic properties are more likely to bias the listener away from the intended interpretation (cf. Kurumada & Jaeger, 2015). The artificial languages used by Fedzechkina et al. (2012) differed from naturally occurring ones in that they did not condition the use of case marking on the referent’s animacy. However, learners did not acquire these inefficient morphological systems veridically. Instead, learners deviated from the input and used more case marking for animate (atypical) objects than for inanimate (typical) objects, making the case systems more efficient (Gibson et al., 2013; Jaeger, 2010; Kurumada & Jaeger, 2015) and more in line with typological data (Aissen, 2003; Fry, 2003; Lee, 2006; Silverstein, 1976).

Here we extend this line of research to ask whether typological patterns of cue trade-offs at different levels of linguistic organization (e.g., syntax and morphology) can also be explained by preferences for efficient communicative systems. We present new evidence that the inverse correlation between case and constituent order flexibility emerges during language learning. This, we argue, can be understood as a trade-off between effort and robust information transmission (cf., Jäger, 2007; Kurumada & Jaeger, 2015; Zipf, 1949). In languages with relatively fixed constituent order, the ordering of arguments is highly informative about their grammatical function (which is the subject and which is the object of the clause). Hence there is little uncertainty about the intended meaning after hearing a sentence in such a language (formally, the conditional entropy over grammatical function assignments is low), and morphological marking would provide little additional information beyond that conveyed by constituent order. In languages with flexible constituent order, however, the ordering of arguments alone leaves greater uncertainty about the intended grammatical function assignment. For such languages, morphological marking is an informative cue to grammatical function. In both types of languages the effort required for the production of morphological marking is identical; but the utility of morphological marking –i.e., the trade-off between production effort and the likelihood of successful message transmission—is higher in flexible constituent order languages (for related arguments, see also Haspelmath, 1999; Hawkins, 2004; Jäger, 2007; Kurumada & Jaeger, 2015; Tily, 2010).

Similarly – and discussed in more detail below — there is an inverse correlation between case marking and constituent order flexibility and learnability: flexible constituent order languages are learned more successfully by connectionist networks if case marking is available (Lupyan & Christiansen, 2002; Van Everbroeck, 2003). The current work tests whether these simulations are supported by data from human language learners.

As in our earlier work (Fedzechkina et al., 2012), we test whether learners exposed to a communicatively imperfect input language will deviate from their input to ‘improve’ the language, and whether they will do so by reducing average production effort, reducing average uncertainty about the intended meaning, or both. The present experiment tests this prediction by directly investigating the trade-off between constituent order (a syntactic cue) and case marking (a morphological cue). We further investigate how case is used in the flexible constituent order language. We ask, in particular, whether learners increase communicative success by favoring robust information transfer over effort and regularize case marking in the language overall, or whether they favor an efficient balance between these two goals by conditioning case marking on constituent order.

Go to:

2. Method

2.1. Participants

Monolingual native speakers of English were recruited from the University of Rochester. Recruiting and execution of this study were approved by the Research Subjects Review Board of the University of Rochester. Following our previous work (Fedzechkina et al., 2012), recruitment continued until the number of participants who successfully learned the miniature languages reached 20 in each condition. A total of 52 volunteers were recruited for the experiment, all of whom received $25 for their time. 12 participants were excluded from the analysis for the following reasons: failure to achieve 65% accuracy on the comprehension test (10 participants); computer error (1 participant); being bilingual (1 participant). This left the data from 40 participants for the analysis.

2.2. Design and Materials

Each of the two miniature artificial languages contained 10 novel content words (4 verbs and 6 nouns) and a case marker ‘kah’ (see Table 1). All words were phonotactically legal non-words of English. Individual words were synthesized using the AT&T speech synthesizer (voice ‘Crystal’) and concatenated into sentences with 35ms silence between the words using Praat (Boersma, 2001). This procedure ensured that the stimuli did not contain prosodic cues to sentence meaning. All sentences described short videos created using Poser Pro software that depicted simple transitive events such as ‘hug’ or ‘poke’ performed by two male actors such as ‘chef’ or ‘referee’ (see Fig. 1 and Fig. 2 for example stimuli).

An external file that holds a picture, illustration, etc.
Object name is nihms745557f1.jpg

Fig. 1

Illustration of form to grammatical function mappings in two extreme scenarios (completely fixed and completely free constituent order languages) in the experiment. Pictures are still images of sample videos with their descriptions in the miniature language (not shown to participants). Arrows indicate form to meaning mappings in the two languages. Solid arrows indicate one-to-one form-meaning mappings such as in the absence of constituent order variation or in the presence of case marking (underlined). Dashed arrows indicate one-to-many form-meaning mappings, such as in the absence of case marking for variable constituent order. The strikethrough form refers to the ungrammatical (OSV) sentences in the fixed (SOV-only) language.

An external file that holds a picture, illustration, etc.
Object name is nihms745557f2.jpg

Fig. 2

Experimental procedure. Pictures are still images of sample videos used in the experiment

Table 1

The artificial lexicon.

Nouns	Verbs	Case-marker
glim	geed	kah
flugit	kleidum
spad	zamper
bliffen	shen
norg
melnawg

All verbs occurred equally frequently within each language overall and with each constituent order allowed by the language. All nouns occurred equally often in the subject and object position with each verb.

Participants were randomly assigned to one of the two language conditions. Both languages contained optional case marking: 67% of all objects were marked with a case-maker ‘kah’ and 33% of objects had no overt marking. Subjects were never case-marked in either of the languages. Both languages had head-final constituent order (i.e., the verb followed both the subject and the object). This constituent order was chosen since it is cross-linguistically more common in languages with a case system (Dryer & Haspelmath, 2011; Greenberg, 1963).

The two languages differed in their constituent order consistency (and therefore in the amount of information that constituent order conveyed about sentence meaning). In the flexible constituent order language, subject-object-verb (SOV) and object-subject-verb (OSV) orders occurred equally frequently in the input. Thus, in this language, constituent order was uninformative about grammatical function assignment, and case marking added important information for decoding sentence meaning.

The fixed constituent order language did not contain constituent order variation: SOV constituent order occurred in 100% of the input sentences. In this language, constituent order was highly informative and always disambiguated grammatical function assignment; case marking was a redundant cue to grammatical function.

If language acquisition is indeed biased towards efficient communicative systems, we expect this to be reflected in the languages that learners acquire. This makes two predictions. First and most importantly, we predicted that learners would use more case marking when it is informative of grammatical function assignment (flexible constituent order language) than when it is a redundant cue (fixed constituent order language). Second, if learners strongly disprefer to expend effort when it is not required for successful communication, we might see that the redundant case marking is frequently omitted by learners of the fixed constituent order language, compared to the input language.

2.3. Procedure

The procedure was adopted from Hudson Kam and Newport (2005, 2009). At the beginning of the experiment, participants were informed that they would be learning a novel ‘alien’ language by watching short videos and hearing their descriptions in this language, but they received no explicit instructions about the structure of the language.

Participants completed 3 experimental sessions (each lasting 30–35min) spread over 3 days, with at most 1 day between the sessions. During each visit, participants completed a series of exposure and test blocks that focused on noun and sentence learning. All sessions followed the same overall procedure; the number of blocks, however, differed from session to session (Fig. 2).

Noun exposure and tests

Noun exposure

Each experimental session began with a noun exposure block, where participants were presented with pictures of characters accompanied by their names in the novel language and were instructed to repeat the names to facilitate learning.

Noun comprehension

The 2-alternative forced choice noun comprehension block followed. Participants heard a name of a character in the novel language accompanied by two pictures and were asked to choose the correct picture. Feedback was provided after each trial.

Noun production

Participants were shown pictures of characters one at a time and asked to provide a label for them in the novel language. Feedback on correctness was provided on each trial.

Noun exposure and comprehension blocks included 12 trials each on Day 1 and 6 trials each on Days 2–3. The noun production block included 6 trials on all days of training. On Day 1, the three blocks were repeated immediately after completion of the noun production test. The noun exposure and comprehension blocks were also presented immediately before the sentence production test. On Days 2–3, participants completed only the noun production block before the sentence production block.

Sentence exposure and tests

Sentence exposure

The sentence exposure block followed noun exposure and tests on all days of training. Participants were shown short computer-generated videos accompanied by their descriptions in the novel language and were asked to repeat the descriptions out loud to facilitate learning. The video and sound stimuli were presented simultaneously: the description was played as the action unfolded. Participants were allowed to view the action and listen to the novel description as many times as they liked during the first sentence exposure block on Day 1; replay was disabled for all other blocks during the experiment.

Sentence comprehension

On all days of training, sentence exposure was followed by a sentence comprehension block. On each trial, participants were shown two previously unseen videos accompanied by a sentence in the novel language. Both videos depicted the same action performed by the same characters, but the roles of the actor and patient were reversed in the two videos. Participants were asked to choose the video that best matched the sentence they heard. No feedback on correctness was provided during the test.

On all days of training, participants were presented with two sets of two sentence exposure blocks and one sentence comprehension block (24 trials each).

Sentence production

Each experimental session ended with a sentence production block (48 trials total). Participants viewed previously unseen videos and were asked to describe them in the novel language. To facilitate production, participants were auditorily presented with a novel verb prompt. No feedback was provided during this test.

Go to:

3. Results

Before turning to the central prediction of this study – differential case marker use depending on constituent order flexibility – we describe how our data were scored and discuss learners’ overall performance. Then we analyze case marking preferences in participants’ productions. To more fully capture how language learners are affected by the hypothesized bias to trade off robust information transmission against production effort, we provide several complementary analyses. First, we present the most common type of analysis – analyzing patterns of case use across participants. This provides a clear picture of how learners change the use of case marking in their productions compared to the input they receive. This analysis, however, does not capture the joint effect of constituent order and case marking in the hypothesized trade-off. There are many ways to make the input languages more communicatively efficient, and the most efficient use of case marking also depends on how each individual learner uses constituent order. For example, some learners might create more communicatively efficient languages by fixing constituent order and dropping case marking, while others might maintain constituent order variation and add case marking. Indeed, we observe considerable individual differences between learners in both constituent order and case marking preferences. This leads us to suggest a unifying information-theoretic approach that will directly assess the extent to which participants trade off effort against robust information transmission through the joint use of constituent order and case marking.

3.1. Scoring

Comprehension accuracy

Participants who did not achieve 65% accuracy on the comprehension test on the final day of training were removed from the analysis. For this purpose we analyzed responses only on case-marked (i.e., unambiguous) trials. This excluded 10 participants, all in the flexible order language. This is not surprising since participants were monolingual native speakers of English, a language that has no constituent order variation or case marking. This makes the flexible order language considerably harder to learn.

For the remaining 40 participants, mean comprehension accuracy was 97% across languages (99% for the fixed constituent order language and 96% for the flexible constituent order language) on the final day of training. The results reported below did not change when participants who failed to pass the 65% criterion were included in the analysis.

Production accuracy

For each trial, we scored constituent order used in the utterance, the presence of case marking on the object, lexical (using incorrect vocabulary) and grammatical mistakes (using a constituent order not allowed by the language or using a case marker on a constituent other than the object). On a small number of utterances, participants mispronounced the name of a referent or an action. If it was impossible to determine the constituent order used in the utterance (e.g., when both referent names were mispronounced), the production was scored as both lexically and grammatically incorrect. If it was still possible to determine the constituent order used in the utterance (e.g., the name of only one referent was incorrect), the production was coded as a lexical mistake and was scored for grammatical accuracy.

Both languages were acquired with a high degree of accuracy. On the final day of training, participants made 1.3% lexical mistakes across languages (1% in the fixed constituent order language and 1.6% in the flexible constituent order language) and 6.2% grammatical mistakes (all in the flexible constituent order language), suggesting that the task was feasible for our participants. All analyses reported below are based on grammatically correct productions only. The same results were obtained when productions containing lexical mistakes were also removed from the analysis.

3.2. Constituent order in production

Since learners in our experiment received no instruction as to which structures to use in their own productions, they could have made languages more efficient by changing the amount of constituent order flexibility allowed in the languages (e.g., by making the flexible constituent order language more fixed or the opposite) or by differentially using case marking depending on the language condition. Learners did not vary the constituent order properties of the input languages (see Fig. 3): Participants in both language conditions maintained the input constituent order proportions on all days of training.

An external file that holds a picture, illustration, etc.
Object name is nihms745557f3.jpg

Fig. 3

Constituent order use by language condition. The error bars represent bootstrapped 95% confidence intervals. The dashed lines indicate the input proportions (different across conditions).

Having established that learners acquired the meaning of case marking in our experiment and matched the constituent order distribution in the input, we now turn to the analysis of case marker preferences. We first explore the main prediction of this study – whether learners use case marking differentially depending on the amount of constituent order variability in the language. We then turn to a more detailed analysis of how case marking is used.

3.3. Case marker use in production

The central prediction of the current study is that language learners are biased against excessive redundancy in linguistic systems and use additional cues to grammatical function only if the existing cues do not provide sufficient information for successful recovery of sentence meaning. Note that while we expect learners to trade off the information provided by cues to sentence meaning, we do not expect them to completely remove all redundancy from the grammatical system. That is, we hypothesize differential case marker use depending on the amount of constituent order flexibility allowed by the language, but not the categorical absence of case marking in the fixed constituent order language. Some amount of redundancy is expected in an efficient linguistic system since communication takes place in the presence of noise and thus each of the probabilistic cues to grammatical function assignment has a certain probability of being misperceived during communication.

To test this prediction, we used a mixed logit model (Breslow & Clayton, 1993; Jaeger, 2008) to regress the presence of case marking on the object onto full factorial design (all main effects and interactions) of language condition (fixed vs. flexible constituent order) and day of training (1–3). All analyses reported below contained the maximal random effects structure justified by the data based on backward model comparison. All results also hold when the fullest converging random effects structure is used.

There was a significant main effect of language condition (see Fig. 4): Learners of the flexible constituent order language used significantly more case marking in their productions than learners of the fixed constituent order language ( [beta] = 1.45, z=2.24, p<0.05). Language condition interacted with Day 2 ( [beta] =0.46, z=3.4, p<0.001) and Day 3 of training ( [beta] =0.25, z=2.75, p<0.01). Simple effects test revealed that the difference between the two language conditions (fixed vs. flexible constituent order) was significant on Day 2 ( [beta] =1.65, z=2.5, p<0.05) and Day 3 ( [beta] =1.94, z=2.72, p<0.01) of training. Thus, as predicted, learners of the flexible constituent order language were more likely to use case marking.

An external file that holds a picture, illustration, etc.
Object name is nihms745557f4.jpg

Fig. 4

Case marker use by language condition. The error bars represent bootstrapped 95% confidence intervals. The dashed line represents the input proportion (equal across conditions).

Second, we also predicted that learners of the fixed constituent order language would reduce the amount of case marking compared to the input, since the production of case consumes effort but adds no information above that already conveyed by constituent order. Indeed, learners of the fixed constituent order language used case marking in their own productions significantly below the input on all days of training, supporting our hypothesis (Day 1: 50% case marking in production, marginally lower than 67% input proportion [Wilcoxon Signed-Rank test over by-participant proportions: V=59, Z=−1.73, p=0.08]; Day 2: 45% case marking in production, significantly lower than the input [V=44, Z=−2.06, p<0.05]; Day 3: 41% case marking in production, significantly lower than the input [V=37, Z=−2.56, p<0.05]).

With these two basic predictions confirmed, we next investigated in more detail how speakers of the flexible order language used case marking.

3.4. How is case marking used in the flexible order language?

There are at least two ways in which learners could increase the robustness of information transmission in the flexible order language. First, learners can regularize case marking in the language overall (i.e., produce it more frequently than in the input overall). This change would increase production effort compared to the input since all objects will be case-marked. Alternatively, learners could condition case marking on constituent order – for example, by always using it with one variant but never with the other (perfectly asymmetric case marking). This would result in low uncertainty about the intended meaning since the absence of case marking in this instance is highly informative of grammatical function assignment. Crucially, this latter strategy is more efficient than the former since it reduces case marking to a subset of all sentential objects. This makes asymmetric case marking a more robust system without a concomitant increase in effort compared to the input. Note that even imperfect case marking asymmetry (i.e., marking objects more often in one constituent order than the other) can reduce the uncertainty about grammatical function assignment, compared to perfectly symmetric case marking. Next we explore which of these strategies learners used in our experiment.

Overall, the use of case by learners of the flexible constituent order language was the same as in the input on all days of training, suggesting that learners did not adopt the full case regularization strategy (Day 1: 55% case marking in production, not significantly different from 67% input proportion [Wilcoxon Signed-Rank test over by-participant proportions: V=69, Z=−1.34, p=0.18]; Day 2: 72% case marking in production, not significantly different from the input [V=142, Z=1.38, p=0.17]; Day 3: 71% case marking in production, not significantly different from the input [V=130, Z=0.93, p=0.36]).

Did learners prefer a more efficient strategy of conditioning case marking on constituent order, leading to asymmetric case marking? We regressed the presence of case marking onto constituent order (SOV/OSV), day of training (1–3), and their interaction. Learners did not use case marking uniformly across the two constituent orders, but instead produced case significantly more often in OSV sentences ( [beta] =1.11, z=17, p<0.001; see Fig. 5). A significantly higher proportion of object case marking in OSV sentences compared to SOV sentences was observed on every day of training (simple effects for Day 1: [ [beta] =1.53, z=12.6, p<0.001]; Day 2 [ [beta] =0.9, z=8.6, p<0.001]; Day 3 [ [beta] =0.91, z=8.23, p<0.001]). We also compared learners’ case use to input: They produced significantly more case marking than the input in OSV sentences and tended to match the input proportion of case marking in SOV sentences on most days of training (for OSV sentences: Day 1: 81% case marking in production, significantly higher than 67% input proportion [Wilcoxon Signed-Rank test over by-participant proportions: V=165, Z=−2.28, p<0.05]; Day 2: 86% case marking in production, significantly higher than the input [V=177, Z=3.31, p<0.001]; Day 3: 85% case marking in production, significantly higher than the input [V=158, Z=2.56, p<0.05]; for SOV sentences: Day 1: 28% case marking in production, significantly lower than 67% input proportion [V=33, Z=−2.7, p<0.01]; Day 2: 54% case marking in production, not significantly different from the input [V=60, Z=−1.41, p=0.16]; Day 3: 55% case marking in production, not significantly different from the input [V=69, Z=−1.04, p=0.3]).

An external file that holds a picture, illustration, etc.
Object name is nihms745557f5.jpg

Fig. 5

Case marker use by sentence constituent order (flexible constituent order language). The error bars represent bootstrapped 95% confidence intervals. The dashed line represents the input proportion (equal across constituent orders).

The analyses presented so far provide evidence in support of the hypothesis that learners trade off robust information transmission against production effort. In understanding how languages change over time as a function of this pressure, it is also of interest to understand individual learners’ behavior. It is possible, for example, that all learners follow the same overall preference in constituent order and case marking use. Alternatively, all learners could follow the same general bias to balance effort and informativity, but do so in a variety of different ways. This would highlight the role of individual innovations in language change. Yet another possibility is that the across-participant averages we have discussed so far draw a misleading picture: it is theoretically possible that individual participants never exhibit the hypothesized trade-off.

In the next two sections, we first illustrate that a high degree of between-participant variability applies to our data, as is typical in this type of experiments (Culbertson et al., 2012; Fedzechkina et al., 2012; Hudson Kam & Newport, 2005, 2009): Learners had dramatically different case marking and constituent order preferences. We then present a novel information-theoretic perspective on the data showing that the superficially differing individual strategies are guided by a single underlying principle of trading off effort against robust information transmission.

3.5. Individual learners’ preferences

There was considerable variation both in constituent order and case marking preferences. We describe these variable patterns in turn. While the majority of learners of the flexible order language matched the input distribution of constituent order (see Fig. 6), a number of learners substantially deviated from the input either in the direction of SOV or OSV. In particular, three learners adopted SOV as the dominant constituent order in their productions and five learners regularized OSV constituent order; the remaining participants tended to stay close to the input.

An external file that holds a picture, illustration, etc.
Object name is nihms745557f6.jpg

Fig. 6

Constituent order preferences of individual learners on the final (3^rd) day of training in the fixed (top panel) and flexible (bottom panel) constituent order languages. The dashed lines indicate the input proportion.

There was no between-participant variability in the fixed constituent order language: All learners used exclusively SOV constituent order in their productions. This learning outcome is in line with prior work suggesting that learners acquire consistent input very well and are not likely to introduce innovations into a perfectly consistent linguistic system (e.g., St Clair et al., 2009; Tily et al., 2011).

There was also considerable variation in case marker preferences among individual participants in the two languages (see Fig. 7). While the majority of learners of the fixed constituent order language produced case marking below the input (and eight learners never used case marking at all), three participants produced substantially more case markers than the input proportion. Similarly, in the flexible constituent order language, most learners tended to produce case above the input (and several learners used it in all their productions), while one learner never used case and five learners produced case marking below the input.

An external file that holds a picture, illustration, etc.
Object name is nihms745557f7.jpg

Fig. 7

Case marker preferences of individual learners on the final (3^rd) day of training in the fixed (top panel) and flexible (bottom panel) constituent order languages. The dashed lines indicate the input proportion.

3.6. A unifying information-theoretic perspective on individual differences

Individual preferences discussed above provide further support for the hypothesis that learners trade off production effort against robust information transmission. While the majority of learners of the fixed constituent order language produce case below the input proportion, learners of the flexible order language produce case above the input. These data, however, do not show a complete picture since communicative efficiency of the acquired languages ultimately depends on learners’ joint case and constituent order preferences. For example, the efficiency of case marker regularization in the flexible constituent order language would vary depending on learners’ constituent order preferences: While it is efficient to maintain or regularize case in the presence of constituent order variability, it would not be an efficient innovation if constituent order variability is eliminated.

To address learners’ joint case marker and constituent order preferences, we introduce a new perspective on our data that lets us directly assess whether learners strive to balance effort and uncertainty about the intended meaning, independent of the means by which they achieve it (i.e., independent of whether they, for example, fix constituent order, regularize case marking, or condition case marking on constituent order). For this purpose, we estimated production effort and the amount of uncertainty about grammatical function assignment for each language acquired by individual participants.

We formalize production effort as the average number of syllables per sentence.^⁴ As shown in Fig. 8, the two input languages in our experiment have the same amount of effort (ranging from 3 to 7 syllables per sentence, with a mean of 5.15) since they use the same artificial lexicon and have the same amount of case marking in the input.

An external file that holds a picture, illustration, etc.
Object name is nihms745557f8.jpg

Fig. 8

Uncertainty vs. production effort trade-off on the final (3^rd) day of training. Diamonds with red borders represent input languages. Solid circles represent mean output languages produced by participants. Open diamonds represent what the output languages produced by participants would look like if they contained the same overall proportions of constituent order and case marking as the actual output, but used them independently (i.e., did not condition case marking on constituent order). The error bars represent bootstrapped 95% confidence intervals.

The average uncertainty about grammatical function assignment experienced by the listener who has a perfect knowledge of the grammar used by the speaker was captured as conditional entropy over grammatical function assignments (GF):

H (GF | sent. form) = - \sum_{GFs} \sum_{sent. forms} p (sent. form, GF) * {log}_{2} p (GF | sent. form)

where the sum is over grammatical function assignments (subject-object, object-subject) and three possible sentence forms in the languages (NP1-no case NP2-case; NP1-case NP2-no case; NP1-no case NP2-no case).

In our languages, the conditional entropy of grammatical function assignment was 0 bits for sentences with case-marked objects and for all sentences if there was no constituent order variation in the language.^⁵ Thus, the fixed order input language had minimal weighted conditional entropy of 0 bits as it contained no constituent order variation. In the flexible order input language, 67% of sentences were case marked and thus resulted in the conditional entropy of 0 bits. 33% of input sentences were not case marked and thus resulted in the conditional entropy of 1 bit since SOV and OSV constituent order variants occurred equally often in the input. Thus, the average weighted conditional entropy of the flexible constituent order language was 0.33 bits (1*0.33+0*0.67).

As expected under our hypothesis, learners systematically deviated from the input towards languages that balanced effort and uncertainty about sentence meaning more efficiently (see Fig. 8). As predicted, by dropping redundant case marking in their productions, learners of the fixed constituent order language converged on a language that had lower average effort compared to the input. The output language produced by learners of the flexible constituent order language tended to have lower weighted conditional entropy of grammatical function assignment compared to the input without a concomitant increase in effort. This behavior highlights the benefit of asymmetric case marking: For the same overall effort, entropy reduction is larger for systems that have asymmetric case marking compared to systems with symmetric case marking (i.e., case marking independent of constituent order) for the same overall constituent order proportion (see Fig. 8).^⁶

3.6.1. Between-participant variability revisited from an information-theoretic perspective

The analysis of individual participants’ productions further highlights the benefits of understanding learners’ behavior as a function of effort and uncertainty reduction. As Fig. 9 shows, the majority of learners of the fixed constituent order language followed our prediction and reduced effort without increasing uncertainty. All learners of this language produced languages that had zero conditional entropy of grammatical function assignment, and 14 (out of 20) learners reduced effort compared to the input.

An external file that holds a picture, illustration, etc.
Object name is nihms745557f9.jpg

Fig. 9

Uncertainty vs. production effort trade-off on the final (3^rd) day of training. Diamonds with red borders represent input languages. Solid circles represent mean output languages produced by participants. Open circles represent language outputs produced by individual learners. The error bars represent bootstrapped 95% confidence intervals.

In the flexible constituent order language, there was an overall strong bias to reduce uncertainty: 12 out of 20 participants produced languages that had zero conditional entropy of grammatical function assignment (see Fig. 9) and 5 out of the remaining 8 learners produced languages with lower uncertainty than in the input. Only 3 participants increased conditional entropy of grammatical function assignment. However, all of them did so while decreasing effort.

Overall, the preference to reduce uncertainty in grammatical function assignment was stronger than a preference to reduce effort: Effort increases were acceptable as long as conditional entropy of grammatical function assignment was reduced. However, if uncertainty about grammatical function assignment was increased, there was a clear preference to reduce effort.

In summary, instead of veridically reproducing the input language, which did not trade off effort against robust information transmission efficiently, participants in our experiment introduced innovations into the acquired language that increased its efficiency. Learners did not differ in whether they followed this bias; they only differed in how they weighted the two opposing pressures. This suggests that, despite the fact that learners employ vastly differing strategies, their learning outcomes are guided by a deeper abstract principle of a trade-off between robust information transmission and effort.

Go to:

4. Discussion

Our findings add to the growing body of research showing that learners preferentially acquire and regularize typologically frequent patterns (Christiansen, 2000; Culbertson et al., 2012; Fedzechkina et al., 2012; Finley & Badecker, 2008; Morgan, Meier, & Newport, 1987; Newport & Aslin, 2004). Learning outcomes in our experiment parallel synchronic and diachronic cross-linguistic phenomena, thereby providing converging evidence that these patterns are not due to chance.

Even though learners of both languages were exposed to the same amount of case marking in the input, we observed differential learning outcomes depending on the language condition that followed the typological correlation between constituent order flexibility and case marking. Learners of the fixed order language tended to drop case marking in their productions, paralleling the lack of case systems in fixed constituent order languages like English or French. Learners of the flexible order language, however, retained case marking in their productions, mirroring flexible constituent order languages like Latin or Russian that typically have a case system.

Learning outcomes in our experiment also parallel diachronic typological patterns, such as, the change from Old English (a language with flexible constituent order and a rich case system) to Modern English (a language with fixed constituent order and no case system). Whether the historical loss of case marking precipitated (Marchand, 1951; Sapir, 1921) or followed constituent order fixing (Lehnert, 1957) is a matter of debate, and our current results are unable to tease apart these two alternatives. Under our hypothesis, however, both paths should result in the same outcome: Learners of a flexible order language should maintain constituent order flexibility if this language has a case system (cf. Old English) and gradually lose it if this language has no case system (cf. Modern English).

Interestingly, learners of the flexible order language did not fix or even regularize constituent order. One reason for this might be that changes to constituent order result in larger perceptual deviance from the input distributions, but future work is necessary to address this question.

The overall picture emerging from individual learners’ performance suggests that the preference to balance robust information transmission and effort is indeed in play during language acquisition (Aylett & Turk, 2004; Gibson et al., 2013; Jaeger, 2010; Jäger, 2007; Kurumada & Jaeger, 2015; see also Lindblom, 1990). Our findings provide insight into linguistic diversity: For any grammatical system, the general principle of trading off robust information transmission and effort can manifest itself in a variety of ways. Which of them would survive at the population level and eventually become part of the grammar depends on a variety of factors, including the inherent efficiency of the innovation, other properties of the linguistic system, historical factors such as language contact (Heine, 2008), and social factors such as social influences of a person creating the innovation (Nettle, 1999).

In the remainder of the paper, we first discuss some of our findings in more detail, and in particular learners’ preference to use more case marking in OSV sentences. We then address outstanding questions for future work. Specifically, we situate our findings in a broader literature on deviations from the input in adult artificial language learning and consider possible mechanisms underlying the learning outcomes observed in our experiment. We conclude with a discussion of how the logic of the arguments presented in this paper can be extended to a broader range of cues to grammatical function assignment, beyond the interaction between case marking and constituent order.

4.1. Conditioned variability

Learners of the flexible order language chose to condition case marking on constituent order rather than to regularize case marking in the language to 100%. These results corroborate prior work showing that adult learners prefer linguistic systems that contain conditioned variability over systems with no variability (Smith & Wonnacott, 2010; but see Hudson Kam & Newport, 2005, 2009 for the opposite findings in children). This learning outcome conceptually replicates our previous findings with different miniature languages (Fedzechkina et al., 2012), where learners conditioned case marking on the arguments’ animacy. Learning behavior in our experiment points to a preference for another cross-linguistically frequent property of human language – a tendency to avoid free variation in which several competing forms carry the same meaning in similar contexts. Instead, languages tend to have predictable variation by conditioning the use of competing forms on semantic, pragmatic, phonological, and other factors (Givon, 1985).

Conditioned variability in our experiment increases communicative success without a concomitant increase in production effort. In a language that conditions case marking on constituent order, not all sentential objects need to be marked but sentence meaning can be reliably inferred when case markers are not present. While the same increase in communicative success would be gained by conditioning case marking on either SOV or OSV order, learners in our experiment consistently used case more often in OSV sentences. What can account for this preference?

It is possible that this behavior is associated with OSV being a non-canonical constituent order. Even though OSV and SOV constituent orders occurred equally frequently in the input and in participants’ productions, OSV constituent order is typologically rare and is uncommon in English. The increase in case marking in OSV sentences could thus be a consequence of increased utterance planning difficulty for non-canonical structures (Ferreira, 1994; Urosevic, Carello, Savic, Lukatela, & Turvey, 1981). Alternatively, producers might prefer to mark structures that are less expected for the comprehender (in this case, less expected cross-linguistically or in the native language), thereby facilitating comprehension (cf. Aissen, 2003; Bybee & Hopper, 2001; Haiman, 1983; Jaeger, 2013; Jäger, 2007). Finally, the preference to use more case in OSV sentences might reflect a bias to maximize the number of linguistic dependencies processed at each point in time (cf. ‘Maximize On-line Processing’ principle that is observed both in natural language production and typological distributions (Hawkins, 1994, 2004; Nichols, 1986). Since our input languages only have object case marking, case would provide the earliest point of disambiguation in OSV sentences and allow for correct parsing commitments early on. Thus, it is possible that learners preferentially restructured the input language in such a way as to put more informative cues earlier in the sentence. This interpretation of our results parallels the developmental literature showing that young learners are highly sensitive to the order of cues to sentence meaning and disproportionally rely on early arriving cues when making parsing decisions (Choi & Trueswell, 2010; Snedeker & Trueswell, 2004; Trueswell, Sekerina, Hill, & Logrip, 1999). Similarly, early arriving cues also appear to be acquired more easily and accurately both in first language acquisition (Trueswell, Kaufman, Hafri, & Lidz, 2012) and in laboratory miniature language learning experiments (Pozzan & Trueswell, in press). The three explanations outlined above are mutually compatible, and future work is necessary to assess their validity.

4.2. When are deviations from the input expected?

Our results add to the growing body of research investigating the circumstances under which learners are likely to deviate from the input they receive. Adult learners in our experiment did not veridically reproduce the inefficient variation in the input, but tended to consistently deviate from the input towards more efficient linguistic systems. In fact, 60% of learners of the flexible constituent order language converged on output languages that had zero conditional entropy about grammatical function assignment. This behavior is atypical given that prior work in miniature artificial language learning has found that adult learners generally match the input they receive (Hudson Kam & Newport, 2005, 2009; Tily et al., 2011). These studies, however, differ from present work in several important respects.

First, a large body of work suggests that learners are not likely to introduce innovations into perfectly consistent languages (traditionally used to study the relationship between language learning and language structure), at least not within the short amount of time spent in the laboratory (e.g., Christiansen, 2000; St Clair et al., 2009; Tily et al., 2011). Unlike these studies, our input languages incorporate variability, somewhat reminiscent of the situation of a pidgin, to make learning biases manifest within the short period of time available in an experiment.

Studies using the paradigm similar to ours, in which learners are exposed to languages where several grammatical structures are available to express the same meaning, have produced somewhat mixed results with regard to adult learners’ behavior. Adult learners tend to match the statistics of the miniature artificial grammar in experiments of this type both in linguistic (Hudson Kam & Newport, 2005, 2009) and non-linguistic (Ferdinand, Thompson, Kirby, & Smith, 2013) domains. For example, Hudson Kam and Newport (2005) presented adults and children with miniature artificial languages that had fixed constituent order and unpredictable morphological variation: nouns were followed by determiners (‘ka’ and ‘po’) that varied probabilistically in the input. While young learners readily regularized the inconsistent input they received and used one of the determiners in almost all of their productions, adults mostly reproduced the unpredictable variation present in the input. This bias against deviating from the input can, however, be overcome if input languages are fairly complex (e.g., languages in which learners need to keep track of multiple frequencies simultaneously), while being still learnable within a short period of time (e.g., Ferdinand et al., 2013; Hudson Kam & Newport, 2009).

Crucially, the miniature artificial languages used in our experiment were different from those used in most previous work in that the design of our experiment provides a functional motivation to deviate from the input. Our work thus adds to the growing body of literature suggesting that in the presence of such motivation adult learners tend to deviate from the input provided that the input languages are carefully designed to be complex enough to observe such deviations (Culbertson & Legendre, 2010; Culbertson et al., 2012; Fedzechkina et al., 2012; Hudson Kam & Newport, 2009).

4.3. The mechanism underlying learning outcomes

Our findings raise questions about the precise nature of the mechanism underlying learners’ preferences to efficiently trade off cues to grammatical function assignment.

The learning outcomes in our experiment closely resemble the patterns found in online productions of adult speakers (Aylett & Turk, 2006; Frank & Jaeger, 2008; Gomez Gallo, Jaeger, & Smyth, 2008; Jaeger, 2010; Kurumada & Jaeger, 2015; Lindblom, 1990; van Son & Pols, 2003; van Son & van Santen, 2005). Particularly relevant to the work presented here is a recent study by Kurumada and Jaeger (2015). They found that speakers of Japanese were more likely to produce object case marking when sentence properties (such as animacy of the object or plausibility of grammatical function assignment) were likely to bias the listener away from the intended interpretation. This suggests that learners’ preferences in our experiments could originate in the human production system (see also Norcliffe & Jaeger, 2015 for similar evidence from optional head-marking in Yucatec Maya; for a cross-linguistic review see Jaeger & Buz, in press).

It is also possible that the patterns observed in our experiments originate in learning that is not specific to production: Learners may have misinterpreted some of the sentences they were exposed to, altering the characteristics of the input from which they learned. Where might such ‘misinterpretations’ arise? One possibility is that learners might have misinterpreted some of the sentences during initial perception (cf. Guy, 1996; Ohala, 1989). A promising account along these lines is presented by simulations of case marking and constituent order interactions (Lupyan & Christiansen, 2002; Van Everbroeck, 2003). For example, in Van Everbroeck’s simulations, the model was trained to determine grammatical function of each sequentially presented word in a sentence. When tested on previously unseen non-case-marked sentences, the model performed well on most fixed constituent orders, but showed somewhat poorer performance at discriminating constituents in fixed SOV order. For flexible constituent order languages, however, the model could not reliably distinguish between constituents in the absence of other cues to grammatical function assignment (e.g., case). Adding case marking dramatically improved network performance. Thus, the perceptual account can explain several cross-linguistic universals: the trade-off between case and constituent order (Blake, 2001; Sapir, 1921) as well as the prevalence of case marking in verb-final languages (Dryer & Haspelmath, 2011; Greenberg, 1963).

How likely is such an account to explain our results? On the one hand, the rarity of object-before-subject sequences in English (the native language of our learners) is likely to further increase error rates for OSV sentences (Christianson et al., 2001; Ferreira, 2003). On the other hand, two properties of our experimental design make an account that grounds misinterpretations in perception somewhat implausible. First, unlike the connectionist simulations described above, the meaning of the training sentences was always illustrated by the accompanying video in our experiment, thereby unambiguously conveying the intended grammatical function assignment. Recall that sentence descriptions were played as the action unfolded, thus making it less probable that learners have formed incorrect interpretations of sentence meaning based on the novel descriptions alone. Second, there was no time pressure. This makes it somewhat unlikely that misinterpretations during perception were sufficiently frequent to create the observed effect.

It is, however, possible that ‘misinterpretations’ arise later during memory encoding or retrieval. A recurring finding in verbal and non-verbal short-term memory is that items interfere with each other during encoding and retrieval (Jonides et al., 2008). This interference is increased if the items that need to be retrieved or encoded are in some way similar to each other (e.g., structurally, semantically, spatially, etc.). This type of interference has been shown to affect sentence comprehension as well (Lewis, 1996; Lewis & Nakayama, 2001; Lewis, Vasishth, & Van Dyke, 2006; Van Dyke & Lewis, 2003): Increasing syntactic or semantic similarity between the target and preceding or following material poses additional difficulty during retrieval (as manifested in longer reading times and increased comprehension errors). Since all visual referents in our experiment were human and male, it is likely that participants experienced this type of interference and thus misassigned grammatical roles in a certain number of trials. Such misassignments could have introduced changes into the input distribution when it was consolidated in memory. We note, however, that any explanation along these lines would need to be extended further in order to also capture the findings from optional case marking in both miniature and natural languages (Fedzechkina et al., 2012; Kurumada & Jaeger, 2015; Lee, 2006), whereas these findings follow straightforwardly under the hypothesis that learners are biased towards communicatively efficient languages.

The hypothesis that the outcomes observed in our experiment arise outside of the production system (whether in perception or memory) is broadly compatible with functional theories of efficient communication. It complements work that has focused on the organization of human production system as the locus of biases for efficient communication (Aylett & Turk, 2004; Jaeger, 2013; Kurumada & Jaeger, 2015; Lindblom, 1990).

4.4. The scope of explanation

While the main focus of this study is on the interaction between constituent order and case marking, the arguments presented here can be extended beyond these particular cues to grammatical function assignment. This view can be summarized as a more general implicational universal concerning the trade-off between cues to grammatical function assignment proposed in the functional literature (Bates & MacWhinney, 1982, 1989; Givón, 1991), which is also broadly compatible with information-theoretic accounts (e.g., Gibson et al., 2013; Jaeger, 2006, 2013; Levy, 2005). If an existing cue to grammatical function assignment is highly informative, other cues would be redundant and thus could be omitted; if existing cues do not contain enough information to successfully communicate who is doing what to whom in a sentence, additional cues should be recruited to maintain successful communication.

In this broad interpretation, this view captures the avoidance of excessive redundancy in encoding grammatical function assignment cross-linguistically: While natural languages employ a variety of strategies to identify the subject and the object of the sentence (e.g., constituent order, case marking, agreement, prosody, animacy, etc.), it is highly uncommon to consistently employ all attested means of grammatical function assignment within the same language. Our account also makes explicit predictions for languages that go against the correlation between constituent order flexibility and the presence of case, such as some highly flexible constituent order languages that have no case marking (e.g., Abkhaz, Papago): Such languages are expected to have cues other than case marking to encode grammatical function assignment. Indeed, natural languages typically preserve the distinction between the subject and object in the sentence (Wasow, 2015) and often recruit such cues as animacy (Prat-Sala & Branigan, 2000), agreement (Siewierska, 1998), or pragmatic information (Mithun, 1987) when constituent order is flexible.

Note that while we argue against excessive unnecessary redundancy in linguistic systems, some amount of redundancy is expected under our account (for a related discussion, see also Piantadosi, Tily, and Gibson (2011a). Human communication takes place in the presence of noise. Thus listeners need to infer the speaker’s intended message from multiple probabilistic cues, each of which can be corrupted during its transmission via the noisy channel with a certain probability. For example, case markers and constituent order might not be successfully perceived every time – they can be lost or misheard. This has consequences for when case marking would be expected in a language. Even when constituent order is highly informative, case marking will often further increase robust information transmission (especially in those cases when the message is less expected). This might explain why fixed constituent order languages sometimes have case and agreement systems (Siewierska, 1998) or why many languages with optional case marking (e.g., Japanese, Korean, Hindi) will optionally mark both grammatical subjects and objects (for a review, see Malchukov, 2008).

Go to:

5. Conclusion

To summarize, our findings suggest that the well-documented inverse correlation between constituent order flexibility and the presence of a case system can be explained by learners’ preference for grammatical systems that encode linguistic information efficiently. More generally, our results provide additional support for the hypothesis that at least some cross-lexical and grammatical properties of languages represent efficient trade-offs between effort and robust information transmission (Maurits, Perfors, & Navarro, 2010; Piantadosi, Tily, & Gibson, 2011b; Qian & Jaeger, 2012). Our results also contribute to a growing body of work demonstrating the potential of using miniature artificial language learning to study the relationship between learning biases and language structures.

Go to:

Acknowledgements

We thank T. Stanley and S. Krishnan for their help with data collection and data coding; C. Anzalone, G. Smith, E. Williams, and B. Chu for their help with data coding; C. Donatelli, I. Minkina and A. Wood for their help with video stimuli; L. Emberson, P. Reeder, and J.P. Blevins for valuable feedback on the manuscript. This work was supported in part by NSF grant IIS-1150028, as well as an Alfred P. Sloan Fellowship BR2011-092 to T. F. Jaeger, and by NIH grant DC00167 and HD037082 to E.L. Newport.

Go to:

Footnotes

¹Throughout this paper, we use the term ‘language universals’ to refer to cross-linguistic generalizations regardless of whether they are statistical or absolute..

²We focus exclusively on one function of case and constituent order–encoding of grammatical function assignment—even though both cues typically convey additional information in natural languages (Du Bois, 1987). An example is Japanese, in which some case markers are associated with focus, topic or discourse-marking (Shimojo, 2006; Yatanabe, 1999) or Russian, where constituent order can express pragmatic information (Yokoyama, 1986).

³This inverse relation is by no means absolute. For example, though rare, there are some languages that arguably have fixed constituent order and case marking such as Icelandic (Kiparsky, 1997). The existence of exceptions does not, however, argue against the existence of statistical tendencies. There is also a substantial number of languages with flexible constituent order that employ means other than case marking to mark grammatical relations (e.g., head-marking in many Balkan languages, Friedman, 2006) . This is compatible with the perspective we explore here, which predicts a trade-off between different means of encoding grammatical function assignment. We return to this point in Section 4.4.

⁴Our choice of complexity measure (e.g., syllable count vs. word count) does not affect the results presented here since they are perfectly correlated in our language. More generally, measures of complexity are typically highly correlated in natural language as well (Szmrecsanyi, 2004).

⁵We make a simplifying assumption of noise-free recognition of word sequences and case marking. However, the perspective assumed here is readily extendable to a more plausible assumption of noisy acoustic input and noisy recognition (e.g., Kleinschmidt & Jaeger, 2015; Levy, 2011; Levy, Bicknell, Slattery, & Rayner, 2009; Norris & McQueen, 2008).

⁶See Chapter 2 in Fedzechkina (2014) for data from a language with flexible but informative constituent order (75/25% SOV/OSV), which provides further support for the conclusions drawn here.

Go to:

References

Aissen J. Differential Object Marking: Iconicity vs. Economy. Nat Lang Linguist Th. 2003;21(3):435–483. 10.1023/A:1024109008573. [CrossRef] [Google Scholar]
Aylett MP, Turk A. The smooth signal redundancy hypothesis: A functional explanation for relationships between redundancy, prosodic prominence, and duration in spontaneous speech. Lang Speech. 2004;47(1):31–56. 10.1177/00238309040470010201. [Abstract] [CrossRef] [Google Scholar]
Aylett MP, Turk A. Language redundancy predicts syllabic duration and the spectral characteristics of vocalic syllable nuclei. The Journal of the Acoustical Society of America. 2006;119:3048. 10.1121/1.2188331. [Abstract] [CrossRef] [Google Scholar]
Bates E, MacWhinney B. Functionalist approaches to grammar. In: Wanner E, Gleitman L, editors. Language Acquisition: the State of the Art. Cambridge: Cambridge University Press; 1982. pp. 173–218. [Google Scholar]
Bates E, MacWhinney B. Functionalism and the competition model. The crosslinguistic study of sentence processing. 1989:3–73. [Google Scholar]
Baugh A, Cable T. A history of the English language. 4th. London: Routledge; 1993. [Google Scholar]
Bever T. The cognitive basis for linguistic structures. In: Hayes JR, editor. Cognition and the development of language. John Wiley & Sons; 1970. pp. 279–362. [Google Scholar]
Blake BJ. Case. Cambridge: CUP: 2001. [Google Scholar]
Boersma P. Praat, a system for doing phonetics by computer. Glot International. 2001;5(9/10):341–345. [Google Scholar]
Breslow NE, Clayton DG. Approximate inference in generalized linear mixed models. J Am Stat Assoc. 1993;88:9–25. 10.1080/01621459.1993.10594284. [CrossRef] [Google Scholar]
Bybee JL. Frequency of Use and the Organization of Language. New York: Oxford University Press; 2007. [Google Scholar]
Bybee JL, Hopper P. Introduction to frequency and the emergence of linguistic structure. Frequency and the emergence of linguistic structure. 2001:1–24. [Google Scholar]
Choi Y, Trueswell J. Children’s (in)ability to recover from garden paths in a verb-final language: Evidence for developing control in sentence processing. J Exp Psychol. 2010;106(1):41–61. 10.1016/j.jecp.2010.01.003. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
Chomsky N. Aspects of the Theory of Syntax. Vol. 119. The MIT press; 1965. [Google Scholar]
Christiansen MH. Using artificial language learning to study language evolution: Exploring the emergence of word order universals. In: Dessalles JL, Ghadakpour L, editors. The Evolution of Language: 3rd International Conference. Paris, France: Ecole Nationale Superieure des Telecommunications; 2000. pp. 45–48. [Google Scholar]
Christiansen MH, Chater N. Language as shaped by the brain. Behav Brain Sci. 2008;31(05):489–509. 10.1017/S0140525X08004998. [Abstract] [CrossRef] [Google Scholar]
Christianson K, Hollingwoth A, Halliwell JF, Ferreira F. Thematic-roles assigned along the garden path linger. Cognitive Psychol. 2001;42:368–407. 10.1006/cogp.2001.0752. [Abstract] [CrossRef] [Google Scholar]
Croft W, Bhattacharya T, Kleinschmidt D, Smith DE, Jaeger TF. Greenbergian universals, diachrony and statistical analyses. Linguistic Typology. 2011;15(2):433–453. 10.1515/lity.2011.029. [CrossRef] [Google Scholar]
Culbertson J, Legendre G. Investigating the evolution of agreement systems using an artificial language learning paradigm. In: Bailey D, Teliga V, editors. Proceedings of the 39th Western Conference on Linguistics. Fresno: California State University, Fresno; 2010. pp. 46–58. [Google Scholar]
Culbertson J, Newport E. Harmonic biases in child learners: In support of language universals. Cognition. 2015;139:71–82. 10.1016/j.cognition.2015.02.007. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
Culbertson J, Smolensky P, Legendre G. Learning biases predict a word order universal. Cognition. 2012;122:306–329. 10.1016/j.cognition.2011.10.017. [Abstract] [CrossRef] [Google Scholar]
Cysouw M. On the probability distribution of typological frequencies. In: Ebert C, Jäger G, Michaelis J, editors. The mathematics of language. Springer; 2010. pp. 29–35. [Google Scholar]
Deacon TW. The symbolic species: The co-evolution of language and the brain. WW Norton & Company; 1997. [Google Scholar]
Detges U. How useful is case morphology? The loss of the Old French two-case system within a theory of Preferred Argument Structure. In: Barðdal J, Chelliah S, editors. The role of semantic, pragmatic, and discourse factors in the development of case. Amsterdam: John Benjamins; 2009. [Google Scholar]
Dryer M. Large linguistic areas and language sampling. Studies in Language. 1989;13:257–292. 10.1075/sl.13.2.03dry. [CrossRef] [Google Scholar]
Dryer M, Haspelmath M, editors. The world atlas of language structures online. Munich: Max Planck Digital Library; 2011. [Google Scholar]
Du Bois JW. The discourse basis of ergativity. Language. 1987;63(4):805–855. 10.2307/415719. [CrossRef] [Google Scholar]
Dunn M, Greenhill SJ, Levinson SC, Gray RD. Evolved structure of language shows lineage-specific trends in word-order universals. Nature. 2011;473(7345):79–82. 10.1038/nature09923. [Abstract] [CrossRef] [Google Scholar]
Evans N, Levinson S. The myth of language universals: Language diversity and its importance for cognitive science. Behavioral and brain sciences. 2009;32:429–492. [Abstract] [Google Scholar]
Fedzechkina M. Communicative efficiency, language learning, and language universals. University of Rochester; 2014. (PhD) [Google Scholar]
Fedzechkina M, Jaeger T, Newport E. Language learners restructure their input to facilitate efficient communication. Proc Natl Acad Sci USA. 2012;109(44):17897–17902. 10.1073/pnas.1215776109. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
Fedzechkina M, Newport E, Jaeger T. Miniature artificial language learning as a complement to typological data. In: Tyler A, Ortega L, Uno M, editors. The usage-based study of language learning and multilingualism. Georgetown University Press; (in press) [Google Scholar]
Ferdinand V, Thompson B, Kirby S, Smith K. Regularization behavior in a non-linguistic domain; Proceedings of the 35th Annual Meeting of the Cognitive Science Society; 2013. pp. 436–441. [Google Scholar]
Ferreira F. Choice of passive voice is affected by verb type animacy. Journal of Memory and Language. 1994;33:715–736. 10.1006/jmla.1994.1034. [CrossRef] [Google Scholar]
Ferreira F. The misinterpretation of noncanonical sentences. Cognitive Psychology. 2003;47:164–203. 10.1016/S0010-0285(03)00005-7. [Abstract] [CrossRef] [Google Scholar]
Finley S, Badecker W. Substantive biases for vowel harmony languages. In: Abner N, Bishop J, editors. 27th West Coast Conference on Formal Linguistics. Somerville, MA: Cascadilla Press; 2008. pp. 168–176. [Google Scholar]
Fodor JD. Setting syntactic parameters. The Handbook of contemporary syntactic theory. 2001:730–767. [Google Scholar]
Frank A, Jaeger T. Speaking Rationally: Uniform Information Density as an Optimal Strategy for Language Production; Paper presented at the The 30th Annual Meeting of the Cognitive Science Society; Austin, TX. 2008. [Google Scholar]
Friedman V. Balkans as a linguistic area. In: Brown K, editor. Encyclopedia of language and linguistics. 2nd. Vol. 1. Oxford: Elsevier; 2006. pp. 657–672. [Google Scholar]
Fry J. Ellipsis and Wa-Marking in Japanese Conversation. Stanford, CA: Stanford University; 2003. (Ph.D. dissertation) [Google Scholar]
Gibson E, Piantadosi S, Brink K, Bergen L, Lim E, Saxe R. A noisy-channel account of cross-linguistic word order variation. Psychological Science. 2013;4(7):1079–1088. 10.1177/0956797612463705. [Abstract] [CrossRef] [Google Scholar]
Givon T. Function, structure, and language acquisition. In: Slobin D, editor. The crosslinguistic study of language acquisition. Vol. 2. Hillsdale: Erlbaum; 1985. pp. 1005–1028. [Google Scholar]
Givón T. Markedness in grammar: Distributional, communicative and cognitive correlates of syntactic structure. Stud Lang. 1991;15(2):335–370. 10.1075/sl.15.2.05giv. [CrossRef] [Google Scholar]
Gomez Gallo C, Jaeger T, Smyth R. Incremental Syntactic Planning across Clauses; Proceedings of the 30th Annual Meeting of the Cognitive Science Society (CogSci08); 2008. pp. 845–850. [Google Scholar]
Greenberg J. Some universals of grammar with particular reference to the order of meaningful elements. In: Greenberg J, editor. Universals of human language. MIT Press; 1963. pp. 73–113. [Google Scholar]
Guy R. Form and function in linguistic variation. In: Guy R, Feagin D, Schiffrin D, Baugh J, editors. Towards a social science of language: Papers in honot of William Labov. 1996. pp. 221–252. [Google Scholar]
Haiman J. Iconic and economic motivation. Language. 1983;56:515–540. 10.2307/413373. [CrossRef] [Google Scholar]
Haspelmath M. Optimality and diachronic adaptation. Zeitschrift für Sprachwissenschaft. 1999;18(2):180–205. 10.1515/zfsw.1999.18.2.180. [CrossRef] [Google Scholar]
Hawkins JA. A Performance Theory of Order and Constituency. Cambridge: Cambridge University Press; 1994. [Google Scholar]
Hawkins JA. Efficiency and Complexity in Grammar. Oxford: Oxford University Press; 2004. [Google Scholar]
Heine B. Contact-induced word order change without word order change. In: Siemund P, Kintana N, editors. Language contact and contact languages. Amsterdam: Benjamins; 2008. [Google Scholar]
Hudson Kam C, Newport E. Regularizing unpredictable variation: The roles of adult and child learners in language formation and change. Lang Learn Dev. 2005;1(2):151–195. 10.1080/15475441.2005.9684215. [CrossRef] [Google Scholar]
Hudson Kam C, Newport E. Getting it right by getting it wrong: When learners change languages. Cognitive Psychol. 2009;59(1):30–66. [Europe PMC free article] [Abstract] [Google Scholar]
Hupp JM, Sloutsky VM, Culicover PW. Evidence for a domain-general mechanism underlying the suffixation preference in language. Lang Cognitive Proc. 2009;24(6):876–909. 10.1080/01690960902719267. [CrossRef] [Google Scholar]
Jaeger T. Redundancy and syntactic reduction in spontaneous speech. Stanford, CA: Stanford University; 2006. (PhD dissertation) [Google Scholar]
Jaeger T. Categorical data analysis: Away from ANOVAs (transformation or not) and towards logit mixed models. J Mem Lang. 2008;59:434–446. 10.1016/j.jml.2007.11.007. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
Jaeger T. Redundancy and reduction: Speakers manage syntactic information density. Cognitive Psychol. 2010;61:23–62. 10.1016/j.cogpsych.2010.02.002. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
Jaeger T. Production preferences cannot be understood without reference to communication. Frontiers in Psychology. 2013;4(230) 10.3389/fpsyg.2013.00230. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
Jaeger T, Graff P, Croft W, Pontillo D. Mixed effects models for genetic and areal dependencies in linguistic typology: Commentary on Atkinson. Linguistic Typology. 2011;15(2):281–319. 10.1515/lity.2011.021. [CrossRef] [Google Scholar]
Jaeger T, Buz E. Signal Reduction and Linguistic Encoding. In: Fernandez EM, Cairns HS, editors. Handbook of Psycholinguistics. Chichester: Wiley-Blackwell; (in press) [Google Scholar]
Jaeger T, Tily H. On language ‘utility’: processing complexity and communicative efficiency. Wiley Interdiscip Rev: Cognitive Science. 2011;2(3):323–335. 10.1002/wcs.126. [Abstract] [CrossRef] [Google Scholar]
Jäger G. Evolutionary Game Theory and Typology. A Case Study. Language. 2007;83(1):74–109. 10.1353/lan.2007.0020. [CrossRef] [Google Scholar]
Jonides J, Lewis R, Nee D, Lustig C, Berman M, Moore K. The mind and brain of short-term memory. Annual review of psychology. 2008;59(193–224) 10.1146/annurev.psych.59.103006.093615. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
Kiparsky P. The Rise of Positional Licensing. In: van Kemenade A, Vincent N, editors. Parameters of morphosyntactic change. Cambridge: Cmbridge University Press; 1997. [Google Scholar]
Kleinschmidt D, Jaeger TF. Robust speech perception: Recognizing the familiar, generalizing to the similar, and adapting to the novel. Psychological Review. 2015 10.3758/s13423-015-0943-z. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
Kurumada C, Jaeger T. Communicative efficiency in language production: Optional case-marking in Japanese. Journal of Memory and Language. 2015;83:152–178. 10.1016/j.jml.2015.03.003. [CrossRef] [Google Scholar]
Lee H. Parallel optimization in case systems: Evidence from case ellipsis in Korean. J East Asian Linguis. 2006;15:69–96. 10.1007/s10831-005-3004-1. [CrossRef] [Google Scholar]
Lehnert M. The interrelation between form and function in the development of the English language. Zeitschrift für Anglistik und Amerikanistik. 1957;5:43–56. [Google Scholar]
Levy R. Probabilistic models of word order and syntactic discontinuity. Stanford: Stanford University; 2005. (Ph.D.) [Google Scholar]
Levy R. Integrating surprisal and uncertain-input models in online sentence comprehension: formal techniques and empirical results. In: Lin D, editor. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Stroudsburg: Association for Computational Linguistics; 2011. pp. 1055–1065. [Google Scholar]
Levy R, Bicknell K, Slattery T, Rayner K. Eye movement evidence that readers maintain and act on uncertainty about past linguistic input. Proceedings of the National Academy of Sciences. 2009;106(50):21086–21090. 10.1073/pnas.0907664106. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
Lewis RL. Interference in short-term memory: The magical number two (or three) in sentence processing. Journal of Psycholinguistic Research. 1996;25(1):93–115. 10.1007/BF01708421. [Abstract] [CrossRef] [Google Scholar]
Lewis RL, Nakayama M. Syntactic and positional similarity effects in the processing of Japanese embeddings. In: Nakayama M, editor. Sentence processing in East Asian languages. Stanford, CA: CLSI Publications; 2001. pp. 85–113. [Google Scholar]
Lewis RL, Vasishth S, Van Dyke JA. Computational principles of working memory in sentence comprehension. Trends in Cognitive Science. 2006;10(10) 10.1016/j.tics.2006.08.007. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
Lindblom B. Explaining phonetic variation: a sketch of the H&H theory. Speech production and speech modelling. 1990;55:403–439. 10.1007/978-94-009-2037-8_16. [CrossRef] [Google Scholar]
Lupyan G, Christiansen MH. Case, word order, and language learnability: insights from connectionist modeling; Paper presented at the In Proceedings of the 24th Annual Conference of the Cognitive Science Society.2002. [Google Scholar]
Malchukov AL. Animacy and asymmetries in differential case marking. Lingua. 2008;118:203–221. 10.1016/j.lingua.2007.02.005. [CrossRef] [Google Scholar]
Marchand H. The syntactical change from inflectional to word order system and some effects of this change on the relation verb-object in English. Anglia. 1951;70:70–89. [Google Scholar]
Maurits L, Perfors A, Navarro D. Why are some word orders more common than others? A uniform information density account. In: Lafferty J, Williams C, Shawe-Taylor J, Zemel R, Culotta A, editors. Advances in Neural Information Processing Systems. Vol. 23. Vancouver, BC: 2010. pp. 1585–1593. [Google Scholar]
Mithun M. Is basic word order universal? In: Tomlin RS, editor. Coherence and grounding in discourse. Amsterdam: John Benjamins; 1987. [Google Scholar]
Morgan JL, Meier RP, Newport E. Structural packaging in the input to language learning: Contributions of prosodic and morphological marking of phrases to the acquisition of language. Cognitive Psychol. 1987;19(4):498–550. 10.1016/0010-0285(87)90017-x. [Abstract] [CrossRef] [Google Scholar]
Nettle D. Using Social Impact Theory to simulate language change. Lingua. 1999;108:95–117. 10.1016/S0024-3841(98)00046-1. [CrossRef] [Google Scholar]
Newport E. Constraints on structure: Evidence from American Sign Language and language learning. In: Collins WA, editor. Aspects of the development of competence. Minnesota Symposium on Child Psychology. Vol. 14. Erlbaum: 1981. pp. 93–124. [Google Scholar]
Newport E. Maturational constraints on language learning. Cognitive Science. 1990;14(1):11–28. 10.1207/s15516709cog1401_2. [CrossRef] [Google Scholar]
Newport E, Aslin R. Learning at a distance I. Statistical learning of non-adjacent dependencies. Cognitive Psychol. 2004;48(2):127–162. 10.1016/S0010-0285(03)00128-2. [Abstract] [CrossRef] [Google Scholar]
Nichols J. Head-marking and dependent-marking grammar. Language. 1986:56–119. 10.2307/415601. [CrossRef] [Google Scholar]
Norcliffe E, Jaeger T. Predicting head-marking variability in Yucatec Maya relative clause production. Language and Cognition. 2015 10.1017/langcog.2014.39. [CrossRef] [Google Scholar]
Norris D, McQueen JM. Shortlist B: A Bayesian model of continuous speech recognition. Psychological Review. 2008;115(2):357. 10.1037/0033-295X.115.2.357. [Abstract] [CrossRef] [Google Scholar]
Ohala J. Discussion of Lindblom’s ‘Phonetic invariance and the adaptive nature of speech’ In: Elsendoorn B, Bouma H, editors. Working models of human perception. London: Academic Press; 1989. pp. 175–183. [Google Scholar]
Piantadosi S, Tily H, Gibson E. The communicative function of ambiguity in language. Cognition. 2011a;122:280–291. 10.1016/j.cognition.2011.10.004. [Abstract] [CrossRef] [Google Scholar]
Piantadosi S, Tily H, Gibson E. Word lengths are optimized for efficient communication. Proc Natl Acad Sci USA. 2011b;108(9):3526. 10.1073/pnas.1012551108. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
Pinker S. Language learnability and language development. Cambridge, MA: Harvard University Press; 1984. [Google Scholar]
Pintzuk S. Morphological case and word order in Old English. Language Sciences. 2002;24:381–395. 10.1016/S0388-0001(01)00039-0. [CrossRef] [Google Scholar]
Pozzan L, Trueswell J. Revise and resubmit: How real-time parsing limitations influence grammar acquisition. Cognitive Psychology. (in press) [Europe PMC free article] [Abstract] [Google Scholar]
Prat-Sala M, Branigan HP. Discourse constraints on syntactic processing in language production: A corss-linguistic study in English and Spanish. Journal of Memory and Language. 2000;42:168–182. 10.1006/jmla.1999.2668. [CrossRef] [Google Scholar]
Qian T, Jaeger TF. Cue Effectiveness in Communicatively Efficient Discourse Production. Cognitive Sci. 2012;36:1312–1336. 10.1111/j.1551-6709.2012.01256.x. [Abstract] [CrossRef] [Google Scholar]
Rafferty A, Griffiths T, Klein D. Analyzing the rate at which languages lose the inﬂuence of a common ancestor. Cognitive Science. 2014;38(7):1406–1431. 10.1111/cogs.12112. [Abstract] [CrossRef] [Google Scholar]
Sapir E. Language: An introduction to the study of speech. New York: Harcourt, Brace; 1921. [Google Scholar]
Shimojo M. Properties of particle ‘omission’ revisited; Paper presented at the Toronto working papers in Linguistics.2006. [Google Scholar]
Siewierska A. Variation in major constituent order. In: Siewierska A, editor. Constituent order in the languages of Europe. Berlin: Mouton de Gruyter; 1998. pp. 465–529. [Google Scholar]
Silverstein M. Hierarchy of features and ergativity. In: Dixon RMW, editor. Grammatical categories in Australian languages. Canberra: Australian Institute of Aboriginal Studies; 1976. pp. 112–171. [Google Scholar]
Sinnemäki K. Complexity trade-offs in core argument marking. In: Karlsson F, Miestamo M, Sinnemäki K, editors. Language Complexity: Typology, Contact, Change. Amsterdam: John Benjamins; 2008. pp. 67–88. [Google Scholar]
Slobin D. Cognitive prerequisites for the development of grammar. In: Ferguson CA, Slobin DI, editors. Studies of child language development. New York: Holt, Rinehart, Winston; 1973. [Google Scholar]
Smith K, Wonnacott E. Eliminating unpredictable variation through iterated learning. Cognition. 2010;116(3):444–449. 10.1016/j.cognition.2010.06.004. [Abstract] [CrossRef] [Google Scholar]
Snedeker J, Trueswell J. The developing constraints on parsing decisions: The role of lexical biases and referential scenes in child and adult sentence processing. Cognit Psychol. 2004;49(3):238–299. 10.1016/j.cogpsych.2004.03.001. [Abstract] [CrossRef] [Google Scholar]
St Clair MC, Monaghan P, Ramscar M. Relationships between language structure and language learning: The suffixing preference and grammatical categorization. Cognitive Science. 2009;33(7):1317–1329. 10.1111/j.1551-6709.2009.01065.x. [Abstract] [CrossRef] [Google Scholar]
Szmrecsanyi B. On operationalizing syntactic complexity; Paper presented at the JADT, Louvain-la-Neuve; Belgium. 2004. [Google Scholar]
Tily H. The role of processing complexity in word order variation and change. Stanford University; 2010. (Ph.D.) [Google Scholar]
Tily H, Frank M, Jaeger T. The learnability of constructed languages reﬂects typological patterns. In: Carlson L, Hoelscher C, Shipley T, editors. Proceedings of the 33rd Annual Conference of the Cognitive Science Society. Boston, MA: Cognitive Science Society; 2011. pp. 1364–1369. [Google Scholar]
Traugott EC. A history of English syntax. New York: Holt: Rinehart and Winston; 1972. [Google Scholar]
Trueswell J, Kaufman D, Hafri A, Lidz J. Development of parsing abilities interacts with grammar learning: Evidence from Tagalog and Kannada; Paper presented at the Proceedings of BUCLD; Sommerville, MA. 2012. [Google Scholar]
Trueswell J, Sekerina I, Hill N, Logrip M. The kindergarten-path effect: studying on-line sentence processing in young children. Cognition. 1999;73:89–134. 10.1016/S0010-0277(99)00032-3. [Abstract] [CrossRef] [Google Scholar]
Urosevic Z, Carello C, Savic M, Lukatela G, Turvey M. Some word-order effects in Serbo-Croat. Language and Speech. 1981;29(2):177–195. [Abstract] [Google Scholar]
Van Dyke JA, Lewis RL. Distinguishing effects of structure and decay on attachment and repair: A cue-based parsing account of recovery from misanalyzed ambiguities. Journal of Memory and Language. 2003;49(3):285–316. 10.1016/S0749-596X(03)00081-0. [CrossRef] [Google Scholar]
Van Everbroeck E. Language type frequency and learnability from a connectionist perspective. Linguistic Typology. 2003;7(1):1–50. 10.1515/lity.2003.011. [CrossRef] [Google Scholar]
van Son RJJH, Pols LCW. How efficient is speech?; Paper presented at the Proc of the Institute of Phonetic Sciences; Amsterdam. 2003. [Google Scholar]
van Son RJJH, van Santen JPH. Duration and spectral balance of intervocalic consonants: A case for effcient communication. Speech Commun. 2005;47:464–484. 10.1016/j.specom.2005.06.005. [CrossRef] [Google Scholar]
Wasow T. Ambiguity avoidance is overrated. In: Winkler S, editor. Ambiguity: Language and Communication. Berlin: DeGruyter; 2015. pp. 21–51. [Google Scholar]
Wilson C. Learning phonology with substantive bias: An experimental and computational study of velar palatalization. Cognitive Science. 2006;30(5):945–982. 10.1207/s15516709cog0000_89. [Abstract] [CrossRef] [Google Scholar]
Yatanabe S. Particle ellipsis and focus projection in Japanese. Language, Information, Text. 1999;6:79–104. [Google Scholar]
Yokoyama O. Discourse and Word Order. Amsterdam/Philadelphia: John Benjamins; 1986. [Google Scholar]
Zipf GK. Human Behavior and the Principle of Least Effort. Cambridge, MA: Addison-Wesley; 1949. [Google Scholar]

Full text links

Read article at publisher's site: https://doi.org/10.1111/cogs.12346

Read article for free, from open access legal sources, via Unpaywall: https://onlinelibrary.wiley.com/doi/pdfdirect/10.1111/cogs.12346

Citations & impact

Impact metrics

Citations

Jump to Citations

Citations of article over time

Alternative metrics

Altmetric item for https://www.altmetric.com/details/5980666

Altmetric
Discover the attention surrounding your research
https://www.altmetric.com/details/5980666

Smart citations by scite.ai
Explore citation contexts and check if this article has been supported or disputed.
https://scite.ai/reports/10.1111/cogs.12346

Supporting

Mentioning

Contrasting

Article citations

The emergence of phonological dispersion through interaction: an exploratory secondary analysis of a communicative game.
Roberts G, Clark R
Front Psychol, 14:1130837, 24 May 2023
Cited by: 0 articles | PMID: 37292509 | PMCID: PMC10244737
Articles in the Open Access Subset are available under a Creative Commons license. This means they are free to read, and that reuse is permitted under certain circumstances. There are six different Creative Commons licenses available, see the copyright license for this article to understand what type of reuse is permitted.
Free full text in Europe PMC
Grammatical Gender Disambiguates Syntactically Similar Nouns.
Rogers PG, Gries ST
Entropy (Basel), 24(4):520, 07 Apr 2022
Cited by: 0 articles | PMID: 35455183 | PMCID: PMC9032811
Articles in the Open Access Subset are available under a Creative Commons license. This means they are free to read, and that reuse is permitted under certain circumstances. There are six different Creative Commons licenses available, see the copyright license for this article to understand what type of reuse is permitted.
Free full text in Europe PMC
The Impact of Information Structure on the Emergence of Differential Object Marking: An Experimental Study.
Tal S, Smith K, Culbertson J, Grossman E, Arnon I
Cogn Sci, 46(3):e13119, 01 Mar 2022
Cited by: 2 articles | PMID: 35297091 | PMCID: PMC9286624
Articles in the Open Access Subset are available under a Creative Commons license. This means they are free to read, and that reuse is permitted under certain circumstances. There are six different Creative Commons licenses available, see the copyright license for this article to understand what type of reuse is permitted.
Free full text in Europe PMC
Cross-Linguistic Trade-Offs and Causal Relationships Between Cues to Grammatical Subject and Object, and the Problem of Efficiency-Related Explanations.
Levshina N
Front Psychol, 12:648200, 12 Jul 2021
Cited by: 2 articles | PMID: 34322056 | PMCID: PMC8311235
Articles in the Open Access Subset are available under a Creative Commons license. This means they are free to read, and that reuse is permitted under certain circumstances. There are six different Creative Commons licenses available, see the copyright license for this article to understand what type of reuse is permitted.
Free full text in Europe PMC
Corpus-based typology: applications, challenges and some solutions.
Levshina N
Linguist Typol, 26(1):129-160, 30 Mar 2021
Cited by: 0 articles | PMID: 35881664 | PMCID: PMC9159679
Review Free full text in Europe PMC

Go to all (8) article citations

Funding

Funders who supported this work.

NICHD NIH HHS (2)

Grant ID: R01 HD037082
82 publications
Grant ID: U54 HD090257
472 publications

NIDCD NIH HHS (1)

Grant ID: R01 DC000167
50 publications

National Institutes of Health (2)

Grant ID: HD037082
2 publications
Grant ID: DC00167
3 publications

National Science Foundation (1)

Grant ID: IIS‐1150028
2 publications

Search life-sciences literature (43,920,221 articles, preprints and more)

Balancing Effort and Information Transmission During Language Acquisition: Evidence From Word Order and Case Marking.

Author information

Affiliations

Authors

Authors

Authors

ORCIDs linked to this article

Abstract

Free full text

Balancing effort and information transmission during language acquisition: Evidence from word order and case marking

Maryia Fedzechkina

Elissa L. Newport

T. Florian Jaeger

Abstract

1. Introduction

2. Method

2.1. Participants

2.2. Design and Materials

Table 1

2.3. Procedure

Noun exposure and tests

Noun exposure

Noun comprehension

Noun production

Sentence exposure and tests

Sentence exposure

Sentence comprehension

Sentence production

3. Results

3.1. Scoring

Comprehension accuracy

Production accuracy

3.2. Constituent order in production

3.3. Case marker use in production

3.4. How is case marking used in the flexible order language?

3.5. Individual learners’ preferences

3.6. A unifying information-theoretic perspective on individual differences

3.6.1. Between-participant variability revisited from an information-theoretic perspective

4. Discussion

4.1. Conditioned variability

4.2. When are deviations from the input expected?

4.3. The mechanism underlying learning outcomes

4.4. The scope of explanation

5. Conclusion

Acknowledgements

Footnotes

References

Full text links

Citations & impact

Impact metrics

Citations of article over time

Alternative metrics

Article citations

Similar Articles

Funding

NICHD NIH HHS (2)﻿

NIDCD NIH HHS (1)﻿

National Institutes of Health (2)﻿

National Science Foundation (1)﻿

NICHD NIH HHS (2)

NIDCD NIH HHS (1)

National Institutes of Health (2)

National Science Foundation (1)