Article

Can Infants Retain Statistically Segmented Words and Mappings Across a Delay?

Authors:
To read the full-text of this research, you can request a copy directly from the authors.

Abstract

Infants are sensitive to statistics in spoken language that aid word‐form segmentation and immediate mapping to referents. However, it is not clear whether this sensitivity influences the formation and retention of word‐referent mappings across a delay, two real‐world challenges that learners must overcome. We tested how the timing of referent training, relative to familiarization with transitional probabilities (TPs) in speech, impacts English‐learning 23‐month‐olds’ ability to form and retain word‐referent mappings. In Experiment 1, we tested infants’ ability to retain TP information across a 10‐min delay and use it in the service of word learning. Infants successfully mapped high‐TP but not low‐TP words to referents. In Experiment 2, infants readily mapped the same words even when they were unfamiliar. In Experiment 3, high‐ and low‐TP word‐referent mappings were trained immediately after familiarization, and infants readily remembered these associations 10 min later. In sum, although 23‐month‐old infants do not need strong statistics to map word forms to referents immediately, or to remember those mappings across a delay, infants are nevertheless sensitive to these statistics in the speech stream, and they influence mapping after a delay. These findings suggest that, by 23 months of age, sensitivity to statistics in speech may impact infants’ language development by leading word forms with low coherence to be poorly mapped following even a short period of consolidation.

No full-text available

Request Full-text Paper PDF

To read the full-text of this research,
you can request a copy directly from the authors.

... Additional indirect evidence that the role of TPs in word learning changes across development can be found in a related study that focused on 24-month-old infants' ability to remember HTP and LTP mappings across a brief delay (Karaman et al., 2023). In that study, infants were familiarized to the Italian corpus from Shoaib et al. (2018) and Hay et al. (2011), and given a mapping task that was designed to facilitate forming word-referent mappings. ...
... Based on the findings reviewed above (Karaman et al., 2023;Mirman et al., 2008;Shoaib et al., 2018), we predicted that 24-month-olds would show an adult-like pattern. More specifically, we predicted that infants would map HTP and LTP word forms to referents equally well. ...
... Although the Italian corpus likely sounds nonnative to Englishlearning infants (Mehler et al., 1988;Shoaib et al., 2018), it shares some key similarities with English: All of the Italian target words were trochees (i.e., strong-weak stress, a pattern that is common in English), and all of the target words were phonotactically legal in English. These features make it likely that infants will be able to track the syllable-level TPs, and evidence from prior studies with these materials suggests that 8-to 24-month-olds do so (e.g., Hay et al., 2011;Karaman et al., 2023;Pelucchi et al., 2009a;Shoaib et al., 2018). ...
Article
Full-text available
Infants’ sensitivity to transitional probabilities (TPs) supports language development by facilitating mapping high-TP (HTP) words to meaning, at least up to 18 months of age. Here we tested whether this HTP advantage holds as lexical development progresses, and infants become better at forming word–referent mappings. Two groups of 24-month-olds (N = 64 and all White, tested in the United States) first listened to Italian sentences containing HTP and low-TP (LTP) words. We then used HTP and LTP words, and sequences that violated these statistics, in a mapping task. Infants learned HTP and LTP words equally well. They also learned LTP violations as well as LTP words, but learned HTP words better than HTP violations. Thus, by 2 years of age sensitivity to TPs does not lead to an HTP advantage but rather to poor mapping of violations of HTP word forms.
Article
Full-text available
Infants’ sensitivity to transitional probabilities (TPs) supports language development by facilitating mapping high-TP (HTP) words to meaning, at least up to 18 months of age. Here we tested whether this HTP advantage holds as lexical development progresses, and infants become better at forming word–referent mappings. Two groups of 24-month-olds (N = 64 and all White, tested in the United States) first listened to Italian sentences containing HTP and low-TP (LTP) words. We then used HTP and LTP words, and sequences that violated these statistics, in a mapping task. Infants learned HTP and LTP words equally well. They also learned LTP violations as well as LTP words, but learned HTP words better than HTP violations. Thus, by 2 years of age sensitivity to TPs does not lead to an HTP advantage but rather to poor mapping of violations of HTP word forms.
Article
Full-text available
People spend approximately half of their waking hours in a so-called offline state — daydreaming, mind wandering or otherwise inattentive to their surroundings. These activities are often viewed as a waste of time, perhaps as moments of lost productivity. However, periods of offline waking rest can facilitate the consolidation of newly formed memories. Even a few minutes of rest with closed eyes can improve memory, perhaps to the same degree as a full night of sleep. These findings have profound implications for understanding the memory consolidation process, its time course and its underlying mechanisms. In this Review, I describe evidence that offline waking rest retroactively facilitates memory. Similar to the beneficial effect of sleep, the effect of rest might be driven by neural-level reactivation of newly formed memory traces. As both rest and sleep seem to support consolidation, I next consider whether these two states support the same or dissociable stages of consolidation. Then I review evidence that seconds-long bouts of offline rest occur throughout the day and that even these ultrashort offline periods might benefit memory. Finally, I conclude by describing future directions for research into the underlying processes of sleep and wake states. People spend approximately half of their waking hours inattentive to their surroundings. In this Review, Wamsley describes the beneficial effect that these periods of offline waking rest have on memory, contrasting this benefit and its underlying mechanisms with the effects of sleep.
Article
Full-text available
Recent studies show that brief periods of rest after learning facilitate consolidation of new memories. This effect is associated with memory-related brain activity during quiet rest and suggests that in our daily lives, moments of unoccupied rest may serve an essential cognitive function. Access full-text here until April 2019: https://authors.elsevier.com/a/1YZ2v4sIRvBFGk
Article
Full-text available
Perception involves making sense of a dynamic, multimodal environment. In the absence of mechanisms capable of exploiting the statistical patterns in the natural world, infants would face an insurmountable computational problem. Infant statistical learning mechanisms facilitate the detection of structure. These abilities allow the infant to compute across elements in their environmental input, extracting patterns for further processing and subsequent learning. In this selective review, we summarize findings that show that statistical learning is both a broad and flexible mechanism (supporting learning from different modalities across many different content areas) and input specific (shifting computations depending on the type of input and goal of learning). We suggest that statistical learning not only provides a framework for studying language development and object knowledge in constrained laboratory settings, but also allows researchers to tackle realworld problems, such as multilingualism, the role of ever-changing learning environments, and differential developmental trajectories. Expected final online publication date for the Annual Review of Psychology Volume 69 is January 4, 2018. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Article
Full-text available
Research over the past 2 decades has demonstrated that infants are equipped with remarkable computational abilities that allow them to find words in continuous speech. Infants can encode information about the transitional probability (TP) between syllables to segment words from artificial and natural languages. As previous research has tested infants immediately after familiarization, infants' ability to retain sequential statistics beyond the immediate familiarization context remains unknown. Here, we examine infants' memory for statistically defined words 10 min after familiarization with an Italian corpus. Eight-month-old English-learning infants were familiarized with Italian sentences that contained 4 embedded target words-2 words had high internal TP (HTP, TP = 1.0) and 2 had low TP (LTP, TP = .33)-and were tested on their ability to discriminate HTP from LTP words using the Headturn Preference Procedure. When tested after a 10-min delay, infants failed to discriminate HTP from LTP words, suggesting that memory for statistical information likely decays over even short delays (Experiment 1). Experiments 2-4 were designed to test whether experience with isolated words selectively reinforces memory for statistically defined (i.e., HTP) words. When 8-month-olds were given additional experience with isolated tokens of both HTP and LTP words immediately after familiarization, they looked significantly longer on HTP than LTP test trials 10 min later. Although initial representations of statistically defined words may be fragile, our results suggest that experience with isolated words may reinforce the output of statistical learning by helping infants create more robust memories for words with strong versus weak co-occurrence statistics. (PsycINFO Database Record
Article
Full-text available
In the first few years of life children spend a good proportion of time sleeping as well as acquiring the meanings of hundreds of words. There is now ample evidence of the effects of sleep on memory in adults and the number of studies demonstrating the effects of napping and nocturnal sleep in children is also mounting. In particular, sleep appears to benefit children's memory for recently-encountered novel words. The effect of sleep on children's generalization of novel words across multiple items, however, is less clear. Given that sleep is polyphasic in the early years, made up of multiple episodes, and children's word learning is gradual and strengthened slowly over time, it is highly plausible that sleep is a strong candidate in supporting children's memory for novel words. Importantly, it appears that when children sleep shortly after exposure to novel word-object pairs retention is better than if sleep is delayed, suggesting that napping plays a vital role in long-term word retention for young children. Word learning is a complex, challenging, and important part of development, thus the role that sleep plays in children's retention of novel words is worthy of attention. As such, ensuring children get sufficient good quality sleep and regular opportunities to nap may be critical for language acquisition.
Article
Full-text available
The "looking-while-listening" methodology uses real-time measures of the time course of young children's gaze patterns in response to speech. This procedure is low in task demands and does not require automated eye-tracking technology, similar to "preferential looking" procedures. However, the looking-while-listening methodology differs critically from preferential-looking procedures in the methods used for data reduction and analysis, yielding high-resolution measures of speech processing from moment to moment, rather than relying on summary measures of looking preference. Because children's gaze patterns are time-locked to the speech signal and coded frame-by-frame, response latencies can be coded with millisecond precision on multiple trials over multiple items, based on data from thousands of frames in each experiment. The meticulous procedures required in the collection, reduction, and multiple levels of analysis of such detailed data are demanding, but well worth the effort, revealing a dynamic and nuanced picture of young children's developing skill in finding meaning in spoken language. On-line methods in children's language processing, John Benjamins: Amsterdam (pp. 97-135.).
Article
Full-text available
Maximum likelihood or restricted maximum likelihood (REML) estimates of the parameters in linear mixed-effects models can be determined using the lmer function in the lme4 package for R. As for most model-fitting functions in R, the model is described in an lmer call by a formula, in this case including both fixed- and random-effects terms. The formula and data together determine a numerical representation of the model from which the profiled deviance or the profiled REML criterion can be evaluated as a function of some of the model parameters. The appropriate criterion is optimized, using one of the constrained optimization functions in R, to provide the parameter estimates. We describe the structure of the model, the steps in evaluating the profiled deviance or REML criterion, and the structure of classes or types that represents such a model. Sufficient detail is included to allow specialization of these structures by users who wish to write functions to fit specialized linear mixed models, such as models incorporating pedigrees or smoothing splines, that are not easily expressible in the formula language used by lmer.
Article
Full-text available
Linear mixed-effects models (LMEMs) have become increasingly prominent in psycholinguistics and related areas. However, many researchers do not seem to appreciate how random effects structures affect the generalizability of an analysis. Here, we argue that researchers using LMEMs for confirmatory hypothesis testing should minimally adhere to the standards that have been in place for many decades. Through theoretical arguments and Monte Carlo simulation, we show that LMEMs generalize best when they include the maximal random effects structure justified by the design. The generalization performance of LMEMs including data-driven random effects structures strongly depends upon modeling criteria and sample size, yielding reasonable results on moderately-sized samples when conservative criteria are used, but with little or no power advantage over maximal models. Finally, random-intercepts-only LMEMs used on within-subjects and/or within-items data from populations where subjects and/or items vary in their sensitivity to experimental manipulations always generalize worse than separate F1 and F2 tests, and in many cases, even worse than F1 alone. Maximal LMEMs should be the ‘gold standard’ for confirmatory hypothesis testing in psycholinguistics and beyond.
Article
Full-text available
In order to successfully acquire a new word, young children must learn the correct associations between labels and their referents. For decades, word-learning researchers have explored how young children are able to form these associations. However, in addition to learning label-referent mappings, children must also remember them. Despite the importance of memory processes in forming a stable lexicon, there has been little integration of early memory research into the study of early word learning. After discussing what we know about how young children remember words over time, this paper reviews the infant memory development literature as it relates to early word learning, focusing on changes in retention duration, encoding, consolidation, and retrieval across the first 2 years of life. A third section applies this review to word learning and presents future directions, arguing that the integration of memory processes into the study of word learning will provide researchers with novel, useful insights into how young children acquire new words.
Article
Full-text available
A number of theorists have argued that the productive naming explosion results from advances in abilities that underlie language learning (e.g., the realization that words are symbols, changes in conceptual structure, or the onset of word learning constraints). If any of these accounts are accurate, there should be parallel developments in comprehension. To explore this issue, 4 studies assessed whether pre- and postnaming explosion children differ in their ability to learn a new word after limited exposure. 13- and 18-mo-olds heard a new object label just 9 times in a 5-min training session and then their comprehension was assessed in a multiple-choice procedure. Under favorable testing conditions, both 18- and 13-mo-olds showed comprehension of the new word, even after a 24-hr delay. These results suggest that well before the productive naming explosion, children can learn a new object label quickly. (PsycINFO Database Record (c) 2012 APA, all rights reserved)
Article
Full-text available
Conducted 10 experiments to evaluate the notion of "depth of processing" in human memory. Undergraduate Ss were asked questions concerning the physical, phonemic, or semantic characteristics of a long series of words; this initial question phase was followed by an unexpected retention test for the words. It was hypothesized that "deeper" (semantic) questions would take longer to answer and be associated with higher retention of the target words. These ideas were confirmed by the 1st 4 experiments. Exps V-X showed (a) it is the qualitative nature of a word's encoding which determines retention, not processing time as such; and (b) retention of words given positive and negative decisions was equalized when the encoding questions were equally salient or congruous for both types of decision. While "depth" (the qualitative nature of the encoding) serves a useful descriptive purpose, results are better described in terms of the degree of elaboration of the encoded trace. Finally, results have implications for an analysis of learning in terms of its constituent encoding operations. (PsycINFO Database Record (c) 2012 APA, all rights reserved)
Article
Full-text available
Learning a new word involves integration with existing lexical knowledge. Previous work has shown that sleep-associated memory consolidation processes are important for the engagement of novel items in lexical competition. In 3 experiments we used spaced exposure regimes to investigate memory for novel words and whether lexical integration can occur within a single day. The degree to which a new spoken word (e.g., cathedruke) engaged in lexical competition with established phonological neighbors (e.g., cathedral) was employed as a marker for lexical integration. We found evidence for improvements in recognition and cued recall following a time period including sleep, but we also found lexical competition effects emerging within a single day. Spaced exposure to novel words on its own did not bring about this within-day lexical competition effect (Experiment 2), which instead occurred with either spaced or massed exposure to novel words, provided that there was also spaced exposure to the phonological neighbors (Experiments 1 and 3). Although previous studies have indicated that sleep-dependent memory consolidation may be sufficient for lexical integration, our results show it is not a necessary precondition. (PsycINFO Database Record (c) 2012 APA, all rights reserved).
Article
Full-text available
Speech processing in adults in continuous: as acoustic-phonetic information is heard, listeners' interpretation of the speech is updated incrementally. The present studies used a visual fixation technique to examine whether young children also interpret speech continuously. In Experiments 1 and 2, 24-month-old children looked at visual displays while hearing sentences. Sentences each contained a target word labeling one of the two displayed pictures. Children's latency to fixate the labeled picture was measured. Children's responses were delayed when the competing distractor picture's label overlapped phonetically with the target at onset (dog-doll), but not when the pictures' labels rhymed (ball-doll), showing that children monitored the speech stream incrementally for acoustic-phonetic information specifying the correct picture. In Experiment 3, adults' responses in the same task were found to be very similar to those of the 24-month-olds. This research shows that by 24 months, children can interpret speech continuously.
Article
Full-text available
Infants' long-term memory for the phonological patterns of words versus the indexical properties of talkers' voices was examined in 3 experiments using the Headturn Preference Procedure (D. G. Kemler Nelson et al., 1995). Infants were familiarized with repetitions of 2 words and tested on the next day for their orientation times to 4 passages--2 of which included the familiarized words. At 7.5 months of age, infants oriented longer to passages containing familiarized words when these were produced by the original talker. At 7.5 and 10.5 months of age, infants did not recognize words in passages produced by a novel female talker. In contrast, 7.5-month-olds demonstrated word recognition in both talker conditions when presented with passages produced by both the original and the novel talker. The findings suggest that talker-specific information can prime infants' memory for words and facilitate word recognition across talkers.
Article
Full-text available
To explore how online speech processing efficiency relates to vocabulary growth in the 2nd year, the authors longitudinally observed 59 English-learning children at 15, 18, 21, and 25 months as they looked at pictures while listening to speech naming one of the pictures. The time course of eye movements in response to speech revealed significant increases in the efficiency of comprehension over this period. Further, speed and accuracy in spoken word recognition at 25 months were correlated with measures of lexical and grammatical development from 12 to 25 months. Analyses of growth curves showed that children who were faster and more accurate in online comprehension at 25 months were those who showed faster and more accelerated growth in expressive vocabulary across the 2nd year.
Article
Full-text available
In this study, 1.5-year-olds were taught a novel word. Some children were familiarized with the word's phonological form before learning the word's meaning. Fidelity of phonological encoding was tested in a picture-fixation task using correctly pronounced and mispronounced stimuli. Only children with additional exposure in familiarization showed reduced recognition performance given slight mispronunciations relative to correct pronunciations; children with fewer exposures did not. Mathematical modeling of vocabulary exposure indicated that children may hear thousands of words frequently enough for accurate encoding. The results provide evidence compatible with partial failure of phonological encoding at 19 months of age, demonstrate that this limitation in learning does not always hinder word recognition, and show the value of infants' word-form encoding in early lexical development.
Article
Full-text available
The integration of a newly learned spoken word form with existing knowledge in the mental lexicon is characterized by the word form's ability to compete with similar-sounding entries during auditory word recognition. Here we show that although the mere acquisition of a spoken form is swift, its engagement in lexical competition requires an incubation-like period that is crucially associated with sleep. Words learned at 8 p.m. do not induce (inhibitory) competition effects immediately, but do so after a 12-hr interval including a night's sleep, and continue to induce such effects after 24 hr. In contrast, words learned at 8 a.m. do not show such effects immediately or after 12 hr of wakefulness, but show the effects only after 24 hr, after sleep has occurred. This time-course dissociation is best accommodated by connectionist and neural models of learning in which sleep provides an opportunity for hippocampal information to be fed into long-term neocortical memory.
Article
Full-text available
The present experiments investigated how the process of statistically segmenting words from fluent speech is linked to the process of mapping meanings to words. Seventeen-month-old infants first participated in a statistical word segmentation task, which was immediately followed by an object-label-learning task. Infants presented with labels that were words in the fluent speech used in the segmentation task were able to learn the object labels. However, infants presented with labels consisting of novel syllable sequences (nonwords; Experiment 1) or familiar sequences with low internal probabilities (part-words; Experiment 2) did not learn the labels. Thus, prior segmentation opportunities, but not mere frequency of exposure, facilitated infants' learning of object labels. This work provides the first demonstration that exposure to word forms in a statistical word segmentation task facilitates subsequent word learning.
Article
Toddlers learn words in the context of speech from adult social partners. The present studies quantitatively describe the temporal context of parent speech to toddlers about objects in individual real-world interactions. We show that at the temporal scale of a single play episode, parent talk to toddlers about individual objects is predominantly, but not always, clustered. Clustered speech is characterized by repeated references to the same object close in time, interspersed with lulls in speech about the object. Clustered temporal speech patterns mirror temporal patterns observed at longer timescales, and persisted regardless of play context. Moreover, clustered speech about individual novel objects predicted toddlers' learning of those objects' novel names. Clustered talk may be optimal for toddlers' word learning because it exploits domain-general principles of human memory and attention, principles that may have evolved precisely because of the clustered structure of natural events important to humans, including human behavior.
Article
Identifying the referent of novel words is a complex process that young children do with relative ease. When given multiple objects along with a novel word, children select the most novel item, sometimes retaining the word‐referent link. Prior work is inconsistent, however, on the role of object novelty. Two experiments examine 18‐month‐old children's performance on referent selection and retention with novel and known words. The results reveal a pervasive novelty bias on referent selection with both known and novel names and, across individual children, a negative correlation between attention to novelty and retention of new word‐referent links. A computational model examines possible sources of the bias, suggesting novelty supports in‐the‐moment behavior but not retention. Together, results suggest that when lexical knowledge is weak, attention to novelty drives behavior, but alone does not sustain learning. Importantly, the results demonstrate that word learning may be driven, in part, by low‐level perceptual processes.
Article
Two experiments investigated two-year-olds' retention and generalization of novel words across short and long time delays. Specifically, retention of newly learned words and generalization to novel exemplars or novel contexts were tested 1 min or 1 week after learning. Experiment 1 revealed successful retention as well as successful generalization to both new exemplars and new contexts after a one-minute delay, with no statistical differences between retention and generalization performance for either generalization type. Toddlers tested after a week delay (Experiment 2) showed successful retention and generalization as well, but while context generalization was statistically equivalent to retention accuracy, exemplar generalization was significantly lower than retention accuracy. The overall success in both retention and generalization suggests that toddlers' newly learned words are robust and flexible. However, the lower exemplar generalization performance compared to retention after a weeklong delay suggests that novel words may become less flexible across exemplar characteristics over time.
Article
How do infants learn so rapidly and with little apparent effort? In 1996, Saffran, Aslin, and Newport reported that 8‐month‐old human infants could learn the underlying temporal structure of a stream of speech syllables after only 2 min of passive listening. This demonstration of what was called statistical learning, involving no instruction, reinforcement, or feedback, led to dozens of confirmations of this powerful mechanism of implicit learning in a variety of modalities, domains, and species. These findings reveal that infants are not nearly as dependent on explicit forms of instruction as we might have assumed from studies of learning in which children or adults are taught facts such as math or problem solving skills. Instead, at least in some domains, infants soak up the information around them by mere exposure. Learning and development in these domains thus appear to occur automatically and with little active involvement by an instructor (parent or teacher). The details of this statistical learning mechanism are discussed, including how exposure to specific types of information can, under some circumstances, generalize to never‐before‐observed information, thereby enabling transfer of learning. WIREs Cogn Sci 2017, 8:e1373. doi: 10.1002/wcs.1373 This article is categorized under: Psychology > Learning
Article
Infants show robust ability to track transitional probabilities within language and can use this information to extract words from continuous speech. The degree to which infants remember these words across a delay is unknown. Given well-established benefits of sleep on long-term memory retention in adults, we examine whether sleep similarly facilitates memory in 6.5 month olds. Infants listened to an artificial language for 7 minutes, followed by a period of sleep or wakefulness. After a time-matched delay for sleep and wakefulness dyads, we measured retention using the head-turn-preference procedure. Infants who slept retained memory for the extracted words that was prone to interference during the test. Infants who remained awake showed no retention. Within the nap group, retention correlated with three electrophysiological measures (1) absolute theta across the brain, (2) absolute alpha across the brain, and (3) greater fronto-central slow wave activity (SWA).
Article
To what extent can language acquisition be explained in terms of different associative learning mechanisms? It has been hypothesized that distributional regularities in spoken languages are strong enough to elicit statistical learning about dependencies among speech units. Distributional regularities could be a useful cue for word learning even without rich language-specific knowledge. However, it is not clear how strong and reliable the distributional cues are that humans might use to segment speech. We investigate cross-linguistic viability of different statistical learning strategies by analyzing child-directed speech corpora from nine languages and by modeling possible statistics-based speech segmentations. We show that languages vary as to which statistical segmentation strategies are most successful. The variability of the results can be partially explained by systematic differences between languages, such as rhythmical differences. The results confirm previous findings that different statistical learning strategies are successful in different languages and suggest that infants may have to primarily rely on non-statistical cues when they begin their process of speech segmentation.
Article
The present experiments tested bilingual infants' developmental narrowing for the interpretation of sounds that form words. These studies addressed how language specialization proceeds when the environment provides varied and divergent input. Experiment 1 (N = 32) demonstrated that bilingual 14- and 19-month-olds learned a pair of object labels consisting of the same syllable produced with distinct pitch contours (rising and falling). Infants' native languages did not use pitch contour to differentiate words. In Experiment 2 (N = 16), 22-month-old bilinguals failed to learn the labels. These results conflict with the developmental trajectory of monolinguals, who fail to learn pitch contour contrasts as labels at 17-19 months (Hay, Graf Estes, Wang, & Saffran, 2015). Bilingual infants exhibited a prolonged period of flexibility in their interpretation of potential word forms. © 2015 The Authors. Child Development © 2015 Society for Research in Child Development, Inc.
Article
In this article, we review literature on word learning and propose a theoretical account of how lexical knowledge and word use emerge and develop over time. We contend that the developing lexical system is built on processes that support children's in-the-moment word usage interacting with processes that create long-term learning. We argue for a new characterization of word learning in which simple mechanisms like association and competition, and the interaction between the two, guide children's selection of referents and word use in the moment. This in turn strengthens and refines the network of relationships in the lexicon, improving referent selection and use in future encounters with words. By integrating in-the-moment word use with long-term learning through simple domain-general mechanisms, this account highlights the dynamic nature of word learning and creates a broader framework for understanding language and cognitive development more generally.
Article
A recent report demonstrated that 8-month-olds can seg- ment a continuous stream of speech syllables, containing no acoustic or prosodic cues to word boundaries, into wordlike units after only 2 min of listening experience (Saffran, Aslin, & Newport, 1996). Thus, a powerful learning mechanism capable of extracting statistical informa- tion from fluent speech is available early in development. The present study extends these results by documenting the particular type of statis- tical computation—transitional (conditional) probability—used by infants to solve this word-segmentation task. An artificial language corpus, consisting of a continuous stream of trisyllabic nonsense words, was presented to 8-month-olds for 3 min. A postfamiliarization test compared the infants' responses to words versus part-words (tri- syllabic sequences spanning word boundaries). The corpus was con- structed so that test words and part-words were matched in frequency, but differed in their transitional probabilities. Infants showed reliable discrimination of words from part-words, thereby demonstrating rapid segmentation of continuous speech into words on the basis of transi- tional probabilities of syllable pairs.
Article
Four experiments explored the processes that bridge between referent selection and word learning. Twenty-four-month-old infants were presented with several novel names during a referent selection task that included both familiar and novel objects and tested for retention after a 5-min delay. The 5-min delay ensured that word learning was based on retrieval from long-term memory. Moreover, the relative familiarity of objects used during the retention test was explicitly controlled. Across experiments, infants were excellent at referent selection, but very poor at retention. Although the highly controlled retention test was clearly challenging, infants were able to demonstrate retention of the first 4 novel names presented in the session when referent selection was augmented with ostensive naming. These results suggest that fast mapping is robust for reference selection but might be more transient than previously reported for lexical retention. The relations between reference selection and retention are discussed in terms of competitive processes on 2 timescales: competition among objects on individual referent selection trials and competition among multiple novel name–object mappings made across an experimental session.
Article
Recent research demonstrated that although twenty-four month-old infants do well on the initial pairing of a novel word and novel object in fast-mapping tasks, they are unable to retain the mapping after a five-minute delay. The current study examines the role of familiarity with the objects and words on infants' ability to bridge between the initial fast mapping of a name and object, and later retention in the service of slow mapping. Twenty-four-month-old infants were familiarized with either novel objects or novel names prior to the referent selection portion of a fast-mapping task. When familiarized with the novel objects, infants retained the novel mapping after a delay, but not when familiarized with the novel words. This suggests familiarity with the object versus the word form leads to differential encoding of the name-object link. We discuss the implications of this finding for subsequent slow mapping.
Article
Four-day-old French and 2-month-old American infants distinguish utterances in their native languages from those of another language. In contrast, neither group gave evidence of distinguishing utterances from two foreign languages. A series of control experiments confirmed that the ability to distinguish utterances from two different languages appears to depend upon some familiarity with at least one of the two languages. Finally, two experiments with low-pass-filtered versions of the samples replicated the main findings of discrimination of the native language utterances. These latter results suggest that the basis for classifying utterances from the native language may be provided by prosodic cues.RésuméDeux groupes de bébés de communautés linguistiques différentes ont été testés sur leurs capacités à discriminer des séquences de discours spontané prononcées par un locuteur bilingue en deux langues différentes. Des nouveau-nés des quatre jours, français, sont capables de discriminer des séquences en français de séquences similaires en russe. Des nourrisson américains de deux mois ont manifesté un comportement similaire en présence de séquences en anglais et en italien. Cependant aucun groupe d'enfants ne montre de réponse de discrimination pour des séquences extraites de deux langues étrangéres (français, russe pour les enfants américains; anglais, italien pour les nouveau-nés français). Ceci est également le cas pour des nouveau-nés étrangers nés en France, en présence d'énoncés en français et en russe. Ainsi pour discriminer des énoncés de deux langues différentes, une certaine familiarité avec l'une d'entre elles semble nécessaire. Enfin les nouveau-nés et les nourrissons ont également montré des réactions de discrimination pour des versions filtrées des énoncés. Ces derniers résultats suggérent que les enfants pourraient classer les énoncés comme appartenant à leur langue maternelle sur la base d'indices prosodiques.
Article
The processes of infant word segmentation and infant word learning have largely been studied separately. However, the ease with which potential word forms are segmented from fluent speech seems likely to influence subsequent mappings between words and their referents. To explore this process, we tested the link between the statistical coherence of sequences presented in fluent speech and infants' subsequent use of those sequences as labels for novel objects. Notably, the materials were drawn from a natural language unfamiliar to the infants (Italian). The results of three experiments suggest that there is a close relationship between the statistics of the speech stream and subsequent mapping of labels to referents. Mapping was facilitated when the labels contained high transitional probabilities in the forward and/or backward direction (Experiment 1). When no transitional probability information was available (Experiment 2), or when the internal transitional probabilities of the labels were low in both directions (Experiment 3), infants failed to link the labels to their referents. Word learning appears to be strongly influenced by infants' prior experience with the distribution of sounds that make up words in natural languages.
Article
Numerous recent studies suggest that human learners, including both infants and adults, readily track sequential statistics computed between adjacent elements. One such statistic, transitional probability, is typically calculated as the likelihood that one element predicts another. However, little is known about whether listeners are sensitive to the directionality of this computation. To address this issue, we tested 8-month-old infants in a word segmentation task, using fluent speech drawn from an unfamiliar natural language. Critically, test items were distinguished solely by their backward transitional probabilities. The results provide the first evidence that infants track backward statistics in fluent speech.
Article
Numerous studies over the past decade support the claim that infants are equipped with powerful statistical language learning mechanisms. The primary evidence for statistical language learning in word segmentation comes from studies using artificial languages, continuous streams of synthesized syllables that are highly simplified relative to real speech. To what extent can these conclusions be scaled up to natural language learning? In the current experiments, English-learning 8-month-old infants' ability to track transitional probabilities in fluent infant-directed Italian speech was tested (N = 72). The results suggest that infants are sensitive to transitional probability cues in unfamiliar natural language stimuli, and support the claim that statistical learning is sufficiently robust to support aspects of real-world language acquisition.
Article
Learners rely on a combination of experience-independent and experience-dependent mechanisms to extract information from the environment. Language acquisition involves both types of mechanisms, but most theorists emphasize the relative importance of experience-independent mechanisms. The present study shows that a fundamental task of language acquisition, segmentation of words from fluent speech, can be accomplished by 8-month-old infants based solely on the statistical relationships between neighboring speech sounds. Moreover, this word segmentation was based on statistical learning from only 2 minutes of exposure, suggesting that infants have access to a powerful mechanism for the computation of statistical properties of the language input.
Article
Infants' long-term retention of the sound patterns of words was explored by exposing them to recordings of three children's stories for 10 days during a 2-week period when they were 8 months old. After an interval of 2 weeks, the infants heard lists of words that either occurred frequently or did not occur in the stories. The infants listened significantly longer to the lists of story words. By comparison, a control group of infants who had not been exposed to the stories showed no such preference. The findings suggest that 8-month-olds are beginning to engage in long-term storage of words that occur frequently in speech, which is an important prerequisite for learning language.
Article
Although children's knowledge of the sound patterns of words has been a focus of debate for many years, little is known about the lexical representations very young children use in word recognition. In particular, researchers have questioned the degree of specificity encoded in early lexical representations. The current study addressed this issue by presenting 18-23-month-olds with object labels that were either correctly pronounced, or mispronounced. Mispronunciations involved replacement of one segment with a similar segment, as in 'baby-vaby'. Children heard sentences containing these words while viewing two pictures, one of which was the referent of the sentence. Analyses of children's eye movements showed that children recognized the spoken words in both conditions, but that recognition was significantly poorer when words were mispronounced. The effects of mispronunciation on recognition were unrelated to age or to spoken vocabulary size. The results suggest that children's representations of familiar words are phonetically well-specified, and that this specification may not be a consequence of the need to differentiate similar words in production.
Article
Many studies have shown that listeners can segment words from running speech based on conditional probabilities of syllable transitions, suggesting that this statistical learning could be a foundational component of language learning. However, few studies have shown a direct link between statistical segmentation and word learning. We examined this possible link in adults by following a statistical segmentation exposure phase with an artificial lexicon learning phase. Participants were able to learn all novel object-label pairings, but pairings were learned faster when labels contained high probability (word-like) or non-occurring syllable transitions from the statistical segmentation phase than when they contained low probability (boundary-straddling) syllable transitions. This suggests that, for adults, labels inconsistent with expectations based on statistical learning are harder to learn than consistent or neutral labels. In contrast, a previous study found that infants learn consistent labels, but not inconsistent or neutral labels.
  • F Karaman
  • J Lany
  • J F Hay
F. Karaman, J. Lany, J. F. Hay / Cognitive Science 48 (2024)