ArticlePDF Available

Enduring musician advantage among former musicians in prosodic pitch perception

Authors:

Abstract and Figures

Musical training has been associated with various cognitive benefits, one of which is enhanced speech perception. However, most findings have been based on musicians taking part in ongoing music lessons and practice. This study thus sought to determine whether the musician advantage in pitch perception in the language domain extends to individuals who have ceased musical training and practice. To this end, adult active musicians (n = 22), former musicians (n = 27), and non-musicians (n = 47) were presented with sentences spoken in a native language, English, and a foreign language, French. The final words of the sentences were either prosodically congruous (spoken at normal pitch height), weakly incongruous (pitch was increased by 25%), or strongly incongruous (pitch was increased by 110%). Results of the pitch discrimination task revealed that although active musicians outperformed former musicians, former musicians outperformed non-musicians in the weakly incongruous condition. The findings suggest that the musician advantage in pitch perception in speech is retained to some extent even after musical training and practice is discontinued.
This content is subject to copyright. Terms and conditions apply.
1
Vol.:(0123456789)
Scientic Reports | (2023) 13:2657 | https://doi.org/10.1038/s41598-023-29733-3
www.nature.com/scientificreports
Enduring musician advantage
among former musicians
in prosodic pitch perception
Xin Ru Toh
1, Shen Hui Tan
1, Galston Wong
2, Fun Lau
1 & Francis C. K. Wong 1*
Musical training has been associated with various cognitive benets, one of which is enhanced speech
perception. However, most ndings have been based on musicians taking part in ongoing music
lessons and practice. This study thus sought to determine whether the musician advantage in pitch
perception in the language domain extends to individuals who have ceased musical training and
practice. To this end, adult active musicians (n = 22), former musicians (n = 27), and non-musicians
(n = 47) were presented with sentences spoken in a native language, English, and a foreign language,
French. The nal words of the sentences were either prosodically congruous (spoken at normal
pitch height), weakly incongruous (pitch was increased by 25%), or strongly incongruous (pitch was
increased by 110%). Results of the pitch discrimination task revealed that although active musicians
outperformed former musicians, former musicians outperformed non-musicians in the weakly
incongruous condition. The ndings suggest that the musician advantage in pitch perception in speech
is retained to some extent even after musical training and practice is discontinued.
Musical training has been associated with various cognitive enhancements1, making it an attractive enrich-
ment and intervention activity. In the language domain, one notable nding is that musical training is linked
to an advantage in speech perception. For instance, musicians are better than non-musicians at perceiving
speech in noisy conditions even in older adulthood2,3. In particular, a vast amount of literature has documented
positive music-to-language cross-domain transfer eects in pitch processing, which undergirds Patel’s OPERA
hypothesis4. e OPERA hypothesis describes how musical training benets the neural encoding of speech when
ve requirements are satised: there is an overlap in the brain networks employed to process an acoustic feature
common to music and speech; the processing of the shared acoustic feature occurs at higher precision in music
than in speech; and the musical activities evoke strong positive emotion, have frequent repetition, and encompass
focused attention. e OPERA hypothesis may account for the superior pitch processing abilities in speech seen
in musically trained individuals, as pitch is a basic acoustic property found in both music and speech. While pitch
dierences are used to form melodies in music, they are used to convey contrastive meaning via lexical tones,
stress, and intonation in speech. To augment the OPERA hypothesis, this study seeks to explicate whether the
musician advantage persists in former musicians who have ceased musical training and practice.
e OPERA hypothesis is well supported by empirical studies comparing musicians and non-musicians. Stud-
ies have found that among individuals with no tone language experience, musicians outperform non-musicians
in lexical tone perception515. In addition, musicians without tone language experience show enhanced brainstem
and cortical encoding when listening to lexical tones14,16,17. Yet, it is uncertain whether the musician advantage
in lexical tone perception also exists among tone language speakers. It was previously found that for English or
French speakers, musicians outperform non-musicians in Cantonese tone discrimination, whereas for Cantonese
speakers, musicians and non-musicians both show ceiling eects18. On the other hand, it was also found that
Cantonese musicians outperform Cantonese non-musicians in the discrimination and identication of merging
Cantonese tone pairs, especially the most dicult Tone 2/Tone 5 contrast19. More recently, Toh etal.20 found that
even among speakers of a tone language, those who have received musical training outperform non-musicians
in non-native lexical tone perception. Apart from lexical tone perception, studies have found that among indi-
viduals with no tone language experience, musicians are better than non-musicians at perceiving stress, which is
indicated by a combination of pitch, duration, and intensity variations21,22. However, it remains unclear whether
the musician advantage in stress perception also applies to tone language speakers. Among English speakers,
musicians outperform non-musicians in English stress perception, while Cantonese-English bilingual musicians
OPEN
1Linguistics and Multilingual Studies, School of Humanities, Nanyang Technological University, Singapore,
Singapore. 2School of Brain and Behavioral Sciences, The University of Texas at Dallas, Dallas, TX, USA. *email:
franciswong@ntu.edu.sg
Content courtesy of Springer Nature, terms of use apply. Rights reserved
2
Vol:.(1234567890)
Scientic Reports | (2023) 13:2657 | https://doi.org/10.1038/s41598-023-29733-3
www.nature.com/scientificreports/
and non-musicians perform equally well23. Tone language experience has been linked to enhanced pitch percep-
tion abilities in speech6,2430. As such, one possible explanation for the conicting ndings is that the musician
advantage for pitch perception in speech applies across speakers irrespective of language background, but the
more subtle eect among tone language speakers is likely to be masked by ceiling-like performance in tasks that
are not suciently sensitive.
Besides lexical tones and stress, another aspect in which there is mounting evidence for the musician advan-
tage is prosody. A series of studies have consistently found that musicians outperform non-musicians in detecting
pitch contour expectancy violations3135. is research paradigm, rst created by Schön etal.31, is designed by
manipulating the fundamental frequency of either the nal notes of musical phrases or nal words of linguistic
phrases. In particular, the weakly incongruous condition entails a small pitch change which is dicult to detect,
and hence evaluates pitch perception in a more ne-grained manner. rough both behavioural and electrophysi-
ological measures, they found that adult musicians detected these pitch variations better than non-musicians
in not only music but also their native language, thereby lending support for a domain-general pitch processing
mechanism. is nding was reinforced in follow-up cross-sectional and longitudinal studies32,33, in which they
found similar group dierences among 8-year-old children, despite the fact that the children musicians received
a shorter duration of musical training than the adult musicians in the original study. eir nding was also
expanded in follow-up studies introducing unfamiliar language contexts34,35, in which they found that partici-
pants across groups found it more dicult to detect pitch changes in a non-native language or pseudolanguage
than in their native language. e researchers posited that understanding the semantic content and being familiar
with intonational contours in sentences might help with anticipating and detecting pitch changes in ones native
language. at said, the researchers found that musicians held an advantage over non-musicians in detecting
prosodic pitch violations across native and non-native language contexts. Moreover, behavioural studies have
found that musicians outperform non-musicians in matching spoken utterances to their intonation melodies36
and identifying emotional prosody in speech3739. Interestingly, similar results were seen in a longitudinal study
with 6-year-old children, with those who were randomly assigned to receive 1year of musical training in the
form of keyboard or vocal lessons outperforming those who received no lessons when tested on the identica-
tion of emotional prosody in speech37. Collectively, these studies substantiate the notion that musical training
facilitates speech perception at not only the segmental but also supra-segmental level.
Furthermore, neurological studies suggest that musical training is linked to structural and functional dif-
ferences in the brain4050. Notably, the eects of musical training on brain development seem to be causal in
nature33,4547. For instance, Hyde etal.46,47 randomly assigned 6-year-old children without any behavioural or
brain dierences in pre-tests to receive either 15months of musical training or no training. ey found that only
those who received musical training showed structural brain changes in motor and auditory areas which were
correlated with behavioural improvements on melodic and rhythmic discrimination tests. ese studies suggest
that there may be musical training-induced brain plasticity eects that could potentially translate to long-lasting
cognitive impacts. While the data on ageing and musicianship remains scant, there is emerging evidence that
an age-related decline in auditory perception may be mitigated by musical training among lifelong musicians
who maintain regular musical practice. Older and younger adult musicians outperform non-musicians in vari-
ous auditory processing abilities, such as detecting speech-in-noise and mistuned harmonics, assessed using
neurophysiological and behavioural measures5153.
In light of the above ndings, musical training does appear to facilitate speech perception, providing empiri-
cal evidence for the OPERA hypothesis4. A critical question to consider is whether the OPERA hypothesis can
be extended to former musicians. Studies on the eects of musical training typically characterise musicians as
individuals with 6years of musical training and ongoing instrumental practice for 1h a week54. However,
such professional musicians may not represent the general population in which many individuals who take
up music lessons in childhood eventually do not commit to it55,56. Although there has been extensive research
on professional active musicians, more research needs to be done with individuals who choose not to pursue
musicianship professionally but nonetheless have had some musical experience. Of particular interest is whether
cognitive benets such as in speech perception persist even aer musical training and practice is discontinued.
Qualifying the extent of the inuence of musical training among individuals who have undergone music attri-
tion will serve to not only provide insight on the generalisability of the OPERA hypothesis, but also inform the
eectiveness of musical training as a means of improving cognitive and linguistic abilities in the long-term, as
well as protecting against age-related cognitive decline.
As noted by Costa-Giomi56, few studies to date have investigated whether cognitive advantages exist in the
long term aer musical training and practice is discontinued. Costa-Giomi and Ryan58 (as cited in Costa-Giomi57)
conducted a longitudinal study in which children in the experimental group received 3years of piano lessons.
Seven years aer musical training was discontinued, the researchers found no dierences in IQ or memory
between the adults who had and had not received childhood musical training, suggesting that musical training
does not result in permanent cognitive benets. Nevertheless, the researchers postulated that the lack of long-
lasting cognitive improvements may have been due to low attendance and time spent practising the musical
instrument55,58. In contrast, two behavioural studies found improved performance in various cognitive tasks
such as IQ59 and executive functions60 in adulthood even aer musical training and practice had ceased, sug-
gesting that musical training has long-term benets and contributes to the establishment of a cognitive reserve.
However, the measures used in these behavioural studies have focused on general cognitive abilities rather than
speech perception abilities specically.
In terms of auditory perception, two brain imaging studies havefound that musical training in early child-
hood provide sustained enhanced neural processing of auditory stimuli in adulthood aer musical training and
practice had ceased. Skoe and Kraus61 found that young adults who had received musical training in childhood
showed more robust signal-to-noise ratio brainstem responses to pure tones, as compared to non-musicians.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
3
Vol.:(0123456789)
Scientic Reports | (2023) 13:2657 | https://doi.org/10.1038/s41598-023-29733-3
www.nature.com/scientificreports/
White-Schwoch etal.62 found that older adults with a greater number of years of musical training in childhood
or young adulthood showed faster neural timing in response to consonant–vowel transitions in speech syllables
presented in quiet and noise, compared to older adults with fewer number of years of musical training or no
musical training at all. Although these two brain imaging studies suggest that music-related neuroplasticity is
maintained even aer music attrition, studies have yet to investigate if these neural traces translate to a clear
behavioural advantage in acoustic processing of speech stimuli. is is a critical research gap that the present
study aims to bridge.
In sum, although the literature has generally established that professional musicians have an advantage over
non-musicians in pitch perception abilities in the language domain, it remains inconclusive whether this musi-
cian advantage would also be observed behaviourally among individuals who have ceased musical training and
practice. at being the case, the overarching aim of our study is to add to the OPERA hypothesis and elucidate
whether a potential music-to-language transfer eect exists among former musicians. To this end, our study
compared active musicians, former musicians, and non-musicians in their ability to perform a well-replicated
experimental task—detecting linguistic prosodic pitch violations.
Although there has been a burgeoning number of studies revolving around various types of pitch percep-
tion in speech, such as lexical tones and stress, we were theoretically motivated to focus on prosody for several
reasons. Firstly, prosody is oen described as “the music of speech63, thereby making it an obvious candidate for
the present study on music-to-language transfer. Patel himself has called attention to the fact that both melody
in music and prosody in speech rely primarily on the same acoustic parameter of pitch contour, with the former
necessitating more precise acoustic processing than the latter64. is overlap in neural resources has been dem-
onstrated in the studies outlined above, in which musicians tend to surpass non-musicians in prosodic pitch
perception. On top of that, Patel and other researchers have shown that individuals on the other end of the spec-
trum with a musical disorder known as amusia exhibit decits in perceiving speech prosody6467. Accordingly,
speech prosody is of exceptional relevance to the OPERA hypothesis. Secondly, unlike lexical tones which are
only of pertinence to tone languages, speech prosody is an important aspect of all languages, thereby making it a
universal topic of interest with great practical signicance. Broadly speaking, prosody signals speaker intention
and meaning, imparting crucial information pertaining to syntax and pragmatics68. Research on rst language
acquisition has found that prosodic sensitivity is related to literacy skills69, reading comprehension70,71, and speech
comprehension7274. In a similar vein, research on second language acquisition in children and adults has found
that prosodic sensitivity might facilitate the learning of word order and new vocabulary75,76, while exposure to
prosodic features of the target language apparently improves second language prociency and uency77,78. e
ndings yielded from this study will therefore have important pedagogical implications for language and literacy
skills as well as foreign language learning.
In order to study prosodic pitch perception, we chose to adopt the well-replicated prosodic pitch contour
expectancy violation task, as it has consistently demonstrated the musician advantage in dierent age groups and
languages with robust ndings. Given that previous studies revealed a trend in which participants, regardless of
musicianship, showed superior performance in detecting prosodic pitch violations in a familiar language rela-
tive to an unfamiliar language34,35, two dierent language contexts were implemented in the present study. We
included a non-native language context in part to help circumvent a problem we anticipated; namely, that tasks
using native language stimuli might not be adequately sensitive to tease apart group dierences18,23, especially for
tone language speakers. Furthermore, by introducing both native and non-native language contexts, we hoped
to examine music-to-language transfer eects both with and without the top-down inuence from other types
of linguistic processing, allowing us to better assess the generalisability of the eects. Finally, the two language
contexts mirror rst language competence and second language learning respectively, shedding light on the
practical application of the enduring music-to-language transfer eects in former musicians, if any.
Method
Participants. Participants were recruited to take part in the study via an online screening questionnaire.
ey were between 19 and 42years old (M = 23.04, SD = 3.90), with normal hearing based on an audiometric test
(25dB HL for octave frequencies from 500 to 4000Hz). All of the participants were either native Singaporeans
or had lived in Singapore for at least 10years to ensure that they were familiar with the local accented variety of
English. ey had no formal exposure to the French language, the non-native speech stimuli used in this study.
A total of 127 individuals participated in this study. Data from 31 participants was excluded due to the fol-
lowing cases: (a) participants with self-reported exposure to French (n = 8); (b) participants who had between
2and 6years of musical training experience (n = 23).
e nal dataset consisted of 96 participants. ey were classied into three groups based on information
obtained from a self-report questionnaire on their language and music background. In this study, active musi-
cians consisted of those who had had at least 6years of musical training and were still currently maintaining
a consistent practice schedule of at least 3h per week in the past 2years (n = 22). On the other hand, former
musicians referred to those who similarly had at least 6years of musical training but had stopped maintaining a
regular practice schedule for at least 2years (n = 27). Finally, non-musicians referred to those who had had less
than 2years of musical training (n = 47). ose with musical training predominantly had experience in string,
wind, and vocal musical training. None of the participants were musicians by profession. Reecting the diver-
sity of multilingualism in the local population, the majority of the participants were procient in English and
Mandarin Chinese (n = 86), while several were procient in English and a second language other than Mandarin
Chinese, specically Malay (n = 3), Tamil (n = 5), Tagalog (n = 1), and Burmese (n = 1). e representation of
non-Mandarin Chinese speakers was similar across groups, χ2(2) = 2.355, p = 0.308.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
4
Vol:.(1234567890)
Scientic Reports | (2023) 13:2657 | https://doi.org/10.1038/s41598-023-29733-3
www.nature.com/scientificreports/
To validate the grouping, participants’ general musical abilities were assessed using the Musical Ear Test
(MET)79. e MET consisted of two components: the melody subtest and the rhythm subtest. For each subtest,
participants listened to 52 pairs of phrases, and had to judge whether the second phrase was the “same” or “dif-
ferent” compared to the rst phrase. Half of the trials were “same” trials and the other half were “dierent” trials.
e “dierent” trials involved a pitch violation in the melody subtest and a rhythmic change in the rhythm subtest.
e MET stimuli were delivered via headphones, and participants gave their responses on an accompanying
answer sheet. All participants completed the melody subtest followed by the rhythm subtest. Table1 shows the
nal sample and descriptive information of each participant group.
Materials and procedures
e research procedures were approved by the Institutional Review Board at the Nanyang Technological Univer-
sity. All research methods were performed in accordance with the relevant guidelines and regulations. Written
informed consent was obtained from all participants and/or their legal guardians before participation.
Aer providing their written informed consent, participants were seated comfortably in a soundproof booth,
and undertook two experimental tasks. Firstly, the participants completed a two-choice speech pitch discrimi-
nation task. e English and French language blocks were counterbalanced across participants, with half of the
participants presented with the English set rst, and the other half with the French set rst. Secondly, the partici-
pants completed a general musical abilities test, i.e., the MET. Short breaks were given between tasks to prevent
fatigue. e total length of time for participation was approximately 1h, and the participants were monetarily
compensated for their time upon successful completion of the experiment.
Participants’ linguistic perception abilities were assessed using a pitch violation discrimination task that has
been well-replicated in the literature3135. For the pitch discrimination task, 40 spoken declarative sentences in
English and French respectively were recorded to form the experimental speech stimuli (see Supplementary
Tables1 and 2). e sentences were compiled and modied from a combination of sources, including the Har-
vard Sentences database80 for the English stimuli and Smith’s paper81 for the French stimuli, with the nal word
in each sentence being disyllabic as in Marques etal.’s study34. Two female speakers, one native in Singapore
English and the other in French, voiced the English and French sentences respectively at a normal speaking rate.
e recorded sentences were then digitised (sampling at 44.1kHz and 16 bit) using Audacity® Version 2.0.5.082.
For each language, there were three dierent auditory conditions, and 40 sentences were presented in each
auditory condition, thus leading to a total of 120 sentences. e nal word of each sentence was either prosodi-
cally congruous, weakly incongruous, or strongly incongruous. In the prosodically incongruous conditions, the
pitch (F0) of the nal words was increased using Praat83, such that there was a local pitch manipulation on the
nal words (+ 25% in the weakly incongruous condition, + 110% in the strongly incongruous condition) while
maintaining the original natural global pitch contour (Fig.1). e pitch increases used in the present study dier
from those used in past studies (+ 35% in the weakly incongruous condition, + 120% in the strongly incongru-
ous condition)3135. Preliminary pilot testing using conventional pitch increase values revealed a ceiling eect
among our Singaporean participants, likely because enhanced pitch perception abilities in speech have been
associated with bilingualism84 and tone language experience6,2430. As such, we reduced the pitch incongruity
Table 1. Participant group demographics. Mean values and standard deviations (in parentheses) are given
for age, musical background, and musical abilities. For the group dierences on musical abilities, one-way
ANOVA and pairwise comparisons are Bonferroni corrected.
Participant group Group dierences on musical
abilities (MET scores) as revealed
by one-way ANOVAActive musicians (AM) Former musicians (FM) Non-musicians (NM)
N 22 27 47 N/A
Age 21.45 (2.87) 23.07 (3.93) 23.77 (4.13) N/A
No. of years of musical training 11.00 (3.80) 9.37 (3.12) 0.09 (0.28)
F(2,93) = 202.564, p < 0.001
AM > NM (p < 0.001)
AM = FM (p = 0.069)
FM > NM (p < 0.001)
No. of practice hours per week within
the past two years 5.91 (2.51) 0.70 (0.77) N/A N/A
No. of years since musical training
was discontinued 0.09 (0.29) 5.26 (3.44) N/A N/A
MET melody (%) 85.58 (7.43) 77.99 (7.92) 69.23 (7.38)
F(2,93) = 37.299, p < 0.001
AM > NM (p < 0.001)
AM > FM (p = 0.002)
FM > NM (p < 0.001)
MET rhythm (%) 78.76 (7.93) 73.29 (7.77) 66.98 (9.49)
F(2,93) = 14.579, p < 0.001
AM > NM (p < 0.001)
AM = FM (p = 0.093)
FM > NM (p = 0.010)
Language background 20 English-Mandarin bilingual
1 English-Malay bilingual
1 English-Tagalog bilingual
26 English-Mandarin bilingual
1 English-Tamil bilingual
40 English-Mandarin bilingual
2 English-Malay bilingual
4 English-Tamil bilingual
1 English-Burmese bilingual
Content courtesy of Springer Nature, terms of use apply. Rights reserved
5
Vol.:(0123456789)
Scientic Reports | (2023) 13:2657 | https://doi.org/10.1038/s41598-023-29733-3
www.nature.com/scientificreports/
in order to increase the diculty of the task, and preliminary pilot testing using our modied pitch increase
values obtained pitch discrimination accuracy rates across the experimental conditions which were similar to
those found by Marques etal.34.
Participants listened to the speech stimuli via headphones. ey were briefed that they would be listening
to either English or French sentences, and that comprehension of the sentences was not required. In each trial,
participants were asked to judge whether the nal word of each sentence sounded normal (congruous condition)
or strange (weakly incongruous or strongly incongruous conditions). Responses were recorded via a keyboard
press, “N” or “S” respectively. Participants were asked to provide a response within 3s. e practice phase
consisted of 6 trials, with feedback provided at the end of each trial to indicate if the participants had answered
correctly. e experimental phase consisted of 120 trials, broken up into four blocks of 30 sentences each. Sen-
tence blocks were counterbalanced across participants; half of the participants in each group heard blocks one
and two rst, while the other half heard blocks three and four rst. Sentences from each experimental condition
occurred equally frequently within each block and in pseudorandom order. Up to three consecutive “strange
trials were allowed within each block, while pitch-manipulated variants of the same sentence were not allowed
to occur within the same block.
Results
A 2
×
3
×
3 mixed ANOVA was conducted with pitch discrimination accuracy as the dependent variable, language
(native vs. non-native) and prosodic congruity (congruous vs. weakly incongruous vs. strongly incongruous)
as the within-subject factors, and music group (active musicians vs. former musicians vs. non-musicians) as
the between-subject factor. As Mauchly’s Test indicated that the assumption of sphericity had been violated
for the prosodic congruity eect, χ2(2) = 242.497, p < 0.001, and the language by prosodic congruity eect,
χ2(2) = 153.767, p < 0.001, Greenhouse–Geisser correction was applied, ε = 0.519 and ε = 0.552 respectively. As
Box’s M Test indicated that the assumption of equality of covariance had been violated, Box’s M = 249.251,
F = 5.307, p < 0.001, Pillai’s Trace was used. For all pairwise comparisons, Bonferroni correction was applied.
The interaction effect between prosodic congruity and music group was statistically significant,
F(2.074,96.456) = 10.124, p < 0.001 (Fig.2). e main source of the interaction eect as revealed by simple eect
analyses was from the weakly incongruous condition, F(2,93) = 13.877, p < 0.001; the mean pitch discrimination
accuracy was signicantly dierent between active musicians (68%), former musicians (54%), and non-musicians
(42%). Pairwise comparisons revealed that active musicians outperformed former musicians (p = 0.034) and
non-musicians (p < 0.001), while former musicians also outperformed non-musicians (p = 0.042).
e eect of music group was signicant for the strongly incongruous condition, F(2,93) = 3.293, p = 0.042; the
mean pitch discrimination accuracy was signicantly dierent between active musicians (99%), former musicians
(98%), and non-musicians (96%). However, pairwise comparisons revealed no signicant dierences between
groups aer Bonferroni correction. Active musicians did not dier from non-musicians (p = 0.060), and former
musicians diered from neither active musicians (p = 1.000) nor non-musicians (p = 0.257). Meanwhile. the eect
of music group was not signicant for the congruous condition, F(2,93) = 0.767, p = 0.467.
e three-way interaction between language and prosodic congruity and music group was not signi-
cant, F(2.207,102.648) = 0.335, p = 0.737; neither was the interaction between language and music group,
F(2.000,93.000) = 0.615, p = 0.543.
There was also a significant interaction effect between language and prosodic congruity,
F(1.104,102.648) = 39.450, p < 0.001 (Fig.3). e eect of language was signicant for the weakly incongruous
condition, F(1.000,93.000) = 63.833, p < 0.001, where participants showed higher pitch discrimination accuracy
Figure1. Fundamental frequency (F0 in Hz) for a sample sentence in the three prosodic conditions.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
6
Vol:.(1234567890)
Scientic Reports | (2023) 13:2657 | https://doi.org/10.1038/s41598-023-29733-3
www.nature.com/scientificreports/
in their native language English (66%) than in their non-native language French (43%). e eect of language
was also signicant for the congruous condition, F(1.000,93.000) = 8.816, p = 0.004, where participants showed
higher pitch discrimination accuracy in their native language English (98%) than in their non-native language
French (95%). However, the eect of language was not signicant for the strongly incongruous condition,
F(1.000,93.000) = 0.008, p = 0.931).
Signicant main eects were found for language, F(1.000,93.000) = 85.129, p < 0.001, prosodic congruity,
F(1.037,96.456 = 376.582, p < 0.001, and music group, F(2,93) = 14.275, p < 0.001. Participants showed higher pitch
discrimination accuracy in their native language English (87%) than in their non-native language French (79%).
e weakly incongruous condition (52%) was the most dicult to detect compared to the congruous condition
(96%) and strongly incongruous condition (97%). Active musicians (88%) and former musicians (83%) showed
higher pitch discrimination accuracy compared to non-musicians (78%).
Figure2. Pitch discrimination accuracy of active musicians, former musicians, and non-musicians in the three
prosodic conditions. Error bars denote standard error. *p < 0.05, ***p < 0.001.
Figure3. Pitch discrimination accuracy for the native language English and non-native language French in the
three prosodic conditions. Error bars denote standard error. **p < 0.01, ***p < 0.001.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
7
Vol.:(0123456789)
Scientic Reports | (2023) 13:2657 | https://doi.org/10.1038/s41598-023-29733-3
www.nature.com/scientificreports/
Discussion
e present study is one of the rst to ascertain whether individuals who have discontinued musical training
and practice retain a behavioural advantage over non-musicians in pitch perception abilities in speech. Our key
nding is that there was a signicant interaction eect between prosodic congruity and music group. In the
weakly incongruous condition where pitch deviations were small and dicult to detect, our results showed a
stepwise progression in pitch discrimination accuracy, with active musicians having better performance than
former musicians, who in turn had better performance than non-musicians.
Our nding of an advantage by musicians over non-musicians in pitch discrimination echoes past ndings
that musical training facilitates pitch perception in the language domain523,3139, thereby pointing towards a com-
mon domain-general pitch processing mechanism in music and speech perception as described in the OPERA
hypothesis4. Our nding also coheres with ndings of long-lasting neural changes from past musical training in
young adults61 and older adults62, as well as ndings of improved cognitive performance in adulthood even aer
ceasing musical training59,60. Taken together, it appears that former musicians share similar neural enhancement
as active musicians, and that the sharpened neural processing translates to perceptual benets behaviourally.
One explanation is that musical training requires individuals to attend to subtle sound contrasts, such as in pitch
and duration. Consequently, musicians become more sensitive to such subtle acoustic cues, which has a positive
spillover eect when discriminating similar contrasts in speech. Musical training contributes to the establish-
ment of cognitive enhancement, such that there are some enduring cross-domain transfer benets of musical
training on the discrimination of subtle speech contrasts even aer musical training and practice is discontinued.
More importantly, our nding that former musicians diered from active musicians qualies the extent of the
positive music-to-language transfer eects. Drawing on previous studies showing a clear behavioural advantage
held by adult musicians over non-musicians in a pitch contour violation task similar to that used in the present
study31,34, it appears that former musicians retain some musician advantage, but that such advantage may fade
over time aer musical training and practice is discontinued. Parallel results were seen in the data from the
musical abilities tests. ere were signicant group dierences in MET melody scores (Table1), where active
musicians had better musical abilities than former musicians, and former musicians in turn had better musical
abilities than non-musicians. e dierence in pitch discrimination and musical abilities between active musi-
cians and former musicians cannot be attributed to the length of musical training, which was similar for the two
groups as seen in the pairwise comparison (Table1). erefore, our data suggests that music attrition in pitch
perception manifests in both the music and language domains among former musicians. One possible explana-
tion is that subtle acoustic cues may no longer be behaviourally relevant in former musicians’ everyday auditory
environment, such that positive music-to-language transfer benets may diminish over time. is interpretation
is corroborated by exploratory Pearson correlations conducted to assess the relationship between the number
of years since discontinuing musical practice and other factors. As reported in the Supplementary Results,
in the sample of musicians both active and former, the number of years since discontinuing musical practice
was signicantly negatively correlated with pitch discrimination accuracy for the weakly incongruous prosodic
condition in the native language English and non-native language French (see Supplementary Figs.1 and 2), as
well as with MET melody and rhythm subtest performance. e longer the period since discontinuing musical
training and practice, the poorer one is at discriminating subtle violations in speech and music, indicating that
there might be a gradual attenuation in ner acoustic discrimination abilities among former musicians. Future
research can be conducted with older adults as participants, including former musicians who have discontinued
musical training and practice for a longer period, to examine the issue further.
We also found a signicant main eect for language and a signicant interaction eect between language
and prosodic congruity. Apart from a native language context with multiple possible sources of information that
might assist in pitch processing, we implemented a non-native language context without additional information
for participants to rely on. e interaction eect revealed that in both the congruous and weakly incongruous
conditions, participants were more accurate in prosodic pitch discrimination in their native language than in a
non-native language, consistent with previous ndings by Marques etal.34 and Deguchi etal.35. ere are two
possible explanations for this native language advantage, which Deguchi etal.35 investigated by introducing jab-
berwocky sentences that preserved the intonational contours of the native language but consisted of meaningless
legal pseudowords. ey found that participants were better at detecting pitch changes in their native language
than in jabberwocky, but were also better at detecting pitch changes in jabberwocky than in the non-native lan-
guage. is suggests that participants were familiar with typical intonational contours in their native language,
and were consequently better able to detect pitch changes in the native language and jabberwocky speech stimuli
but not in the non-native language speech stimuli. At the same time, participants could understand the mean-
ing of their native language, and for that reason were better able to anticipate when the nal word carrying the
pitch variation would occur in the native language speech stimuli but not in the jabberwocky and non-native
language speech stimuli. e native language advantage observed in our study can thus be explained by the fact
that participants were making use of both prosodic and semantic information to complete the pitch incongruity
detection task. Nonetheless, regardless of the language used for the speech stimuli, group dierences were seen
in the weakly incongruous condition. Active and former musicians were more accurate than non-musicians in
detecting prosodic pitch violation no matter whether they had prior knowledge of the language tested. In the
non-native language context which better isolated the prosodic pitch dimension without inuence from other
types of linguistic processing, participants would not have been able to exploit additional top-down processing
frameworks and would have been relying solely on bottom-up pitch perceptual sensitivity to acoustic cues. In
the native language context, group dierences were also seen despite the fact that all participants were able to
draw on additional linguistic resources. e present study hence extends the OPERA hypothesis4 by underscor-
ing that—presumably due to their prior musical training with melodic pitch patterns—former musicians retain
Content courtesy of Springer Nature, terms of use apply. Rights reserved
8
Vol:.(1234567890)
Scientic Reports | (2023) 13:2657 | https://doi.org/10.1038/s41598-023-29733-3
www.nature.com/scientificreports/
enhanced underlying pitch processing abilities, which generalise to the perception of prosodic pitch contours
in speech for both native and non-native languages.
A caveat to keep in mind, though, is that although there is a large body of evidence in which musicians—be it
active and former—outperform non-musicians in various pitch perception tasks, it may not be straightforward
to conclude that the so-called musician advantage is a result of musical training. It is plausible that the results
reported in this present study may be driven by a third, unexplored factor, such as general intelligence, educa-
tion background, or socioeconomic status. On top of that, as with most of the previously reported literature, this
present study adopted a cross-sectional design comparing dierent population groups at a specic point in time.
In recent years, some researchers have propounded the idea that inherent musical abilities, rather than musical
training, might be linked to enhanced speech perception30,85,86. Ergo, the music-to-language transfer eects that
we speak of may be a consequence of pre-existing dierences and self-selection, as opposed to a consequence
of musical training per se. In other words, individuals pre-disposed with superior auditory or pitch processing
abilities to begin with may be more inclined to pick up and continue musical training, such that the dierences
observed between active, former, and non-musicians later in life may not be a direct outcome of musical train-
ing in and of itself.
However, as highlighted in the introduction, there is some compelling evidence in the existing literature that
musical training has a causal inuence on brain development and pitch perception. Participants initially matched
in musical aptitude, general intelligence, and socioeconomic status have been shown to demonstrate group dif-
ferences in neurological and behavioural post-tests related to pitch perception depending on the training they
are randomly assigned to33,4547. Of particular relevance to our study, Moreno etal.33 conducted a longitudinal
experimental study with 8-year-old children without any prior musical training. Pre-tests conrmed that the
children were initially matched in pitch perception performance, general cognitive abilities, as well as socio-
economic status. ese children were then randomly assigned to receive 6months of either musical training
or painting training. e researchers recorded both electrophysiological and behavioural measures for a pitch
violation discrimination task similar to that used in this present study. ey found that children who received
musical training, but not those who received painting training, showed improved prosodic pitch discrimina-
tion abilities in speech. Along the same lines, Nan etal.45 randomly assigned 4- to 5-year-old children with
tone language experience to receive 6months of piano training, reading training, or no training. Although the
children were initially matched in general cognitive abilities and socioeconomic status, and although all groups
showed improvements in general cognitive abilities, only those who received piano training showed enhanced
cortical responses to pitch changes in music and speech which were correlated with behavioural performance.
ese ndings suggest that musical training can indeed cause experience-dependent transfer eects that cannot
be attributed to external factors or pre-existing dierences, while our study further suggests that some transfer
eects may be retained even aer musical training and practice is discontinued. Having said that, future research
can strengthen our nding by performing an intervention study with longitudinal randomised controlled trials
to track and compare the eects of long-term, short-term, and no musical training among individuals who are
otherwise matched on other variables.
In conclusion, our study shows that musical training confers positive cross-domain benets in speech per-
ception, adding to the body of literature on music-to-language transfer and suggesting that there is a common
pitch processing mechanism underlying pitch perception in the two domains. More importantly, our results
further show that these benets may be retained to some extent over time, such that former musicians show
some behavioural advantage over non-musicians even aer the discontinuation of musical training and practice.
Situated within the OPERA hypothesis4, it appears that musical training alters the shared neural networks for
music and speech in a long-lasting manner, such that the musician advantage applies not only to active musicians
but to former musicians as well. Moreover, this advantage in prosodic pitch perception is seen with both native
and non-native languages. Possible future directions for research include using neurological and behavioural
measures to compare active musicians, former musicians, and non-musicians’ pitch perception abilities in the
language domain in other areas such as the perception of lexical tones, stress, and emotional prosody. Our nd-
ings have real-life implications for boosting rst language acquisition and foreign language learning, as well as
protecting against age-related cognitive and auditory decline in the ageing population. It appears that musical
training and practice can serve as an eective enrichment activity and intervention method to improve speech
perception, and that individuals can reap some long-lasting cognitive benets throughout their lifespan even
aer musical training and practice is discontinued.
Data availability
e dataset generated during and/or analysed during the current study is included in the Supplementary Infor-
mation le.
Received: 19 September 2022; Accepted: 9 February 2023
References
1. S chellenberg, E. G. & Weiss, W. M. Music and cognitive abilities. In e Psychology of Music (ed. Deutsch, D.) 499–550 (Academic
Press, 2013).
2. Parbery-Clark, A., Anderson, S., Hittner, E. & Kraus, N. Musical experience osets age-related delays in neural timing. Neurobiol.
Aging 33(1483), e1-4. https:// doi. org/ 10. 1016/j. neuro biola ging. 2011. 12. 015 (2012).
3. Parbery-Clark, A., Skoe, E., Lam, C. & Kraus, N. Musician enhancement for speech-in-noise. Ear. Hear. 30, 653–661. https:// doi.
org/ 10. 1097/ AUD. 0b013 e3181 b412e9 (2009).
Content courtesy of Springer Nature, terms of use apply. Rights reserved
9
Vol.:(0123456789)
Scientic Reports | (2023) 13:2657 | https://doi.org/10.1038/s41598-023-29733-3
www.nature.com/scientificreports/
4. Patel, A. D. Why would musical training benet the neural encoding of speech? e OPERA hypothesis. Front. Psychol. https://
doi. org/ 10. 3389/ fpsyg. 2011. 00142 (2011).
5. Alexander, J. A., Wong, P. C. M. & Bradlow, A. R. Lexical tone perception in musicians and non-musicians. In Proceedings of Proc.
Annual Conference of the International Speech Communication Association Interspeech (2005).
6. Burnham, D., Brooker, R. & Reid, A. e eects of absolute pitch ability and musical training on lexical tone perception. Psychol.
Music. 43, 881–897. https:// doi. org/ 10. 1177/ 03057 35614 546359 (2015).
7. Choi, W. e selectivity of musical advantage: Musicians exhibit perceptual advantage for some but not all Cantonese tones. Music
Percept. 37, 423–434. https:// doi. org/ 10. 1525/ MP. 2020. 37.5. 423 (2020).
8. Delogu, F., Lampis, G. & Belardinelli, M. O. From melody to lexical tone: Musical ability enhances specic aspects of foreign
language perception. Eur. J. Cogn. Psychol. 22, 46–61. https:// doi. org/ 10. 1080/ 09541 44080 27081 36 (2010).
9. Gottfried, T. L. & Riester, D. Relation of pitch glide perception and Mandarin tone identication. J. Acoust. Soc. Am. 108, 2604.
https:// doi. org/ 10. 1121/1. 47436 98 (2000).
10. Gottfried, T. L., Staby, A. M. & Ziemer, C. J. Musical experience and Mandarin tone discrimination and imitation. J. Acoust. Soc.
Am. 115, 2545. https:// doi. org/ 10. 1121/1. 47836 74 (2001).
11. Han, Y., Goudbeek, M., Mos, M. & Swerts, M. Mandarin tone identication by tone-naïve musicians and non-musicians in
auditory-visual and auditory-only conditions. Front. Commun. 4, 1–14. https:// doi. org/ 10. 3389/ fcomm. 2019. 00070 (2019).
12. Hung, T.-H. & Lee, C.-Y. Processing linguistic and musical pitch by English-speaking musicians and non-musicians. In 20th North
American Conference on Chinese Linguistics (2008).
13. Lee, C.-Y. & Hung, T.-H. Identication of Mandarin tones by English-speaking musicians and nonmusicians. J. Acoust. Soc. Am.
124, 3235–3248. https:// doi. org/ 10. 1121/1. 29907 13 (2008).
14. Marie, C. L., Delogu, F., Lampis, G., Belardinelli, M. O. & Besson, M. Inuence of musical expertise on segmental and tonal pro-
cessing in Mandarin Chinese. J. Cogn. Neurosci. 23, 2701–2715. https:// doi. org/ 10. 1162/ jocn. 2010. 21585 (2011).
15. Wayland, R. P., Herrera, E. & Kaan, E. Eects of musical experience and training on pitch contour perception. J. Phon. 38, 654–662.
https:// doi. org/ 10. 1016/j. wocn. 2010. 10. 001 (2010).
16. Bidelman, G. M., Gandour, J. T. & Krishnan, A. Cross-domain eects of music and language experience on the representation of
pitch in the human auditory brainstem. J. Cogn. Neurosci. 23, 425–434. https:// doi. org/ 10. 1162/ jocn. 2009. 21362 (2011).
17. Wong, P. C. M., Skoe, E., Russo, N. M., Dees, T. & Kraus, N. Musical experience shapes human brainstem encoding of linguistic
pitch patterns. Nat. Neurosci. 10, 420–422. https:// doi. org/ 10. 1038/ nn1872 (2007).
18. Mok, P. P. K. & Zuo, D. e separation between music and speech: Evidence from the perception of Cantonese tones. J. Acoust.
Soc. Am. 132, 2711–2720. https:// doi. org/ 10. 1121/1. 47470 10 (2012).
19. Ong, J. H., Wong, P. C. M. & Liu, F. Musicians show enhanced perception, but not production, of native lexical tones. J. Acoust.
Soc. Am. 148, 3443. https:// doi. org/ 10. 1121/ 10. 00027 76 (2020).
20. Toh, X. R., Lau, F. & Wong, F. C. K. Individual dierences in nonnative lexical tone perception: Eects of tone language repertoire
and musical experience. Front. Psychol. 13, 940363. https:// doi. org/ 10. 3389/ fpsyg. 2022. 940363 (2022).
21. Kolinsky, R., Cuvelier, H., Goetry, V., Peretz, I. & Morais, J. Music training facilitates lexical stress processing. Music Percept. 26,
235–246. https:// doi. org/ 10. 1525/ mp. 2009. 26.3. 235 (2009).
22. Choi, W. Towards a native OPERA hypothesis: Musicianship and English stress perception. Lang. Speech 65, 697–712. https:// doi.
org/ 10. 1177/ 00238 30921 10494 58 (2022).
23. Choi, W. What is “music” in music-to-language transfer? Musical ability but not musicianship supports Cantonese listeners’ English
stress perception. J. Speech Lang. Hear. Res. 65, 4047–4059. https:// doi. org/ 10. 1044/ 2022_ JSLHR- 22- 00175 (2022).
24. Lee, Y.-S., Vakoch, D. A. & Lee, H. W. Tone perception in Cantonese and Mandarin: A cross-linguistic comparison. J. Psycholinguist.
Res. 25, 527–542. https:// doi. org/ 10. 1007/ BF017 58181 (1996).
25. Morett, L. M. e inuence of tonal and atonal bilingualism on children’s lexical and non-lexical tone perception. Lang. Speech
63, 221–241. https:// doi. org/ 10. 1177/ 00238 30919 834679 (2020).
26. Qin, Z. & Mok, P. K. P. Discrimination of Cantonese tones by speakers of tone and non-tone languages. Kans. Work. Pap. Linguist.
34, 26–42. https:// doi. org/ 10. 17161/ KWPL. 1808. 12864 (2013).
27. Schaefer, V. & Darcy, I. Lexical function of pitch in the rst language shapes cross-linguistic perception of ai tones. Lab. Phonol.
5, 489–522. https:// doi. org/ 10. 1515/ lp- 2014- 0016 (2014).
28. Schaefer, V. & Darcy, I. Applying a newly learned second language dimension to the unknown: e inuence of second language
Mandarin tones on the naïve perception of ai tones. Psychol. Lang. Commun. 24, 90–123. https:// doi. org/ 10. 2478/ plc- 2020- 0007
(2020).
29. Wayland, R. P. & Guion, S. G. Training English and Chinese listeners to perceive ai tones: A preliminary report. Lang. Learn.
54, 681–712. https:// doi. org/ 10. 1111/j. 1467- 9922. 2004. 00283.x (2004).
30. Wayland, R. P. & Li, B. Eects of two training procedures in cross-language perception of tones. J. Phon. 36, 250–267. https:// doi.
org/ 10. 1016/j. wocn. 2007. 06. 004 (2008).
31. Schön, D., Magne, C. & Besson, M. e music of speech: Music training facilitates pitch processing in both music and language.
Psychophysiology 41, 341–349. https:// doi. org/ 10. 1111/ 1469- 8986. 00172.x (2004).
32. Magne, C., Schön, D. & Besson, M. Musician children detect pitch violations in both music and language better than nonmusician
children: Behavioral and electrophysiological approaches. J. Cogn. Neurosci. 18, 199–211. https:// doi. org/ 10. 1162/ 08989 29067
75783 660 (2006).
33. Moreno, S. et al. Musical training inuences linguistic abilities in 8-year-old children: More evidence for brain plasticity. Cereb.
Cortex 19, 712–723. https:// doi. org/ 10. 1093/ cercor/ bhn120 (2009).
34. Marques, C., Moreno, S., Castro, S. L. & Besson, M. Musicians detect pitch violation in a foreign language better than nonmusi-
cians: Behavioral and electrophysiological evidence. J. Cogn. Neurosci. 19, 1453–1463. ht tps:// doi. org/ 10. 1162/ jocn. 2007. 19.9. 1453
(2007).
35. Deguchi, C. et al. Sentence pitch change detection in the native and unfamiliar language in musicians and non-musicians: Behav-
ioral, electrophysiological and psychoacoustic study. Brain Res. 1455, 75–89. https:// doi. org/ 10. 1016/j. brain res. 2012. 03. 034 (2012).
36. ompson, W. F., Schellenberg, E. G. & Husain, G. Perceiving prosody in speech: Eects of music lessons. Ann. N. Y. Acad. Sci.
999, 530–532. https:// doi. org/ 10. 1196/ annals. 1284. 067 (2003).
37. ompson, W. F., Schellenberg, E. G. & Husain, G. Decoding speech prosody: Do music lessons help?. Emotion 4, 46–64. https://
doi. org/ 10. 1037/ 1528- 3542.4. 1. 46 (2004).
38. Farmer, E., Jicol, C. & Petrini, K. Musicianship enhances perception but not feeling of emotion from others’ social interaction
through speech prosody. Music Percept. 37, 323–338. https:// doi. org/ 10. 1525/ mp. 2020. 37.4. 323 (2020).
39. Lima, C. F. & Castro, S. L. Speaking to the trained ear: Musical expertise enhances the recognition of emotions in speech prosody.
Emotion 11, 1021–1031. https:// doi. org/ 10. 1037/ a0024 521 (2011).
40. Pantev, C. et al. Increased auditory cortical representation in musicians. Nature 392, 811–814. https:// doi. org/ 10. 1038/ 33918 (1998).
41. Pantev, C., Engelien, A., Candia, V. & Elbert, T. Representational cortex in musicians: Plastic alterations in response to musical
practice. Ann. N. Y. Acad. Sci. 930, 300–314. https:// doi. org/ 10. 1111/j. 1749- 6632. 2001. tb057 40.x (2001).
42. Schlaug, G. e brain of musicians: A model for functional and structural adaptation. Ann. N. Y. Acad. Sci. 930, 281–299. https://
doi. org/ 10. 1111/j. 1749- 6632. 2001. tb057 39.x (2001).
Content courtesy of Springer Nature, terms of use apply. Rights reserved
10
Vol:.(1234567890)
Scientic Reports | (2023) 13:2657 | https://doi.org/10.1038/s41598-023-29733-3
www.nature.com/scientificreports/
43. Bermudez, P. & Zatorre, R. J. Dierences in gray matter between musicians and nonmusicians. Ann. N. Y. Acad. Sci. 1060, 395–399.
https:// doi. org/ 10. 1196/ annals. 1360. 057 (2005).
44. Gaser, C. & Schlaug, G. Brain structures dier between musicians and non-musicians. J. Neurosci. 23, 9240–9245. https:// doi. org/
10. 1523/ JNEUR OSCI. 23- 27- 09240. 2003 (2003).
45. Nan, Y. et al. Piano training enhances the neural processing of pitch and improves speech perception in Mandarin-speaking
children. Proc. Natl. Acad. Sci. USA 115, 6630–6639. https:// doi. org/ 10. 1073/ pnas. 18084 12115 (2018).
46. Hyde, K. L. et al. e eects of musical training on structural brain development. Ann. N. Y. Acad. Sci. 1169, 182–186. https:// doi.
org/ 10. 1111/j. 1749- 6632. 2009. 04852.x (2009).
47. Hyde, K. L. et al. Musical training shapes structural brain development. J. Neurosci. 29, 3019–3025. https:// do i . org/ 10. 1523/ JNEUR
OSCI. 5118- 08. 2009 (2009).
48. Kraus, N., Skoe, E., Parbery-Clark, A. & Ashley, R. Experience-induced malleability in neural encoding of pitch, timbre, and tim-
ing. Ann. N. Y. Acad. Sci. 11691, 543–557. https:// doi. org/ 10. 1111/j. 1749- 6632. 2009. 04549.x (2009).
49. Wan, C. Y. & Schlaug, G. Music making as a tool for promoting brain plasticity across the lifespan. Neuroscientist 16, 566–577.
https:// doi. org/ 10. 1177/ 10738 58410 377805 (2010).
50. Neves, L., Correia, A. I., Castro, S. L., Martins, D. & Lima, C. F. Does music training enhance auditory and linguistic processing?
A systematic review and meta-analysis of behavioral and brain evidence. Neurosci. Biobehav. Rev. 140, 104777. https:// doi. org/ 10.
1016/j. neubi orev. 2022. 104777 (2022).
51. Zendel, B. R. & Alain, C. Musicians experience less age-related decline in central auditory processing. Psychol. Aging 27, 410–417.
https:// doi. org/ 10. 1037/ a0024 816 (2012).
52. Zendel, B. R. & Alain, C. e inuence of lifelong musicianship on neurophysiological measures of concurrent sound segregation.
J. Cogn. Neurosci. 25, 503–516. https:// doi. org/ 10. 1162/ jocn_a_ 00329 (2013).
53. Alain, C., Zendel, B. R., Hutka, S. & Bidelman, G. M. Tur ning down the noise: e benet of musical training on the aging auditory
brain. Hear. Res. 308, 162–173. https:// doi. org/ 10. 1016/j. heares. 2013. 06. 008 (2014).
54. Zhang, J. D., Susino, M., McPherson, G. E. & Schubert, E. e denition of a musician in music psychology: A literature review
and the six-year rule. Psychol. Music 48, 389–409. https:// doi. org/ 10. 1177/ 03057 35618 804038 (2020).
55. Costa-Giomi, E. Music instruction and children’s intellectual development: e educational context of music participation. In
Music, Health, and Wellbeing (eds MacDonald, R. et al.) 339–355 (Oxford University Press, 2012).
56. Costa-Giomi, E. e long-term eects of childhood music instruction on intelligence and general cognitive abilities. Update Appl.
Res. Music Educ. 33, 20–26. https:// doi. org/ 10. 1177/ 87551 23314 540661 (2015).
57. Costa-Giomi, E. & Ryan, C. e benets of music insturction: What remains years later. Symp. Res. Music Behav. 20, 25 (2007).
58. Costa-Giomi, E. e eects of three years of piano instruction on children’s cognitive development. J. Res. Music Educ. 47, 198–212.
https:// doi. org/ 10. 2307/ 33457 79 (1999).
59. Schellenberg, E. G. Long-term positive associations between music lessons and IQ. J. Educ. Psychol. 98, 457–468. https:// doi. org/
10. 1037/ 0022- 0663. 98.2. 457 (2006).
60. Strong, J. V. & Midden, A. Cognitive dierences between older adult instrumental musicians: Benets of continuing to play. Psychol.
Music 48, 67–83. https:// doi. org/ 10. 1177/ 03057 35618 785020 (2020).
61. Skoe, E. & Kraus, N. A little goes a long way: How the adult brain is shaped by musical training in childhood. J. Neurosci. 32,
11507–11510. https:// doi. org/ 10. 1523/ JNEUR OSCI. 1949- 12. 2012 (2012).
62. White-Schwoch, T., Carr, K. W., Anderson, S., Strait, D. L. & Kraus, N. Older adults benet from music training early in life:
Biological evidence for long-term training-driven plasticity. J. Neurosci. 33, 17667–17674. https:// doi. org/ 10. 1523/ JNEUR OSCI.
2560- 13. 2013 (2013).
63. Wennerstrom, A. e Music of Everyday Speech: Prosody and Discourse Analysis (Oxford University Press, 2001).
64. Patel, A. D., Wong, M., Foxton, J., Lochy, A. & Peretz, I. Speech intonation perception decits in musical tone deafness (congenital
amusia). Music Percept. 25, 357–368. https:// doi. org/ 10. 1525/ mp. 2008. 25.4. 357 (2008).
65. Hutchins, S., Gosselin, N. & Peretz, I. Identication of changes along a continuum of speech intonation is impaired in congenital
amusia. Front. Psychol. 1, 236. https:// doi. org/ 10. 3389/ fpsyg. 2010. 00236 (2010).
66. Jiang, C., Hamm, J. P., Lim, V. K., Kirk, I. J. & Yang, Y. Processing melodic contour and speech intonation in congenital amusics
with Mandarin Chinese. Neuropsychologia 48, 2630–2639. https:// doi. org/ 10. 1016/j. neuro psych ologia. 2010. 05. 009 (2010).
67. Liu, F., Patel, A. D., Fourcin, A. & Stewart, L. Intonation processing in congenital amusia: Discrimination, identication and imita-
tion. Brain 133, 1682–1693. https:// doi. org/ 10. 1093/ brain/ awq089 (2010).
68. Monrad-Krohn, G. H. e prosodic quality of speech and its disorders. Acta Psychiatr. Scand. 22, 255–269. https:// doi. org/ 10.
1111/j. 1600- 0447. 1947. tb082 46.x (1947).
69. Zhang, J. & McBride-Chang, C. Auditory sensitivity, speech perception, and reading development and impairment. Educ. Psychol.
Rev. 22, 323–338. https:// doi. org/ 10. 1007/ s10648- 010- 9137-4 (2010).
70. Holliman, A. J. et al. Beginning to disentangle the prosody-literacy relat ionship: A multi-component measure of prosodic sensitiv-
it y. Read. Writ. 27, 255–266. https:// doi. org/ 10. 1007/ s11145- 013- 9443-6 (2014).
71. Groen, M. A., Veenendaal, N. J. & Verhoeven, L. e role of prosody in reading comprehension: Evidence from poor comprehend-
ers. J. Res. Read. 42, 37–57. https:// doi. org/ 10. 1111/ 1467- 9817. 12133 (2019).
72. Cutler, A., Dahan, D. & van Donselaar, W. Prosody in the comprehension of spoken language: A literature review. Lang. Speech
40, 141–201. https:// doi. org/ 10. 1177/ 00238 30997 04000 203 (1997).
73. Hellbernd, N. & Sammler, D. Prosody conveys speaker’s intentions: Acoustic cues for speech act perception. J. Mem. Lang. 88,
70–86. https:// doi. org/ 10. 1016/j. jml. 2016. 01. 001 (2016).
74. Hupp, J. M., Jungers, M. K., Hinerman, C. M. & Porter, B. L. Cup! Cup? Cup: Comprehension of intentional prosody in adults and
children. Cogn. Dev. 57, 100971. https:// doi. org/ 10. 1016/j. cogdev. 2020. 100971 (2021).
75. Campeld, D. E. & Murphy, V. A. e inuence of prosodic input in the second language classroom: Does it stimulate child
acquisition of word order and function words?. Lang. Learn. J. 45, 81–99. https:// doi. org/ 10. 1080/ 09571 736. 2013. 807864 (2017).
76. Saksida, A., Fló, A., Guedes, B., Nespor, M. & Peña Garay, M. Prosody facilitates learning the word order in a new language. Cogni-
tion 213, 104686. https:// doi. org/ 10. 1016/j. cogni tion. 2021. 104686 (2021).
77. Saito, Y. & Saito, K. Dierential eects of instruction on the development of second language comprehensibility, word stress,
rhythm, and intonation: e case of inexperienced Japanese EFL learners. Lang. Teach. Res. 21, 589–608. https:// doi. org/ 10. 1177/
13621 68816 643111 (2017).
78. Yenkimaleki, M. Prosody training benets in perception vs production skills in simultaneous interpreting: An experimental study.
Dutch J. Appl. Linguist. https:// doi. org/ 10. 51751/ dujal 9888 (2021).
79. Wallentin, M., Nielsen, A. H., Friis-Olivarius, M., Vuust, C. & Vuust, P. e Musical Ear Test, a new reliable test for measuring
musical competence. Learn. Indiv. Dier. 20, 188–196. https:// doi. org/ 10. 1016/j. lindif. 2010. 02. 004 (2010).
80. Rothauser, E. H. et al. IEEE recommended practice for speech quality measures. IEEE Trans. Audio Electroacoust. 17, 225–246.
https:// doi. org/ 10. 1109/ TAU. 1969. 11620 58 (1969).
81. Smith, C. L. Prosodic nality and sentence type in French. Lang. Speech 45, 141–178. https:// doi. org/ 10. 1177/ 00238 30902 04500
20301 (2002).
82. Audacity: Free Audio Editor and Recorder v. 2.3.2 (2018).
83. Boersma, P. Praat, a system for doing phonetics by computer. Glot Int. 5, 341–345 (2001).
Content courtesy of Springer Nature, terms of use apply. Rights reserved
11
Vol.:(0123456789)
Scientic Reports | (2023) 13:2657 | https://doi.org/10.1038/s41598-023-29733-3
www.nature.com/scientificreports/
84. Krizman, J., Marian, V., Shook, A., Skoe, E. & Kraus, N. Subcortical encoding of sound is enhanced in bilinguals and relates to
executive function advantages. Proc. Natl. Acad. Sci. USA 109, 7877–7881. https:// doi. org/ 10. 1073/ pnas. 12015 75109 (2012).
85. Mankel, K. & Bidelman, G. M. Inherent auditory skills rather than formal music training shape the neural encoding of speech.
Proc. Natl. Acad. Sci. USA 115, 13129–13134. https:// doi. org/ 10. 1073/ pnas. 18117 93115 (2018).
86. Swaminathan, S. & Schellenberg, E. G. Musical ability, music training, and language ability in childhood. J. Exp. Psychol. Learn.
Mem. Cogn. 46, 2340–2348. https:// doi. org/ 10. 1037/ xlm00 00798 (2020).
Acknowledgements
is study was supported by research grants from the Ministry of Education (MOE), Singapore (MOE2019-
SSRTG-016, MOE-T2EP402A20-0003). We thank Dr. Alice H. D. Chan for her insight on the design of the study.
We are also grateful to all participants for their contribution to the study.
Author contributions
S.T. and G.W. contributed to the conception, design, and implementation of the study. X.T., S.T., G.W., F.L.,
and F.W. performed the statistical analysis and interpreted the data. X.T., S.T., and G.W. wrote dras of the
manuscript. X.T., F.L., and F.W. revised and nalised the manuscript. All authors reviewed and approved the
submitted manuscript.
Additional information
Supplementary Information e online version contains supplementary material available at https:// doi. org/
10. 1038/ s41598- 023- 29733-3.
Correspondence and requests for materials should be addressed to F.C.K.W.
Reprints and permissions information is available at www.nature.com/reprints.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and
institutional aliations.
Open Access is article is licensed under a Creative Commons Attribution 4.0 International
License, which permits use, sharing, adaptation, distribution and reproduction in any medium or
format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the
Creative Commons licence, and indicate if changes were made. e images or other third party material in this
article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the
material. If material is not included in the article’s Creative Commons licence and your intended use is not
permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from
the copyright holder. To view a copy of this licence, visit http:// creat iveco mmons. org/ licen ses/ by/4. 0/.
© e Author(s) 2023
Content courtesy of Springer Nature, terms of use apply. Rights reserved
1.
2.
3.
4.
5.
6.
Terms and Conditions
Springer Nature journal content, brought to you courtesy of Springer Nature Customer Service Center GmbH (“Springer Nature”).
Springer Nature supports a reasonable amount of sharing of research papers by authors, subscribers and authorised users (“Users”), for small-
scale personal, non-commercial use provided that all copyright, trade and service marks and other proprietary notices are maintained. By
accessing, sharing, receiving or otherwise using the Springer Nature journal content you agree to these terms of use (“Terms”). For these
purposes, Springer Nature considers academic use (by researchers and students) to be non-commercial.
These Terms are supplementary and will apply in addition to any applicable website terms and conditions, a relevant site licence or a personal
subscription. These Terms will prevail over any conflict or ambiguity with regards to the relevant terms, a site licence or a personal subscription
(to the extent of the conflict or ambiguity only). For Creative Commons-licensed articles, the terms of the Creative Commons license used will
apply.
We collect and use personal data to provide access to the Springer Nature journal content. We may also use these personal data internally within
ResearchGate and Springer Nature and as agreed share it, in an anonymised way, for purposes of tracking, analysis and reporting. We will not
otherwise disclose your personal data outside the ResearchGate or the Springer Nature group of companies unless we have your permission as
detailed in the Privacy Policy.
While Users may use the Springer Nature journal content for small scale, personal non-commercial use, it is important to note that Users may
not:
use such content for the purpose of providing other users with access on a regular or large scale basis or as a means to circumvent access
control;
use such content where to do so would be considered a criminal or statutory offence in any jurisdiction, or gives rise to civil liability, or is
otherwise unlawful;
falsely or misleadingly imply or suggest endorsement, approval , sponsorship, or association unless explicitly agreed to by Springer Nature in
writing;
use bots or other automated methods to access the content or redirect messages
override any security feature or exclusionary protocol; or
share the content in order to create substitute for Springer Nature products or services or a systematic database of Springer Nature journal
content.
In line with the restriction against commercial use, Springer Nature does not permit the creation of a product or service that creates revenue,
royalties, rent or income from our content or its inclusion as part of a paid for service or for other commercial gain. Springer Nature journal
content cannot be used for inter-library loans and librarians may not upload Springer Nature journal content on a large scale into their, or any
other, institutional repository.
These terms of use are reviewed regularly and may be amended at any time. Springer Nature is not obligated to publish any information or
content on this website and may remove it or features or functionality at our sole discretion, at any time with or without notice. Springer Nature
may revoke this licence to you at any time and remove access to any copies of the Springer Nature journal content which have been saved.
To the fullest extent permitted by law, Springer Nature makes no warranties, representations or guarantees to Users, either express or implied
with respect to the Springer nature journal content and all parties disclaim and waive any implied warranties or warranties imposed by law,
including merchantability or fitness for any particular purpose.
Please note that these rights do not automatically extend to content, data or other material published by Springer Nature that may be licensed
from third parties.
If you would like to use or distribute our Springer Nature journal content to a wider audience or on a regular basis or in any other manner not
expressly permitted by these Terms, please contact Springer Nature at
onlineservice@springernature.com
... Researchers can examine the differential effects of Western and Chinese music training on level and contour tone perception. Lastly, researchers have started to distinguish between professional musicians and amateur musicians (Oechslin et al., 2013;Rogenmoser et al., 2017;Schneider et al., 2002) and enduring musicians and former musicians (Toh et al., 2023). These fine distinctions prompt nuanced investigations of music-to-language transfer. ...
... Our study further suggests that the choice of musical instrument may matter, at least for lexical tone perception. In addition to musical instrument, there are many more ways to decipher musicianship (e.g., Oechslin et al., 2013;Rogenmoser et al., 2017;Toh et al., 2023). Deciphering musicianship can certainly advance the theoretical understanding of music-to-language transfer and its practical application. ...
Article
Full-text available
Different musical instruments have different pitch processing demands. However, correlational studies have seldom considered the role of musical instruments in music-to-language transfer. Addressing this research gap could contribute to a nuanced understanding of music-to-language transfer. To this end, we investigated whether pitched musicians had a unique musical advantage in lexical tone perception relative to unpitched musicians and nonmusicians. Specifically, we compared Cantonese pitched musicians, unpitched musicians, and nonmusicians on Thai tone discrimination and sequence recall. In the Thai tone discrimination task, the pitched musicians outperformed the unpitched musicians and the nonmusicians. Moreover, the unpitched musicians and the nonmusicians performed similarly. In the Thai tone sequence recall task, both pitched and unpitched musicians recalled level tone sequences more accurately than the nonmusicians, but the pitched musicians showed the largest musical advantage. However, the three groups recalled contour tone sequences with similar accuracy. Collectively, the pitched musicians had a unique musical advantage in lexical tone discrimination and the largest musical advantage in level tone sequence recall. From a theoretical perspective, this study offers correlational evidence for the Precision element of the OPERA hypothesis. The choice of musical instrumental may matter for music-to-language transfer in lexical tone discrimination and level tone sequence recall.
... During speech processing, the pitch has hyper-linguistic characteristics that provide information on emotion and intent [87] as well as linguistic characteristics. Musicians outperform non-musicians [13,22,65,69,84,[88][89][90][91][92][93][94]. This advantage was hypothesized to be a contributing factor in better speech-in-noise perception found in musicians [22]. ...
Article
Full-text available
Μusicians are reported to have enhanced auditory processing. This study aimed to assess auditory perception in Greek musicians with respect to their musical specialization and to compare their auditory processing with that of non-musicians. Auditory processing elements evaluated were speech recognition in babble, rhythmic advantage in speech recognition, short-term working memory, temporal resolution, and frequency discrimination threshold detection. All groups were of 12 participants. Three distinct experimental groups tested included western classical musicians, Byzantine chanters, and percussionists. The control group consisted of 12 non-musicians. The results revealed: (i) a rhythmic advantage for word recognition in noise for classical musicians (M = 12.42) compared to Byzantine musicians (M = 9.83), as well as for musicians compared to non-musicians (U = 120.50, p = 0.019), (ii) better frequency discrimination threshold of Byzantine musicians (M = 3.17, p = 0.002) compared to the other two musicians’ group for the 2000 Hz region, (iii) statistically significant better working memory for musicians (U = 123.00, p = 0.025) compared to non-musicians. Musical training enhances elements of auditory processing and may be used as an additional rehabilitation approach during auditory training, focusing on specific types of music for specific auditory processing deficits.
... During speech processing the pitch has hyper-linguistic characteristics that provide information on emotion and intent [88] as well as linguistic characteristics. Musicians outperform non-musicians [11,20,66,70,85,[89][90][91][92][93][94]96]. This advantage was hypothesized to be be a contributing factor in better speech-in-noise perception found in musicians [20]. ...
Preprint
Full-text available
Better auditory processing of musicians is observed in previous research. As musicians differentiate their practice method and performance environment, we aimed to assess auditory perception in Greek musicians with respect to their musical specialization. If there are differences, this may provide a basis for better shaping auditory training in individuals with auditory processing disorder. The auditory tests administered were speech in noise (Speech in Babble), with and without rhythmic advantage (Word Recognition—Rhythm Component), short-term and working memory (Digit Span - Forward and Backwards), temporal resolution (Gaps In Noise) and detection of frequency discrimination threshold (DFL). Groups consisted of classical musicians, Byzantine chanters, percussionists, and non-musicians (12 participants/group). Statistical analysis revealed significant difference in: (i) word recognition in noise with precursor synchronized pulse between classical musicians compared to Byzantine musicians, (ii) better frequency discrimination of Byzantine musicians compared to non-musicians for the 2000Hz region and (iii) working memory, an advantage detected in musicians. Considering all the above, we conclude that musicians have a superior auditory perception, regardless of musical specialization. Musical training enhances elements of auditory processing and may be used as an additional rehabilitation during auditory training, focusing on specific types of music for specific auditory processing deficits.
Article
Music training is generally assumed to improve perceptual and cognitive abilities. Although correlational data highlight positive associations, experimental results are inconclusive, raising questions about causality. Does music training have far-transfer effects, or do preexisting factors determine who takes music lessons? All behavior reflects genetic and environmental influences, but differences in emphasis—nature versus nurture—have been a source of tension throughout the history of psychology. After reviewing the recent literature, we conclude that the evidence that music training causes nonmusical benefits is weak or nonexistent, and that researchers routinely overemphasize contributions from experience while neglecting those from nature. The literature is also largely exploratory rather than theory driven. It fails to explain mechanistically how music-training effects could occur and ignores evidence that far transfer is rare. Instead of focusing on elusive perceptual or cognitive benefits, we argue that it is more fruitful to examine the social-emotional effects of engaging with music, particularly in groups, and that music-based interventions may be effective mainly for clinical or atypical populations. Expected final online publication date for the Annual Review of Psychology, Volume 75 is January 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Article
Full-text available
This study sought to understand the effects of tone language repertoire and musical experience on nonnative lexical tone perception and production. Thirty-one participants completed a tone discrimination task, an imitation task, and a musical abilities task. Results showed that a larger tone language repertoire and musical experience both enhanced tone discrimination performance. However, the effects were not additive, as musical experience was associated with tone discrimination performance for single-tone language speakers, but such association was not seen for dual-tone language speakers. Furthermore, among single-tone language speakers, but not among dual-tone language speakers, musical experience and musical aptitude positively correlated with tone discrimination accuracy. It is thus concluded that individuals with varying extents of tone language experience may adopt different strategies when performing tone discrimination tasks; single-tone language speakers may draw on their musical expertise while dual-tone language speakers may rely on their extensive tone language experience instead.
Article
Full-text available
Purpose: This study investigates how Cantonese language experience influences the potential effects of (i) musicianship and (ii) musical ability on English stress perception. Method: The sample contained 124 participants, evenly split into Cantonese musician, Cantonese non-musician, English musician, and English non-musician groups. They completed the English stress discrimination task, English stress sequence recall task, Musical Ear Test, and non-verbal intelligence task. Following the musicianship-based analysis, 44 Cantonese and English listeners were re-assigned to four groups based on their musical ability—Cantonese high musical ability, Cantonese low musical ability, English high musical ability, and English low musical ability groups. Results: Musicianship-based analysis on English stress perception revealed a significant interaction between musicianship and language. Specifically, musicians outperformed non-musicians only among the English but not the Cantonese listeners. By contrast, ability-based analysis showed significant main effects of musical ability and language. For both Cantonese and English listeners, those with a high musical ability outperformed those with a low musical ability. Regardless of musical ability, Cantonese listeners outperformed English listeners. Correlational analyses yielded consistent findings. Conclusions: This study has found cross-sectional evidence that musical ability, but not musicianship, facilitates Cantonese English as a second language (ESL) listeners’ English stress perception. From a theoretical perspective, the current findings motivate two potential additions to the OPERA hypothesis for music-to-language transfer—unsaturation and utilization. Practically, the findings cast doubt on the application of non-perceptual based instrumental music training to enhance Cantonese ESL learners’ perceptual learning of English stress.
Article
Full-text available
It is often claimed that music training improves auditory and linguistic skills. Results of individual studies are mixed, however, and most evidence is correlational, precluding inferences of causation. Here, we evaluated data from 62 longitudinal studies that examined whether music training programs affect behavioral and brain measures of auditory and linguistic processing (N = 3928). For the behavioral data, a multivariate meta-analysis revealed a small positive effect of music training on both auditory and linguistic measures, regardless of the type of assignment (random vs. non-random), training (instrumental vs. non-instrumental), and control group (active vs. passive). The trim-and-fill method provided suggestive evidence of publication bias, but meta-regression methods (PET-PEESE) did not. For the brain data, a narrative synthesis also documented benefits of music training, namely for measures of auditory processing and of speech and prosody processing. Thus, the available literature provides evidence that music training produces small neurobehavioral enhancements in auditory and linguistic processing, although future studies are needed to confirm that such enhancements are not due to publication bias.
Article
Full-text available
The present study investigates the prosody training benefits for interpreter trainees in perception vs. production skills in simultaneous interpreting. Two groups of student interpreters were formed. Participants were assigned to groups at random. The control group received routine instruction in interpreting skills. The experimental group spent 20 minutes less time per session on the routine curriculum and instead received awareness training on prosodic features of English. The total instruction time was the same for the students in two groups, i.e., 15 hours. Students then took a posttest in interpretation skills. The results showed that the experimental group performed better than the control group in simultaneous interpretation performance. Moreover, the study revealed that prosody training enhances the students' perception skills more than that of the production skills. These results have pedagogical implications for curriculum designers, interpreter training programs, and all who are involved in language study and pedagogy.
Article
Full-text available
Musical experience facilitates speech perception. French musicians, to whom stress is foreign, have been found to perceive English stress more accurately than French non-musicians. This study investigated whether this musical advantage also applies to native listeners. English musicians and non-musicians completed an English stress discrimination task and two control tasks. With age, non-verbal intelligence and short-term memory controlled, the musicians exhibited a perceptual advantage relative to the non-musicians. This perceptual advantage was equally potent to both trochaic and iambic stress patterns. In terms of perceptual strategy, the two groups showed differential use of acoustic cues for iambic but not trochaic stress. Collectively, the results could be taken to suggest that musical experience enhances stress discrimination even among native listeners. Remarkably, this musical advantage is highly consistent and does not particularly favour either stress pattern. For iambic stress, the musical advantage appears to stem from the differential use of acoustic cues by musicians. For trochaic stress, the musical advantage may be rooted in enhanced durational sensitivity.
Article
Full-text available
This study investigates whether L2 Mandarin learners can generalize experience with Mandarin tones to unfamiliar tones (i.e., Thai). Three language groups – L1 English/ L2 Mandarin learners (n=18), L1 Mandarin speakers (n=30), L1 monolingual English speakers (n=23) – were tested on the perception of unfamiliar Thai tones on ABX tasks. L2 Mandarin learners and L1 Mandarin speakers perceived Thai tones more accurately than L1 English non-learners. Mandarin learners L1 speakers showed priming on Mandarin tones on a lexical decision task with repetition priming, suggesting L2 tones had been encoded within lexical representations of L2 Mandarin words. However, results must be interpreted cautiously, with an absence of expected priming and presence of unexpected priming. In sum, learners can transfer L2 tone experience to unfamiliar tones, expanding the Feature Hypothesis (McAllister, Flege, & Piske, 2002) to include L2 influence as well. In addition, results indicate a potential disconnect between perception and encoding. Keywords: tone, third language, perception, phonolexical encoding Address
Article
Full-text available
Music expertise has been shown to enhance emotion recognition from speech prosody. Yet, it is currently unclear whether music training enhances the recognition of emotions through other communicative modalities such as vision and whether it enhances the feeling of such emotions. Musicians and nonmusicians were presented with visual, auditory, and audiovisual clips consisting of the biological motion and speech prosody of two agents interacting. Participants judged as quickly as possible whether the expressed emotion was happiness or anger, and subsequently indicated whether they also felt the emotion they had perceived. Measures of accuracy and reaction time were collected from the emotion recognition judgements, while yes/no responses were collected as indication of felt emotions. Musicians were more accurate than nonmusicians at recognizing emotion in the auditory-only condition, but not in the visual-only or audiovisual conditions. Although music training enhanced recognition of emotion through sound, it did not affect the felt emotion. These findings indicate that emotional processing in music and language may use overlapping but also divergent resources, or that some aspects of emotional processing are less responsive to music training than others. Hence music training may be an effective rehabilitative device for interpreting others’ emotion through speech.
Article
One of the prominent ideas developed by Jacques Mehler and his colleagues was that perceptual tuning, present from birth on, enables infants, and language learners in general, to extract regularities from speech input. Here we discuss language learners'' ability to extract basic word order (VO or OV) structure from prosodic regularities in a language. The two are closely related: in phonological phrases of VO languages, the most prominent word is the rightmost one, and in OV languages, it is the leftmost one. In speech, this prominence is realized as extended duration, or as elevated pitch, sometimes combined with changes in intensity. When learning the first (L1) or the second language (L2), exposure to relevant rhythmic structure elicits implicit learning about syntactic structure, including the basic word order. However, it remains unclear whether triggering the learning process requires a certain level of familiarity with the relevant rhythm. It is moreover unknown whether prosodic information can help L2 learners to extract and learn the vocabulary of a new language. We tested Spanish- and Italian-speaking adults' ability to learn words from an artificial language with either non-native OV or native VO word order. The results show that learners used prosodic information to identify the most prominent words in short utterances when the artificial language was similar to the native language, with duration-based prominence in prosody and a VO word order. In contrast, when the artificial language had a non-native prominence marked by pitch alternations and an OV word order, prominent words were learned only after a three-day exposure to the relevant rhythmic structure. Thus, for adult L2 learners, only repeated exposure to the relevant prosody elicited learning new words from an unknown language with non-native prosodic marking, indicating that, with familiarity, prosodic cues can facilitate learning in L2.
Article
Many studies have reported a musical advantage in perceiving lexical tones among non-native listeners, but it is unclear whether this advantage also applies to native listeners, who are likely to show ceiling-like performance and thus mask any potential musical advantage. The ongoing tone merging phenomenon in Hong Kong Cantonese provides a unique opportunity to investigate this as merging tone pairs are reported to be difficult to differentiate even among native listeners. In the present study, native Cantonese musicians and non-musicians were compared based on discrimination and identification of merging Cantonese tone pairs to determine whether a musical advantage in perception will be observed, and if so, whether this is seen on the phonetic and/or phonological level. The tonal space of the subjects' lexical tone production was also compared. Results indicated that the musicians outperformed the non-musicians on the two perceptual tasks, as indexed by a higher accuracy and faster reaction time, particularly on the most difficult tone pair. In the production task, however, there was no group difference in various indices of tonal space. Taken together, musical experience appears to facilitate native listeners' perception, but not production, of lexical tones, which partially supports a music-to-language transfer effect.
Article
Prosody is the way something is spoken. Adults and children regularly use prosody in language comprehension and production, and much of this research focuses solely on emotion or on syntactic interpretation. The current study focuses on comprehension of intentional prosody devoid of semantic information as indicators of different types of speech acts. Adults (n = 72) and preschool children (n = 72) were asked to identify the referent of an isolated label (familiar or nonsense words) based solely on the varying prosodic information. Labels were spoken with intentional prosody (warning, doubting, naming) such that their prosody implied a particular intended referent. Adults and preschoolers selected the intended referent at above chance levels of performance for both familiar and nonsense words and for all three intentions, with adults performing faster and more accurately than children. Overall, this research demonstrates that preschool children and adults can use prosody to successfully determine the intended referent behind words and nonsense words, even if they are in isolation. This finding suggests that prosody alone conveys certain aspects of meaning during the preschool years.