ArticlePDF Available

Enduring musician advantage among former musicians in prosodic pitch perception

February 2023
Scientific Reports 13(1)

February 2023
13(1)

DOI:10.1038/s41598-023-29733-3

License
CC BY 4.0

Authors:

Xin Ru Toh

Nanyang Technological University

Fun Lau

Nanyang Technological University

Show all 5 authorsHide

Musical training has been associated with various cognitive benefits, one of which is enhanced speech perception. However, most findings have been based on musicians taking part in ongoing music lessons and practice. This study thus sought to determine whether the musician advantage in pitch perception in the language domain extends to individuals who have ceased musical training and practice. To this end, adult active musicians (n = 22), former musicians (n = 27), and non-musicians (n = 47) were presented with sentences spoken in a native language, English, and a foreign language, French. The final words of the sentences were either prosodically congruous (spoken at normal pitch height), weakly incongruous (pitch was increased by 25%), or strongly incongruous (pitch was increased by 110%). Results of the pitch discrimination task revealed that although active musicians outperformed former musicians, former musicians outperformed non-musicians in the weakly incongruous condition. The findings suggest that the musician advantage in pitch perception in speech is retained to some extent even after musical training and practice is discontinued.

Fundamental frequency (F0 in Hz) for a sample sentence in the three prosodic conditions.

…

Pitch discrimination accuracy of active musicians, former musicians, and non-musicians in the three prosodic conditions. Error bars denote standard error. *p < 0.05, ***p < 0.001.

…

Pitch discrimination accuracy for the native language English and non-native language French in the three prosodic conditions. Error bars denote standard error. **p < 0.01, ***p < 0.001.

…

Figures - available from: Scientific Reports

This content is subject to copyright. Terms and conditions apply.

Access to this full-text is provided by Springer Nature.

Learn more

Content available from Scientific Reports

This content is subject to copyright. Terms and conditions apply.

Vol.:(0123456789)

Scientic Reports | (2023) 13:2657 | https://doi.org/10.1038/s41598-023-29733-3

www.nature.com/scientificreports

Enduring musician advantage

among former musicians

in prosodic pitch perception

Xin Ru Toh

1, Shen Hui Tan

1, Galston Wong

2, Fun Lau

1 & Francis C. K. Wong 1*

Musical training has been associated with various cognitive benets, one of which is enhanced speech

perception. However, most ndings have been based on musicians taking part in ongoing music

lessons and practice. This study thus sought to determine whether the musician advantage in pitch

perception in the language domain extends to individuals who have ceased musical training and

practice. To this end, adult active musicians (n = 22), former musicians (n = 27), and non-musicians

(n = 47) were presented with sentences spoken in a native language, English, and a foreign language,

French. The nal words of the sentences were either prosodically congruous (spoken at normal

pitch height), weakly incongruous (pitch was increased by 25%), or strongly incongruous (pitch was

increased by 110%). Results of the pitch discrimination task revealed that although active musicians

outperformed former musicians, former musicians outperformed non-musicians in the weakly

incongruous condition. The ndings suggest that the musician advantage in pitch perception in speech

is retained to some extent even after musical training and practice is discontinued.

Musical training has been associated with various cognitive enhancements1, making it an attractive enrich-

ment and intervention activity. In the language domain, one notable nding is that musical training is linked

to an advantage in speech perception. For instance, musicians are better than non-musicians at perceiving

speech in noisy conditions even in older adulthood2,3. In particular, a vast amount of literature has documented

positive music-to-language cross-domain transfer eects in pitch processing, which undergirds Patel’s OPERA

hypothesis4. e OPERA hypothesis describes how musical training benets the neural encoding of speech when

ve requirements are satised: there is an overlap in the brain networks employed to process an acoustic feature

common to music and speech; the processing of the shared acoustic feature occurs at higher precision in music

than in speech; and the musical activities evoke strong positive emotion, have frequent repetition, and encompass

focused attention. e OPERA hypothesis may account for the superior pitch processing abilities in speech seen

in musically trained individuals, as pitch is a basic acoustic property found in both music and speech. While pitch

dierences are used to form melodies in music, they are used to convey contrastive meaning via lexical tones,

stress, and intonation in speech. To augment the OPERA hypothesis, this study seeks to explicate whether the

musician advantage persists in former musicians who have ceased musical training and practice.

e OPERA hypothesis is well supported by empirical studies comparing musicians and non-musicians. Stud-

ies have found that among individuals with no tone language experience, musicians outperform non-musicians

in lexical tone perception5–15. In addition, musicians without tone language experience show enhanced brainstem

and cortical encoding when listening to lexical tones14,16,17. Yet, it is uncertain whether the musician advantage

in lexical tone perception also exists among tone language speakers. It was previously found that for English or

French speakers, musicians outperform non-musicians in Cantonese tone discrimination, whereas for Cantonese

speakers, musicians and non-musicians both show ceiling eects18. On the other hand, it was also found that

Cantonese musicians outperform Cantonese non-musicians in the discrimination and identication of merging

Cantonese tone pairs, especially the most dicult Tone 2/Tone 5 contrast19. More recently, Toh etal.20 found that

even among speakers of a tone language, those who have received musical training outperform non-musicians

in non-native lexical tone perception. Apart from lexical tone perception, studies have found that among indi-

viduals with no tone language experience, musicians are better than non-musicians at perceiving stress, which is

indicated by a combination of pitch, duration, and intensity variations21,22. However, it remains unclear whether

the musician advantage in stress perception also applies to tone language speakers. Among English speakers,

musicians outperform non-musicians in English stress perception, while Cantonese-English bilingual musicians

OPEN

1Linguistics and Multilingual Studies, School of Humanities, Nanyang Technological University, Singapore,

Singapore. 2School of Brain and Behavioral Sciences, The University of Texas at Dallas, Dallas, TX, USA. *email:

franciswong@ntu.edu.sg

Content courtesy of Springer Nature, terms of use apply. Rights reserved

Vol:.(1234567890)

Scientic Reports | (2023) 13:2657 | https://doi.org/10.1038/s41598-023-29733-3

www.nature.com/scientificreports/

and non-musicians perform equally well23. Tone language experience has been linked to enhanced pitch percep-

tion abilities in speech6,24–30. As such, one possible explanation for the conicting ndings is that the musician

advantage for pitch perception in speech applies across speakers irrespective of language background, but the

more subtle eect among tone language speakers is likely to be masked by ceiling-like performance in tasks that

are not suciently sensitive.

Besides lexical tones and stress, another aspect in which there is mounting evidence for the musician advan-

tage is prosody. A series of studies have consistently found that musicians outperform non-musicians in detecting

pitch contour expectancy violations31–35. is research paradigm, rst created by Schön etal.31, is designed by

manipulating the fundamental frequency of either the nal notes of musical phrases or nal words of linguistic

phrases. In particular, the weakly incongruous condition entails a small pitch change which is dicult to detect,

and hence evaluates pitch perception in a more ne-grained manner. rough both behavioural and electrophysi-

ological measures, they found that adult musicians detected these pitch variations better than non-musicians

in not only music but also their native language, thereby lending support for a domain-general pitch processing

mechanism. is nding was reinforced in follow-up cross-sectional and longitudinal studies32,33, in which they

found similar group dierences among 8-year-old children, despite the fact that the children musicians received

a shorter duration of musical training than the adult musicians in the original study. eir nding was also

expanded in follow-up studies introducing unfamiliar language contexts34,35, in which they found that partici-

pants across groups found it more dicult to detect pitch changes in a non-native language or pseudolanguage

than in their native language. e researchers posited that understanding the semantic content and being familiar

with intonational contours in sentences might help with anticipating and detecting pitch changes in one’s native

language. at said, the researchers found that musicians held an advantage over non-musicians in detecting

prosodic pitch violations across native and non-native language contexts. Moreover, behavioural studies have

found that musicians outperform non-musicians in matching spoken utterances to their intonation melodies36

and identifying emotional prosody in speech37–39. Interestingly, similar results were seen in a longitudinal study

with 6-year-old children, with those who were randomly assigned to receive 1year of musical training in the

form of keyboard or vocal lessons outperforming those who received no lessons when tested on the identica-

tion of emotional prosody in speech37. Collectively, these studies substantiate the notion that musical training

facilitates speech perception at not only the segmental but also supra-segmental level.

Furthermore, neurological studies suggest that musical training is linked to structural and functional dif-

ferences in the brain40–50. Notably, the eects of musical training on brain development seem to be causal in

nature33,45–47. For instance, Hyde etal.46,47 randomly assigned 6-year-old children without any behavioural or

brain dierences in pre-tests to receive either 15months of musical training or no training. ey found that only

those who received musical training showed structural brain changes in motor and auditory areas which were

correlated with behavioural improvements on melodic and rhythmic discrimination tests. ese studies suggest

that there may be musical training-induced brain plasticity eects that could potentially translate to long-lasting

cognitive impacts. While the data on ageing and musicianship remains scant, there is emerging evidence that

an age-related decline in auditory perception may be mitigated by musical training among lifelong musicians

who maintain regular musical practice. Older and younger adult musicians outperform non-musicians in vari-

ous auditory processing abilities, such as detecting speech-in-noise and mistuned harmonics, assessed using

neurophysiological and behavioural measures51–53.

In light of the above ndings, musical training does appear to facilitate speech perception, providing empiri-

cal evidence for the OPERA hypothesis4. A critical question to consider is whether the OPERA hypothesis can

be extended to former musicians. Studies on the eects of musical training typically characterise musicians as

individuals with ≥ 6years of musical training and ongoing instrumental practice for ≥ 1h a week54. However,

such professional musicians may not represent the general population in which many individuals who take

up music lessons in childhood eventually do not commit to it55,56. Although there has been extensive research

on professional active musicians, more research needs to be done with individuals who choose not to pursue

musicianship professionally but nonetheless have had some musical experience. Of particular interest is whether

cognitive benets such as in speech perception persist even aer musical training and practice is discontinued.

Qualifying the extent of the inuence of musical training among individuals who have undergone music attri-

tion will serve to not only provide insight on the generalisability of the OPERA hypothesis, but also inform the

eectiveness of musical training as a means of improving cognitive and linguistic abilities in the long-term, as

well as protecting against age-related cognitive decline.

As noted by Costa-Giomi56, few studies to date have investigated whether cognitive advantages exist in the

long term aer musical training and practice is discontinued. Costa-Giomi and Ryan58 (as cited in Costa-Giomi57)

conducted a longitudinal study in which children in the experimental group received 3years of piano lessons.

Seven years aer musical training was discontinued, the researchers found no dierences in IQ or memory

between the adults who had and had not received childhood musical training, suggesting that musical training

does not result in permanent cognitive benets. Nevertheless, the researchers postulated that the lack of long-

lasting cognitive improvements may have been due to low attendance and time spent practising the musical

instrument55,58. In contrast, two behavioural studies found improved performance in various cognitive tasks

such as IQ59 and executive functions60 in adulthood even aer musical training and practice had ceased, sug-

gesting that musical training has long-term benets and contributes to the establishment of a cognitive reserve.

However, the measures used in these behavioural studies have focused on general cognitive abilities rather than

speech perception abilities specically.

In terms of auditory perception, two brain imaging studies havefound that musical training in early child-

hood provide sustained enhanced neural processing of auditory stimuli in adulthood aer musical training and

practice had ceased. Skoe and Kraus61 found that young adults who had received musical training in childhood

showed more robust signal-to-noise ratio brainstem responses to pure tones, as compared to non-musicians.

Content courtesy of Springer Nature, terms of use apply. Rights reserved

Vol.:(0123456789)

Scientic Reports | (2023) 13:2657 | https://doi.org/10.1038/s41598-023-29733-3

www.nature.com/scientificreports/

White-Schwoch etal.62 found that older adults with a greater number of years of musical training in childhood

or young adulthood showed faster neural timing in response to consonant–vowel transitions in speech syllables

presented in quiet and noise, compared to older adults with fewer number of years of musical training or no

musical training at all. Although these two brain imaging studies suggest that music-related neuroplasticity is

maintained even aer music attrition, studies have yet to investigate if these neural traces translate to a clear

behavioural advantage in acoustic processing of speech stimuli. is is a critical research gap that the present

study aims to bridge.

In sum, although the literature has generally established that professional musicians have an advantage over

non-musicians in pitch perception abilities in the language domain, it remains inconclusive whether this musi-

cian advantage would also be observed behaviourally among individuals who have ceased musical training and

practice. at being the case, the overarching aim of our study is to add to the OPERA hypothesis and elucidate

whether a potential music-to-language transfer eect exists among former musicians. To this end, our study

compared active musicians, former musicians, and non-musicians in their ability to perform a well-replicated

experimental task—detecting linguistic prosodic pitch violations.

Although there has been a burgeoning number of studies revolving around various types of pitch percep-

tion in speech, such as lexical tones and stress, we were theoretically motivated to focus on prosody for several

reasons. Firstly, prosody is oen described as “the music of speech”63, thereby making it an obvious candidate for

the present study on music-to-language transfer. Patel himself has called attention to the fact that both melody

in music and prosody in speech rely primarily on the same acoustic parameter of pitch contour, with the former

necessitating more precise acoustic processing than the latter64. is overlap in neural resources has been dem-

onstrated in the studies outlined above, in which musicians tend to surpass non-musicians in prosodic pitch

perception. On top of that, Patel and other researchers have shown that individuals on the other end of the spec-

trum with a musical disorder known as amusia exhibit decits in perceiving speech prosody64–67. Accordingly,

speech prosody is of exceptional relevance to the OPERA hypothesis. Secondly, unlike lexical tones which are

only of pertinence to tone languages, speech prosody is an important aspect of all languages, thereby making it a

universal topic of interest with great practical signicance. Broadly speaking, prosody signals speaker intention

and meaning, imparting crucial information pertaining to syntax and pragmatics68. Research on rst language

acquisition has found that prosodic sensitivity is related to literacy skills69, reading comprehension70,71, and speech

comprehension72–74. In a similar vein, research on second language acquisition in children and adults has found

that prosodic sensitivity might facilitate the learning of word order and new vocabulary75,76, while exposure to

prosodic features of the target language apparently improves second language prociency and uency77,78. e

ndings yielded from this study will therefore have important pedagogical implications for language and literacy

skills as well as foreign language learning.

In order to study prosodic pitch perception, we chose to adopt the well-replicated prosodic pitch contour

expectancy violation task, as it has consistently demonstrated the musician advantage in dierent age groups and

languages with robust ndings. Given that previous studies revealed a trend in which participants, regardless of

musicianship, showed superior performance in detecting prosodic pitch violations in a familiar language rela-

tive to an unfamiliar language34,35, two dierent language contexts were implemented in the present study. We

included a non-native language context in part to help circumvent a problem we anticipated; namely, that tasks

using native language stimuli might not be adequately sensitive to tease apart group dierences18,23, especially for

tone language speakers. Furthermore, by introducing both native and non-native language contexts, we hoped

to examine music-to-language transfer eects both with and without the top-down inuence from other types

of linguistic processing, allowing us to better assess the generalisability of the eects. Finally, the two language

contexts mirror rst language competence and second language learning respectively, shedding light on the

practical application of the enduring music-to-language transfer eects in former musicians, if any.

Method

Participants. Participants were recruited to take part in the study via an online screening questionnaire.

ey were between 19 and 42years old (M = 23.04, SD = 3.90), with normal hearing based on an audiometric test

(25dB HL for octave frequencies from 500 to 4000Hz). All of the participants were either native Singaporeans

or had lived in Singapore for at least 10years to ensure that they were familiar with the local accented variety of

English. ey had no formal exposure to the French language, the non-native speech stimuli used in this study.

A total of 127 individuals participated in this study. Data from 31 participants was excluded due to the fol-

lowing cases: (a) participants with self-reported exposure to French (n = 8); (b) participants who had between

2and 6years of musical training experience (n = 23).

e nal dataset consisted of 96 participants. ey were classied into three groups based on information

obtained from a self-report questionnaire on their language and music background. In this study, active musi-

cians consisted of those who had had at least 6years of musical training and were still currently maintaining

a consistent practice schedule of at least 3h per week in the past 2years (n = 22). On the other hand, former

musicians referred to those who similarly had at least 6years of musical training but had stopped maintaining a

regular practice schedule for at least 2years (n = 27). Finally, non-musicians referred to those who had had less

than 2years of musical training (n = 47). ose with musical training predominantly had experience in string,

wind, and vocal musical training. None of the participants were musicians by profession. Reecting the diver-

sity of multilingualism in the local population, the majority of the participants were procient in English and

Mandarin Chinese (n = 86), while several were procient in English and a second language other than Mandarin

Chinese, specically Malay (n = 3), Tamil (n = 5), Tagalog (n = 1), and Burmese (n = 1). e representation of

non-Mandarin Chinese speakers was similar across groups, χ2(2) = 2.355, p = 0.308.

Content courtesy of Springer Nature, terms of use apply. Rights reserved

Vol:.(1234567890)

Scientic Reports | (2023) 13:2657 | https://doi.org/10.1038/s41598-023-29733-3

www.nature.com/scientificreports/

To validate the grouping, participants’ general musical abilities were assessed using the Musical Ear Test

(MET)79. e MET consisted of two components: the melody subtest and the rhythm subtest. For each subtest,

participants listened to 52 pairs of phrases, and had to judge whether the second phrase was the “same” or “dif-

ferent” compared to the rst phrase. Half of the trials were “same” trials and the other half were “dierent” trials.

e “dierent” trials involved a pitch violation in the melody subtest and a rhythmic change in the rhythm subtest.

e MET stimuli were delivered via headphones, and participants gave their responses on an accompanying

answer sheet. All participants completed the melody subtest followed by the rhythm subtest. Table1 shows the

nal sample and descriptive information of each participant group.

Materials and procedures

e research procedures were approved by the Institutional Review Board at the Nanyang Technological Univer-

sity. All research methods were performed in accordance with the relevant guidelines and regulations. Written

informed consent was obtained from all participants and/or their legal guardians before participation.

Aer providing their written informed consent, participants were seated comfortably in a soundproof booth,

and undertook two experimental tasks. Firstly, the participants completed a two-choice speech pitch discrimi-

nation task. e English and French language blocks were counterbalanced across participants, with half of the

participants presented with the English set rst, and the other half with the French set rst. Secondly, the partici-

pants completed a general musical abilities test, i.e., the MET. Short breaks were given between tasks to prevent

fatigue. e total length of time for participation was approximately 1h, and the participants were monetarily

compensated for their time upon successful completion of the experiment.

Participants’ linguistic perception abilities were assessed using a pitch violation discrimination task that has

been well-replicated in the literature31–35. For the pitch discrimination task, 40 spoken declarative sentences in

English and French respectively were recorded to form the experimental speech stimuli (see Supplementary

Tables1 and 2). e sentences were compiled and modied from a combination of sources, including the Har-

vard Sentences database80 for the English stimuli and Smith’s paper81 for the French stimuli, with the nal word

in each sentence being disyllabic as in Marques etal.’s study34. Two female speakers, one native in Singapore

English and the other in French, voiced the English and French sentences respectively at a normal speaking rate.

e recorded sentences were then digitised (sampling at 44.1kHz and 16 bit) using Audacity® Version 2.0.5.082.

For each language, there were three dierent auditory conditions, and 40 sentences were presented in each

auditory condition, thus leading to a total of 120 sentences. e nal word of each sentence was either prosodi-

cally congruous, weakly incongruous, or strongly incongruous. In the prosodically incongruous conditions, the

pitch (F0) of the nal words was increased using Praat83, such that there was a local pitch manipulation on the

nal words (+ 25% in the weakly incongruous condition, + 110% in the strongly incongruous condition) while

maintaining the original natural global pitch contour (Fig.1). e pitch increases used in the present study dier

from those used in past studies (+ 35% in the weakly incongruous condition, + 120% in the strongly incongru-

ous condition)31–35. Preliminary pilot testing using conventional pitch increase values revealed a ceiling eect

among our Singaporean participants, likely because enhanced pitch perception abilities in speech have been

associated with bilingualism84 and tone language experience6,24–30. As such, we reduced the pitch incongruity

Table 1. Participant group demographics. Mean values and standard deviations (in parentheses) are given

for age, musical background, and musical abilities. For the group dierences on musical abilities, one-way

ANOVA and pairwise comparisons are Bonferroni corrected.

Participant group Group dierences on musical

abilities (MET scores) as revealed

by one-way ANOVAActive musicians (AM) Former musicians (FM) Non-musicians (NM)

N 22 27 47 N/A

Age 21.45 (2.87) 23.07 (3.93) 23.77 (4.13) N/A

No. of years of musical training 11.00 (3.80) 9.37 (3.12) 0.09 (0.28)

F(2,93) = 202.564, p < 0.001

AM > NM (p < 0.001)

AM = FM (p = 0.069)

FM > NM (p < 0.001)

No. of practice hours per week within

the past two years 5.91 (2.51) 0.70 (0.77) N/A N/A

No. of years since musical training

was discontinued 0.09 (0.29) 5.26 (3.44) N/A N/A

MET melody (%) 85.58 (7.43) 77.99 (7.92) 69.23 (7.38)

F(2,93) = 37.299, p < 0.001

AM > NM (p < 0.001)

AM > FM (p = 0.002)

FM > NM (p < 0.001)

MET rhythm (%) 78.76 (7.93) 73.29 (7.77) 66.98 (9.49)

F(2,93) = 14.579, p < 0.001

AM > NM (p < 0.001)

AM = FM (p = 0.093)

FM > NM (p = 0.010)

Language background 20 English-Mandarin bilingual

1 English-Malay bilingual

1 English-Tagalog bilingual

26 English-Mandarin bilingual

1 English-Tamil bilingual

40 English-Mandarin bilingual

2 English-Malay bilingual

4 English-Tamil bilingual

1 English-Burmese bilingual

Content courtesy of Springer Nature, terms of use apply. Rights reserved

Vol.:(0123456789)

Scientic Reports | (2023) 13:2657 | https://doi.org/10.1038/s41598-023-29733-3

www.nature.com/scientificreports/

in order to increase the diculty of the task, and preliminary pilot testing using our modied pitch increase

values obtained pitch discrimination accuracy rates across the experimental conditions which were similar to

those found by Marques etal.34.

Participants listened to the speech stimuli via headphones. ey were briefed that they would be listening

to either English or French sentences, and that comprehension of the sentences was not required. In each trial,

participants were asked to judge whether the nal word of each sentence sounded normal (congruous condition)

or strange (weakly incongruous or strongly incongruous conditions). Responses were recorded via a keyboard

press, “N” or “S” respectively. Participants were asked to provide a response within 3s. e practice phase

consisted of 6 trials, with feedback provided at the end of each trial to indicate if the participants had answered

correctly. e experimental phase consisted of 120 trials, broken up into four blocks of 30 sentences each. Sen-

tence blocks were counterbalanced across participants; half of the participants in each group heard blocks one

and two rst, while the other half heard blocks three and four rst. Sentences from each experimental condition

occurred equally frequently within each block and in pseudorandom order. Up to three consecutive “strange”

trials were allowed within each block, while pitch-manipulated variants of the same sentence were not allowed

to occur within the same block.

Results

A 2

3 mixed ANOVA was conducted with pitch discrimination accuracy as the dependent variable, language

(native vs. non-native) and prosodic congruity (congruous vs. weakly incongruous vs. strongly incongruous)

as the within-subject factors, and music group (active musicians vs. former musicians vs. non-musicians) as

the between-subject factor. As Mauchly’s Test indicated that the assumption of sphericity had been violated

for the prosodic congruity eect, χ2(2) = 242.497, p < 0.001, and the language by prosodic congruity eect,

χ2(2) = 153.767, p < 0.001, Greenhouse–Geisser correction was applied, ε = 0.519 and ε = 0.552 respectively. As

Box’s M Test indicated that the assumption of equality of covariance had been violated, Box’s M = 249.251,

F = 5.307, p < 0.001, Pillai’s Trace was used. For all pairwise comparisons, Bonferroni correction was applied.

The interaction effect between prosodic congruity and music group was statistically significant,

F(2.074,96.456) = 10.124, p < 0.001 (Fig.2). e main source of the interaction eect as revealed by simple eect

analyses was from the weakly incongruous condition, F(2,93) = 13.877, p < 0.001; the mean pitch discrimination

accuracy was signicantly dierent between active musicians (68%), former musicians (54%), and non-musicians

(42%). Pairwise comparisons revealed that active musicians outperformed former musicians (p = 0.034) and

non-musicians (p < 0.001), while former musicians also outperformed non-musicians (p = 0.042).

e eect of music group was signicant for the strongly incongruous condition, F(2,93) = 3.293, p = 0.042; the

mean pitch discrimination accuracy was signicantly dierent between active musicians (99%), former musicians

(98%), and non-musicians (96%). However, pairwise comparisons revealed no signicant dierences between

groups aer Bonferroni correction. Active musicians did not dier from non-musicians (p = 0.060), and former

musicians diered from neither active musicians (p = 1.000) nor non-musicians (p = 0.257). Meanwhile. the eect

of music group was not signicant for the congruous condition, F(2,93) = 0.767, p = 0.467.

e three-way interaction between language and prosodic congruity and music group was not signi-

cant, F(2.207,102.648) = 0.335, p = 0.737; neither was the interaction between language and music group,

F(2.000,93.000) = 0.615, p = 0.543.

There was also a significant interaction effect between language and prosodic congruity,

F(1.104,102.648) = 39.450, p < 0.001 (Fig.3). e eect of language was signicant for the weakly incongruous

condition, F(1.000,93.000) = 63.833, p < 0.001, where participants showed higher pitch discrimination accuracy

Figure1. Fundamental frequency (F0 in Hz) for a sample sentence in the three prosodic conditions.

Content courtesy of Springer Nature, terms of use apply. Rights reserved

Vol:.(1234567890)

Scientic Reports | (2023) 13:2657 | https://doi.org/10.1038/s41598-023-29733-3

www.nature.com/scientificreports/

in their native language English (66%) than in their non-native language French (43%). e eect of language

was also signicant for the congruous condition, F(1.000,93.000) = 8.816, p = 0.004, where participants showed

higher pitch discrimination accuracy in their native language English (98%) than in their non-native language

French (95%). However, the eect of language was not signicant for the strongly incongruous condition,

F(1.000,93.000) = 0.008, p = 0.931).

Signicant main eects were found for language, F(1.000,93.000) = 85.129, p < 0.001, prosodic congruity,

F(1.037,96.456 = 376.582, p < 0.001, and music group, F(2,93) = 14.275, p < 0.001. Participants showed higher pitch

discrimination accuracy in their native language English (87%) than in their non-native language French (79%).

e weakly incongruous condition (52%) was the most dicult to detect compared to the congruous condition

(96%) and strongly incongruous condition (97%). Active musicians (88%) and former musicians (83%) showed

higher pitch discrimination accuracy compared to non-musicians (78%).

Figure2. Pitch discrimination accuracy of active musicians, former musicians, and non-musicians in the three

prosodic conditions. Error bars denote standard error. *p < 0.05, ***p < 0.001.

Figure3. Pitch discrimination accuracy for the native language English and non-native language French in the

three prosodic conditions. Error bars denote standard error. **p < 0.01, ***p < 0.001.

Content courtesy of Springer Nature, terms of use apply. Rights reserved

Vol.:(0123456789)

Scientic Reports | (2023) 13:2657 | https://doi.org/10.1038/s41598-023-29733-3

www.nature.com/scientificreports/

Discussion

e present study is one of the rst to ascertain whether individuals who have discontinued musical training

and practice retain a behavioural advantage over non-musicians in pitch perception abilities in speech. Our key

nding is that there was a signicant interaction eect between prosodic congruity and music group. In the

weakly incongruous condition where pitch deviations were small and dicult to detect, our results showed a

stepwise progression in pitch discrimination accuracy, with active musicians having better performance than

former musicians, who in turn had better performance than non-musicians.

Our nding of an advantage by musicians over non-musicians in pitch discrimination echoes past ndings

that musical training facilitates pitch perception in the language domain5–23,31–39, thereby pointing towards a com-

mon domain-general pitch processing mechanism in music and speech perception as described in the OPERA

hypothesis4. Our nding also coheres with ndings of long-lasting neural changes from past musical training in

young adults61 and older adults62, as well as ndings of improved cognitive performance in adulthood even aer

ceasing musical training59,60. Taken together, it appears that former musicians share similar neural enhancement

as active musicians, and that the sharpened neural processing translates to perceptual benets behaviourally.

One explanation is that musical training requires individuals to attend to subtle sound contrasts, such as in pitch

and duration. Consequently, musicians become more sensitive to such subtle acoustic cues, which has a positive

spillover eect when discriminating similar contrasts in speech. Musical training contributes to the establish-

ment of cognitive enhancement, such that there are some enduring cross-domain transfer benets of musical

training on the discrimination of subtle speech contrasts even aer musical training and practice is discontinued.

More importantly, our nding that former musicians diered from active musicians qualies the extent of the

positive music-to-language transfer eects. Drawing on previous studies showing a clear behavioural advantage

held by adult musicians over non-musicians in a pitch contour violation task similar to that used in the present

study31,34, it appears that former musicians retain some musician advantage, but that such advantage may fade

over time aer musical training and practice is discontinued. Parallel results were seen in the data from the

musical abilities tests. ere were signicant group dierences in MET melody scores (Table1), where active

musicians had better musical abilities than former musicians, and former musicians in turn had better musical

abilities than non-musicians. e dierence in pitch discrimination and musical abilities between active musi-

cians and former musicians cannot be attributed to the length of musical training, which was similar for the two

groups as seen in the pairwise comparison (Table1). erefore, our data suggests that music attrition in pitch

perception manifests in both the music and language domains among former musicians. One possible explana-

tion is that subtle acoustic cues may no longer be behaviourally relevant in former musicians’ everyday auditory

environment, such that positive music-to-language transfer benets may diminish over time. is interpretation

is corroborated by exploratory Pearson correlations conducted to assess the relationship between the number

of years since discontinuing musical practice and other factors. As reported in the Supplementary Results,

in the sample of musicians both active and former, the number of years since discontinuing musical practice

was signicantly negatively correlated with pitch discrimination accuracy for the weakly incongruous prosodic

condition in the native language English and non-native language French (see Supplementary Figs.1 and 2), as

well as with MET melody and rhythm subtest performance. e longer the period since discontinuing musical

training and practice, the poorer one is at discriminating subtle violations in speech and music, indicating that

there might be a gradual attenuation in ner acoustic discrimination abilities among former musicians. Future

research can be conducted with older adults as participants, including former musicians who have discontinued

musical training and practice for a longer period, to examine the issue further.

We also found a signicant main eect for language and a signicant interaction eect between language

and prosodic congruity. Apart from a native language context with multiple possible sources of information that

might assist in pitch processing, we implemented a non-native language context without additional information

for participants to rely on. e interaction eect revealed that in both the congruous and weakly incongruous

conditions, participants were more accurate in prosodic pitch discrimination in their native language than in a

non-native language, consistent with previous ndings by Marques etal.34 and Deguchi etal.35. ere are two

possible explanations for this native language advantage, which Deguchi etal.35 investigated by introducing jab-

berwocky sentences that preserved the intonational contours of the native language but consisted of meaningless

legal pseudowords. ey found that participants were better at detecting pitch changes in their native language

than in jabberwocky, but were also better at detecting pitch changes in jabberwocky than in the non-native lan-

guage. is suggests that participants were familiar with typical intonational contours in their native language,

and were consequently better able to detect pitch changes in the native language and jabberwocky speech stimuli

but not in the non-native language speech stimuli. At the same time, participants could understand the mean-

ing of their native language, and for that reason were better able to anticipate when the nal word carrying the

pitch variation would occur in the native language speech stimuli but not in the jabberwocky and non-native

language speech stimuli. e native language advantage observed in our study can thus be explained by the fact

that participants were making use of both prosodic and semantic information to complete the pitch incongruity

detection task. Nonetheless, regardless of the language used for the speech stimuli, group dierences were seen

in the weakly incongruous condition. Active and former musicians were more accurate than non-musicians in

detecting prosodic pitch violation no matter whether they had prior knowledge of the language tested. In the

non-native language context which better isolated the prosodic pitch dimension without inuence from other

types of linguistic processing, participants would not have been able to exploit additional top-down processing

frameworks and would have been relying solely on bottom-up pitch perceptual sensitivity to acoustic cues. In

the native language context, group dierences were also seen despite the fact that all participants were able to

draw on additional linguistic resources. e present study hence extends the OPERA hypothesis4 by underscor-

ing that—presumably due to their prior musical training with melodic pitch patterns—former musicians retain

Content courtesy of Springer Nature, terms of use apply. Rights reserved

Vol:.(1234567890)

Scientic Reports | (2023) 13:2657 | https://doi.org/10.1038/s41598-023-29733-3

www.nature.com/scientificreports/

enhanced underlying pitch processing abilities, which generalise to the perception of prosodic pitch contours

in speech for both native and non-native languages.

A caveat to keep in mind, though, is that although there is a large body of evidence in which musicians—be it

active and former—outperform non-musicians in various pitch perception tasks, it may not be straightforward

to conclude that the so-called musician advantage is a result of musical training. It is plausible that the results

reported in this present study may be driven by a third, unexplored factor, such as general intelligence, educa-

tion background, or socioeconomic status. On top of that, as with most of the previously reported literature, this

present study adopted a cross-sectional design comparing dierent population groups at a specic point in time.

In recent years, some researchers have propounded the idea that inherent musical abilities, rather than musical

training, might be linked to enhanced speech perception30,85,86. Ergo, the music-to-language transfer eects that

we speak of may be a consequence of pre-existing dierences and self-selection, as opposed to a consequence

of musical training per se. In other words, individuals pre-disposed with superior auditory or pitch processing

abilities to begin with may be more inclined to pick up and continue musical training, such that the dierences

observed between active, former, and non-musicians later in life may not be a direct outcome of musical train-

ing in and of itself.

However, as highlighted in the introduction, there is some compelling evidence in the existing literature that

musical training has a causal inuence on brain development and pitch perception. Participants initially matched

in musical aptitude, general intelligence, and socioeconomic status have been shown to demonstrate group dif-

ferences in neurological and behavioural post-tests related to pitch perception depending on the training they

are randomly assigned to33,45–47. Of particular relevance to our study, Moreno etal.33 conducted a longitudinal

experimental study with 8-year-old children without any prior musical training. Pre-tests conrmed that the

children were initially matched in pitch perception performance, general cognitive abilities, as well as socio-

economic status. ese children were then randomly assigned to receive 6months of either musical training

or painting training. e researchers recorded both electrophysiological and behavioural measures for a pitch

violation discrimination task similar to that used in this present study. ey found that children who received

musical training, but not those who received painting training, showed improved prosodic pitch discrimina-

tion abilities in speech. Along the same lines, Nan etal.45 randomly assigned 4- to 5-year-old children with

tone language experience to receive 6months of piano training, reading training, or no training. Although the

children were initially matched in general cognitive abilities and socioeconomic status, and although all groups

showed improvements in general cognitive abilities, only those who received piano training showed enhanced

cortical responses to pitch changes in music and speech which were correlated with behavioural performance.

ese ndings suggest that musical training can indeed cause experience-dependent transfer eects that cannot

be attributed to external factors or pre-existing dierences, while our study further suggests that some transfer

eects may be retained even aer musical training and practice is discontinued. Having said that, future research

can strengthen our nding by performing an intervention study with longitudinal randomised controlled trials

to track and compare the eects of long-term, short-term, and no musical training among individuals who are

otherwise matched on other variables.

In conclusion, our study shows that musical training confers positive cross-domain benets in speech per-

ception, adding to the body of literature on music-to-language transfer and suggesting that there is a common

pitch processing mechanism underlying pitch perception in the two domains. More importantly, our results

further show that these benets may be retained to some extent over time, such that former musicians show

some behavioural advantage over non-musicians even aer the discontinuation of musical training and practice.

Situated within the OPERA hypothesis4, it appears that musical training alters the shared neural networks for

music and speech in a long-lasting manner, such that the musician advantage applies not only to active musicians

but to former musicians as well. Moreover, this advantage in prosodic pitch perception is seen with both native

and non-native languages. Possible future directions for research include using neurological and behavioural

measures to compare active musicians, former musicians, and non-musicians’ pitch perception abilities in the

language domain in other areas such as the perception of lexical tones, stress, and emotional prosody. Our nd-

ings have real-life implications for boosting rst language acquisition and foreign language learning, as well as

protecting against age-related cognitive and auditory decline in the ageing population. It appears that musical

training and practice can serve as an eective enrichment activity and intervention method to improve speech

perception, and that individuals can reap some long-lasting cognitive benets throughout their lifespan even

aer musical training and practice is discontinued.

Data availability

e dataset generated during and/or analysed during the current study is included in the Supplementary Infor-

mation le.

Received: 19 September 2022; Accepted: 9 February 2023

References

1. S chellenberg, E. G. & Weiss, W. M. Music and cognitive abilities. In e Psychology of Music (ed. Deutsch, D.) 499–550 (Academic

Press, 2013).

2. Parbery-Clark, A., Anderson, S., Hittner, E. & Kraus, N. Musical experience osets age-related delays in neural timing. Neurobiol.

Aging 33(1483), e1-4. https:// doi. org/ 10. 1016/j. neuro biola ging. 2011. 12. 015 (2012).

3. Parbery-Clark, A., Skoe, E., Lam, C. & Kraus, N. Musician enhancement for speech-in-noise. Ear. Hear. 30, 653–661. https:// doi.

org/ 10. 1097/ AUD. 0b013 e3181 b412e9 (2009).

Content courtesy of Springer Nature, terms of use apply. Rights reserved

Vol.:(0123456789)

Scientic Reports | (2023) 13:2657 | https://doi.org/10.1038/s41598-023-29733-3

www.nature.com/scientificreports/

4. Patel, A. D. Why would musical training benet the neural encoding of speech? e OPERA hypothesis. Front. Psychol. https://

doi. org/ 10. 3389/ fpsyg. 2011. 00142 (2011).

5. Alexander, J. A., Wong, P. C. M. & Bradlow, A. R. Lexical tone perception in musicians and non-musicians. In Proceedings of Proc.

Annual Conference of the International Speech Communication Association Interspeech (2005).

6. Burnham, D., Brooker, R. & Reid, A. e eects of absolute pitch ability and musical training on lexical tone perception. Psychol.

Music. 43, 881–897. https:// doi. org/ 10. 1177/ 03057 35614 546359 (2015).

7. Choi, W. e selectivity of musical advantage: Musicians exhibit perceptual advantage for some but not all Cantonese tones. Music

Percept. 37, 423–434. https:// doi. org/ 10. 1525/ MP. 2020. 37.5. 423 (2020).

8. Delogu, F., Lampis, G. & Belardinelli, M. O. From melody to lexical tone: Musical ability enhances specic aspects of foreign

language perception. Eur. J. Cogn. Psychol. 22, 46–61. https:// doi. org/ 10. 1080/ 09541 44080 27081 36 (2010).

9. Gottfried, T. L. & Riester, D. Relation of pitch glide perception and Mandarin tone identication. J. Acoust. Soc. Am. 108, 2604.

https:// doi. org/ 10. 1121/1. 47436 98 (2000).

10. Gottfried, T. L., Staby, A. M. & Ziemer, C. J. Musical experience and Mandarin tone discrimination and imitation. J. Acoust. Soc.

Am. 115, 2545. https:// doi. org/ 10. 1121/1. 47836 74 (2001).

11. Han, Y., Goudbeek, M., Mos, M. & Swerts, M. Mandarin tone identication by tone-naïve musicians and non-musicians in

auditory-visual and auditory-only conditions. Front. Commun. 4, 1–14. https:// doi. org/ 10. 3389/ fcomm. 2019. 00070 (2019).

12. Hung, T.-H. & Lee, C.-Y. Processing linguistic and musical pitch by English-speaking musicians and non-musicians. In 20th North

American Conference on Chinese Linguistics (2008).

13. Lee, C.-Y. & Hung, T.-H. Identication of Mandarin tones by English-speaking musicians and nonmusicians. J. Acoust. Soc. Am.

124, 3235–3248. https:// doi. org/ 10. 1121/1. 29907 13 (2008).

14. Marie, C. L., Delogu, F., Lampis, G., Belardinelli, M. O. & Besson, M. Inuence of musical expertise on segmental and tonal pro-

cessing in Mandarin Chinese. J. Cogn. Neurosci. 23, 2701–2715. https:// doi. org/ 10. 1162/ jocn. 2010. 21585 (2011).

15. Wayland, R. P., Herrera, E. & Kaan, E. Eects of musical experience and training on pitch contour perception. J. Phon. 38, 654–662.

https:// doi. org/ 10. 1016/j. wocn. 2010. 10. 001 (2010).

16. Bidelman, G. M., Gandour, J. T. & Krishnan, A. Cross-domain eects of music and language experience on the representation of

pitch in the human auditory brainstem. J. Cogn. Neurosci. 23, 425–434. https:// doi. org/ 10. 1162/ jocn. 2009. 21362 (2011).

17. Wong, P. C. M., Skoe, E., Russo, N. M., Dees, T. & Kraus, N. Musical experience shapes human brainstem encoding of linguistic

pitch patterns. Nat. Neurosci. 10, 420–422. https:// doi. org/ 10. 1038/ nn1872 (2007).

18. Mok, P. P. K. & Zuo, D. e separation between music and speech: Evidence from the perception of Cantonese tones. J. Acoust.

Soc. Am. 132, 2711–2720. https:// doi. org/ 10. 1121/1. 47470 10 (2012).

19. Ong, J. H., Wong, P. C. M. & Liu, F. Musicians show enhanced perception, but not production, of native lexical tones. J. Acoust.

Soc. Am. 148, 3443. https:// doi. org/ 10. 1121/ 10. 00027 76 (2020).

20. Toh, X. R., Lau, F. & Wong, F. C. K. Individual dierences in nonnative lexical tone perception: Eects of tone language repertoire

and musical experience. Front. Psychol. 13, 940363. https:// doi. org/ 10. 3389/ fpsyg. 2022. 940363 (2022).

21. Kolinsky, R., Cuvelier, H., Goetry, V., Peretz, I. & Morais, J. Music training facilitates lexical stress processing. Music Percept. 26,

235–246. https:// doi. org/ 10. 1525/ mp. 2009. 26.3. 235 (2009).

22. Choi, W. Towards a native OPERA hypothesis: Musicianship and English stress perception. Lang. Speech 65, 697–712. https:// doi.

org/ 10. 1177/ 00238 30921 10494 58 (2022).

23. Choi, W. What is “music” in music-to-language transfer? Musical ability but not musicianship supports Cantonese listeners’ English

stress perception. J. Speech Lang. Hear. Res. 65, 4047–4059. https:// doi. org/ 10. 1044/ 2022_ JSLHR- 22- 00175 (2022).

24. Lee, Y.-S., Vakoch, D. A. & Lee, H. W. Tone perception in Cantonese and Mandarin: A cross-linguistic comparison. J. Psycholinguist.

Res. 25, 527–542. https:// doi. org/ 10. 1007/ BF017 58181 (1996).

25. Morett, L. M. e inuence of tonal and atonal bilingualism on children’s lexical and non-lexical tone perception. Lang. Speech

63, 221–241. https:// doi. org/ 10. 1177/ 00238 30919 834679 (2020).

26. Qin, Z. & Mok, P. K. P. Discrimination of Cantonese tones by speakers of tone and non-tone languages. Kans. Work. Pap. Linguist.

34, 26–42. https:// doi. org/ 10. 17161/ KWPL. 1808. 12864 (2013).

27. Schaefer, V. & Darcy, I. Lexical function of pitch in the rst language shapes cross-linguistic perception of ai tones. Lab. Phonol.

5, 489–522. https:// doi. org/ 10. 1515/ lp- 2014- 0016 (2014).

28. Schaefer, V. & Darcy, I. Applying a newly learned second language dimension to the unknown: e inuence of second language

Mandarin tones on the naïve perception of ai tones. Psychol. Lang. Commun. 24, 90–123. https:// doi. org/ 10. 2478/ plc- 2020- 0007

(2020).

29. Wayland, R. P. & Guion, S. G. Training English and Chinese listeners to perceive ai tones: A preliminary report. Lang. Learn.

54, 681–712. https:// doi. org/ 10. 1111/j. 1467- 9922. 2004. 00283.x (2004).

30. Wayland, R. P. & Li, B. Eects of two training procedures in cross-language perception of tones. J. Phon. 36, 250–267. https:// doi.

org/ 10. 1016/j. wocn. 2007. 06. 004 (2008).

31. Schön, D., Magne, C. & Besson, M. e music of speech: Music training facilitates pitch processing in both music and language.

Psychophysiology 41, 341–349. https:// doi. org/ 10. 1111/ 1469- 8986. 00172.x (2004).

32. Magne, C., Schön, D. & Besson, M. Musician children detect pitch violations in both music and language better than nonmusician

children: Behavioral and electrophysiological approaches. J. Cogn. Neurosci. 18, 199–211. https:// doi. org/ 10. 1162/ 08989 29067

75783 660 (2006).

33. Moreno, S. et al. Musical training inuences linguistic abilities in 8-year-old children: More evidence for brain plasticity. Cereb.

Cortex 19, 712–723. https:// doi. org/ 10. 1093/ cercor/ bhn120 (2009).

34. Marques, C., Moreno, S., Castro, S. L. & Besson, M. Musicians detect pitch violation in a foreign language better than nonmusi-

cians: Behavioral and electrophysiological evidence. J. Cogn. Neurosci. 19, 1453–1463. ht tps:// doi. org/ 10. 1162/ jocn. 2007. 19.9. 1453

(2007).

35. Deguchi, C. et al. Sentence pitch change detection in the native and unfamiliar language in musicians and non-musicians: Behav-

ioral, electrophysiological and psychoacoustic study. Brain Res. 1455, 75–89. https:// doi. org/ 10. 1016/j. brain res. 2012. 03. 034 (2012).

36. ompson, W. F., Schellenberg, E. G. & Husain, G. Perceiving prosody in speech: Eects of music lessons. Ann. N. Y. Acad. Sci.

999, 530–532. https:// doi. org/ 10. 1196/ annals. 1284. 067 (2003).

37. ompson, W. F., Schellenberg, E. G. & Husain, G. Decoding speech prosody: Do music lessons help?. Emotion 4, 46–64. https://

doi. org/ 10. 1037/ 1528- 3542.4. 1. 46 (2004).

38. Farmer, E., Jicol, C. & Petrini, K. Musicianship enhances perception but not feeling of emotion from others’ social interaction

through speech prosody. Music Percept. 37, 323–338. https:// doi. org/ 10. 1525/ mp. 2020. 37.4. 323 (2020).

39. Lima, C. F. & Castro, S. L. Speaking to the trained ear: Musical expertise enhances the recognition of emotions in speech prosody.

Emotion 11, 1021–1031. https:// doi. org/ 10. 1037/ a0024 521 (2011).

40. Pantev, C. et al. Increased auditory cortical representation in musicians. Nature 392, 811–814. https:// doi. org/ 10. 1038/ 33918 (1998).

41. Pantev, C., Engelien, A., Candia, V. & Elbert, T. Representational cortex in musicians: Plastic alterations in response to musical

practice. Ann. N. Y. Acad. Sci. 930, 300–314. https:// doi. org/ 10. 1111/j. 1749- 6632. 2001. tb057 40.x (2001).

42. Schlaug, G. e brain of musicians: A model for functional and structural adaptation. Ann. N. Y. Acad. Sci. 930, 281–299. https://

doi. org/ 10. 1111/j. 1749- 6632. 2001. tb057 39.x (2001).

Content courtesy of Springer Nature, terms of use apply. Rights reserved

Vol:.(1234567890)

Scientic Reports | (2023) 13:2657 | https://doi.org/10.1038/s41598-023-29733-3

www.nature.com/scientificreports/

43. Bermudez, P. & Zatorre, R. J. Dierences in gray matter between musicians and nonmusicians. Ann. N. Y. Acad. Sci. 1060, 395–399.

https:// doi. org/ 10. 1196/ annals. 1360. 057 (2005).

44. Gaser, C. & Schlaug, G. Brain structures dier between musicians and non-musicians. J. Neurosci. 23, 9240–9245. https:// doi. org/

10. 1523/ JNEUR OSCI. 23- 27- 09240. 2003 (2003).

45. Nan, Y. et al. Piano training enhances the neural processing of pitch and improves speech perception in Mandarin-speaking

children. Proc. Natl. Acad. Sci. USA 115, 6630–6639. https:// doi. org/ 10. 1073/ pnas. 18084 12115 (2018).

46. Hyde, K. L. et al. e eects of musical training on structural brain development. Ann. N. Y. Acad. Sci. 1169, 182–186. https:// doi.

org/ 10. 1111/j. 1749- 6632. 2009. 04852.x (2009).

47. Hyde, K. L. et al. Musical training shapes structural brain development. J. Neurosci. 29, 3019–3025. https:// do i . org/ 10. 1523/ JNEUR

OSCI. 5118- 08. 2009 (2009).

48. Kraus, N., Skoe, E., Parbery-Clark, A. & Ashley, R. Experience-induced malleability in neural encoding of pitch, timbre, and tim-

ing. Ann. N. Y. Acad. Sci. 11691, 543–557. https:// doi. org/ 10. 1111/j. 1749- 6632. 2009. 04549.x (2009).

49. Wan, C. Y. & Schlaug, G. Music making as a tool for promoting brain plasticity across the lifespan. Neuroscientist 16, 566–577.

https:// doi. org/ 10. 1177/ 10738 58410 377805 (2010).

50. Neves, L., Correia, A. I., Castro, S. L., Martins, D. & Lima, C. F. Does music training enhance auditory and linguistic processing?

A systematic review and meta-analysis of behavioral and brain evidence. Neurosci. Biobehav. Rev. 140, 104777. https:// doi. org/ 10.

1016/j. neubi orev. 2022. 104777 (2022).

51. Zendel, B. R. & Alain, C. Musicians experience less age-related decline in central auditory processing. Psychol. Aging 27, 410–417.

https:// doi. org/ 10. 1037/ a0024 816 (2012).

52. Zendel, B. R. & Alain, C. e inuence of lifelong musicianship on neurophysiological measures of concurrent sound segregation.

J. Cogn. Neurosci. 25, 503–516. https:// doi. org/ 10. 1162/ jocn_a_ 00329 (2013).

53. Alain, C., Zendel, B. R., Hutka, S. & Bidelman, G. M. Tur ning down the noise: e benet of musical training on the aging auditory

brain. Hear. Res. 308, 162–173. https:// doi. org/ 10. 1016/j. heares. 2013. 06. 008 (2014).

54. Zhang, J. D., Susino, M., McPherson, G. E. & Schubert, E. e denition of a musician in music psychology: A literature review

and the six-year rule. Psychol. Music 48, 389–409. https:// doi. org/ 10. 1177/ 03057 35618 804038 (2020).

55. Costa-Giomi, E. Music instruction and children’s intellectual development: e educational context of music participation. In

Music, Health, and Wellbeing (eds MacDonald, R. et al.) 339–355 (Oxford University Press, 2012).

56. Costa-Giomi, E. e long-term eects of childhood music instruction on intelligence and general cognitive abilities. Update Appl.

Res. Music Educ. 33, 20–26. https:// doi. org/ 10. 1177/ 87551 23314 540661 (2015).

57. Costa-Giomi, E. & Ryan, C. e benets of music insturction: What remains years later. Symp. Res. Music Behav. 20, 25 (2007).

58. Costa-Giomi, E. e eects of three years of piano instruction on children’s cognitive development. J. Res. Music Educ. 47, 198–212.

https:// doi. org/ 10. 2307/ 33457 79 (1999).

59. Schellenberg, E. G. Long-term positive associations between music lessons and IQ. J. Educ. Psychol. 98, 457–468. https:// doi. org/

10. 1037/ 0022- 0663. 98.2. 457 (2006).

60. Strong, J. V. & Midden, A. Cognitive dierences between older adult instrumental musicians: Benets of continuing to play. Psychol.

Music 48, 67–83. https:// doi. org/ 10. 1177/ 03057 35618 785020 (2020).

61. Skoe, E. & Kraus, N. A little goes a long way: How the adult brain is shaped by musical training in childhood. J. Neurosci. 32,

11507–11510. https:// doi. org/ 10. 1523/ JNEUR OSCI. 1949- 12. 2012 (2012).

62. White-Schwoch, T., Carr, K. W., Anderson, S., Strait, D. L. & Kraus, N. Older adults benet from music training early in life:

Biological evidence for long-term training-driven plasticity. J. Neurosci. 33, 17667–17674. https:// doi. org/ 10. 1523/ JNEUR OSCI.

2560- 13. 2013 (2013).

63. Wennerstrom, A. e Music of Everyday Speech: Prosody and Discourse Analysis (Oxford University Press, 2001).

64. Patel, A. D., Wong, M., Foxton, J., Lochy, A. & Peretz, I. Speech intonation perception decits in musical tone deafness (congenital

amusia). Music Percept. 25, 357–368. https:// doi. org/ 10. 1525/ mp. 2008. 25.4. 357 (2008).

65. Hutchins, S., Gosselin, N. & Peretz, I. Identication of changes along a continuum of speech intonation is impaired in congenital

amusia. Front. Psychol. 1, 236. https:// doi. org/ 10. 3389/ fpsyg. 2010. 00236 (2010).

66. Jiang, C., Hamm, J. P., Lim, V. K., Kirk, I. J. & Yang, Y. Processing melodic contour and speech intonation in congenital amusics

with Mandarin Chinese. Neuropsychologia 48, 2630–2639. https:// doi. org/ 10. 1016/j. neuro psych ologia. 2010. 05. 009 (2010).

67. Liu, F., Patel, A. D., Fourcin, A. & Stewart, L. Intonation processing in congenital amusia: Discrimination, identication and imita-

tion. Brain 133, 1682–1693. https:// doi. org/ 10. 1093/ brain/ awq089 (2010).

68. Monrad-Krohn, G. H. e prosodic quality of speech and its disorders. Acta Psychiatr. Scand. 22, 255–269. https:// doi. org/ 10.

1111/j. 1600- 0447. 1947. tb082 46.x (1947).

69. Zhang, J. & McBride-Chang, C. Auditory sensitivity, speech perception, and reading development and impairment. Educ. Psychol.

Rev. 22, 323–338. https:// doi. org/ 10. 1007/ s10648- 010- 9137-4 (2010).

70. Holliman, A. J. et al. Beginning to disentangle the prosody-literacy relat ionship: A multi-component measure of prosodic sensitiv-

it y. Read. Writ. 27, 255–266. https:// doi. org/ 10. 1007/ s11145- 013- 9443-6 (2014).

71. Groen, M. A., Veenendaal, N. J. & Verhoeven, L. e role of prosody in reading comprehension: Evidence from poor comprehend-

ers. J. Res. Read. 42, 37–57. https:// doi. org/ 10. 1111/ 1467- 9817. 12133 (2019).

72. Cutler, A., Dahan, D. & van Donselaar, W. Prosody in the comprehension of spoken language: A literature review. Lang. Speech

40, 141–201. https:// doi. org/ 10. 1177/ 00238 30997 04000 203 (1997).

73. Hellbernd, N. & Sammler, D. Prosody conveys speaker’s intentions: Acoustic cues for speech act perception. J. Mem. Lang. 88,

70–86. https:// doi. org/ 10. 1016/j. jml. 2016. 01. 001 (2016).

74. Hupp, J. M., Jungers, M. K., Hinerman, C. M. & Porter, B. L. Cup! Cup? Cup: Comprehension of intentional prosody in adults and

children. Cogn. Dev. 57, 100971. https:// doi. org/ 10. 1016/j. cogdev. 2020. 100971 (2021).

75. Campeld, D. E. & Murphy, V. A. e inuence of prosodic input in the second language classroom: Does it stimulate child

acquisition of word order and function words?. Lang. Learn. J. 45, 81–99. https:// doi. org/ 10. 1080/ 09571 736. 2013. 807864 (2017).

76. Saksida, A., Fló, A., Guedes, B., Nespor, M. & Peña Garay, M. Prosody facilitates learning the word order in a new language. Cogni-

tion 213, 104686. https:// doi. org/ 10. 1016/j. cogni tion. 2021. 104686 (2021).

77. Saito, Y. & Saito, K. Dierential eects of instruction on the development of second language comprehensibility, word stress,

rhythm, and intonation: e case of inexperienced Japanese EFL learners. Lang. Teach. Res. 21, 589–608. https:// doi. org/ 10. 1177/

13621 68816 643111 (2017).

78. Yenkimaleki, M. Prosody training benets in perception vs production skills in simultaneous interpreting: An experimental study.

Dutch J. Appl. Linguist. https:// doi. org/ 10. 51751/ dujal 9888 (2021).

79. Wallentin, M., Nielsen, A. H., Friis-Olivarius, M., Vuust, C. & Vuust, P. e Musical Ear Test, a new reliable test for measuring

musical competence. Learn. Indiv. Dier. 20, 188–196. https:// doi. org/ 10. 1016/j. lindif. 2010. 02. 004 (2010).

80. Rothauser, E. H. et al. IEEE recommended practice for speech quality measures. IEEE Trans. Audio Electroacoust. 17, 225–246.

https:// doi. org/ 10. 1109/ TAU. 1969. 11620 58 (1969).

81. Smith, C. L. Prosodic nality and sentence type in French. Lang. Speech 45, 141–178. https:// doi. org/ 10. 1177/ 00238 30902 04500

20301 (2002).

82. Audacity: Free Audio Editor and Recorder v. 2.3.2 (2018).

83. Boersma, P. Praat, a system for doing phonetics by computer. Glot Int. 5, 341–345 (2001).

Content courtesy of Springer Nature, terms of use apply. Rights reserved

Vol.:(0123456789)

Scientic Reports | (2023) 13:2657 | https://doi.org/10.1038/s41598-023-29733-3

www.nature.com/scientificreports/

84. Krizman, J., Marian, V., Shook, A., Skoe, E. & Kraus, N. Subcortical encoding of sound is enhanced in bilinguals and relates to

executive function advantages. Proc. Natl. Acad. Sci. USA 109, 7877–7881. https:// doi. org/ 10. 1073/ pnas. 12015 75109 (2012).

85. Mankel, K. & Bidelman, G. M. Inherent auditory skills rather than formal music training shape the neural encoding of speech.

Proc. Natl. Acad. Sci. USA 115, 13129–13134. https:// doi. org/ 10. 1073/ pnas. 18117 93115 (2018).

86. Swaminathan, S. & Schellenberg, E. G. Musical ability, music training, and language ability in childhood. J. Exp. Psychol. Learn.

Mem. Cogn. 46, 2340–2348. https:// doi. org/ 10. 1037/ xlm00 00798 (2020).

Acknowledgements

is study was supported by research grants from the Ministry of Education (MOE), Singapore (MOE2019-

SSRTG-016, MOE-T2EP402A20-0003). We thank Dr. Alice H. D. Chan for her insight on the design of the study.

We are also grateful to all participants for their contribution to the study.

Author contributions

S.T. and G.W. contributed to the conception, design, and implementation of the study. X.T., S.T., G.W., F.L.,

and F.W. performed the statistical analysis and interpreted the data. X.T., S.T., and G.W. wrote dras of the

manuscript. X.T., F.L., and F.W. revised and nalised the manuscript. All authors reviewed and approved the

submitted manuscript.

Additional information

Supplementary Information e online version contains supplementary material available at https:// doi. org/

10. 1038/ s41598- 023- 29733-3.

Correspondence and requests for materials should be addressed to F.C.K.W.

Reprints and permissions information is available at www.nature.com/reprints.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and

institutional aliations.

Open Access is article is licensed under a Creative Commons Attribution 4.0 International

License, which permits use, sharing, adaptation, distribution and reproduction in any medium or

format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the

Creative Commons licence, and indicate if changes were made. e images or other third party material in this

article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the

material. If material is not included in the article’s Creative Commons licence and your intended use is not

permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from

the copyright holder. To view a copy of this licence, visit http:// creat iveco mmons. org/ licen ses/ by/4. 0/.

Content courtesy of Springer Nature, terms of use apply. Rights reserved

Terms and Conditions

Springer Nature journal content, brought to you courtesy of Springer Nature Customer Service Center GmbH (“Springer Nature”).

Springer Nature supports a reasonable amount of sharing of research papers by authors, subscribers and authorised users (“Users”), for small-

scale personal, non-commercial use provided that all copyright, trade and service marks and other proprietary notices are maintained. By

accessing, sharing, receiving or otherwise using the Springer Nature journal content you agree to these terms of use (“Terms”). For these

purposes, Springer Nature considers academic use (by researchers and students) to be non-commercial.

These Terms are supplementary and will apply in addition to any applicable website terms and conditions, a relevant site licence or a personal

subscription. These Terms will prevail over any conflict or ambiguity with regards to the relevant terms, a site licence or a personal subscription

(to the extent of the conflict or ambiguity only). For Creative Commons-licensed articles, the terms of the Creative Commons license used will

apply.

We collect and use personal data to provide access to the Springer Nature journal content. We may also use these personal data internally within

ResearchGate and Springer Nature and as agreed share it, in an anonymised way, for purposes of tracking, analysis and reporting. We will not

otherwise disclose your personal data outside the ResearchGate or the Springer Nature group of companies unless we have your permission as

detailed in the Privacy Policy.

While Users may use the Springer Nature journal content for small scale, personal non-commercial use, it is important to note that Users may

not:

use such content for the purpose of providing other users with access on a regular or large scale basis or as a means to circumvent access

control;

use such content where to do so would be considered a criminal or statutory offence in any jurisdiction, or gives rise to civil liability, or is

otherwise unlawful;

falsely or misleadingly imply or suggest endorsement, approval , sponsorship, or association unless explicitly agreed to by Springer Nature in

writing;

use bots or other automated methods to access the content or redirect messages

override any security feature or exclusionary protocol; or

share the content in order to create substitute for Springer Nature products or services or a systematic database of Springer Nature journal

content.

In line with the restriction against commercial use, Springer Nature does not permit the creation of a product or service that creates revenue,

royalties, rent or income from our content or its inclusion as part of a paid for service or for other commercial gain. Springer Nature journal

content cannot be used for inter-library loans and librarians may not upload Springer Nature journal content on a large scale into their, or any

other, institutional repository.

These terms of use are reviewed regularly and may be amended at any time. Springer Nature is not obligated to publish any information or

content on this website and may remove it or features or functionality at our sole discretion, at any time with or without notice. Springer Nature

may revoke this licence to you at any time and remove access to any copies of the Springer Nature journal content which have been saved.

To the fullest extent permitted by law, Springer Nature makes no warranties, representations or guarantees to Users, either express or implied

with respect to the Springer nature journal content and all parties disclaim and waive any implied warranties or warranties imposed by law,

including merchantability or fitness for any particular purpose.

Please note that these rights do not automatically extend to content, data or other material published by Springer Nature that may be licensed

from third parties.

If you would like to use or distribute our Springer Nature journal content to a wider audience or on a regular basis or in any other manner not

expressly permitted by these Terms, please contact Springer Nature at

onlineservice@springernature.com

Musical Advantage in Lexical Tone Perception Hinges on Musical Instrument

Article

Full-text available

Jun 2024
MUSIC PERCEPT

Different musical instruments have different pitch processing demands. However, correlational studies have seldom considered the role of musical instruments in music-to-language transfer. Addressing this research gap could contribute to a nuanced understanding of music-to-language transfer. To this end, we investigated whether pitched musicians had a unique musical advantage in lexical tone perception relative to unpitched musicians and nonmusicians. Specifically, we compared Cantonese pitched musicians, unpitched musicians, and nonmusicians on Thai tone discrimination and sequence recall. In the Thai tone discrimination task, the pitched musicians outperformed the unpitched musicians and the nonmusicians. Moreover, the unpitched musicians and the nonmusicians performed similarly. In the Thai tone sequence recall task, both pitched and unpitched musicians recalled level tone sequences more accurately than the nonmusicians, but the pitched musicians showed the largest musical advantage. However, the three groups recalled contour tone sequences with similar accuracy. Collectively, the pitched musicians had a unique musical advantage in lexical tone discrimination and the largest musical advantage in level tone sequence recall. From a theoretical perspective, this study offers correlational evidence for the Precision element of the OPERA hypothesis. The choice of musical instrumental may matter for music-to-language transfer in lexical tone discrimination and level tone sequence recall.

Auditory Processing in Musicians, a Cross-Sectional Study, as a Basis for Auditory Training Optimization

Article

Full-text available

Jul 2023

Μusicians are reported to have enhanced auditory processing. This study aimed to assess auditory perception in Greek musicians with respect to their musical specialization and to compare their auditory processing with that of non-musicians. Auditory processing elements evaluated were speech recognition in babble, rhythmic advantage in speech recognition, short-term working memory, temporal resolution, and frequency discrimination threshold detection. All groups were of 12 participants. Three distinct experimental groups tested included western classical musicians, Byzantine chanters, and percussionists. The control group consisted of 12 non-musicians. The results revealed: (i) a rhythmic advantage for word recognition in noise for classical musicians (M = 12.42) compared to Byzantine musicians (M = 9.83), as well as for musicians compared to non-musicians (U = 120.50, p = 0.019), (ii) better frequency discrimination threshold of Byzantine musicians (M = 3.17, p = 0.002) compared to the other two musicians’ group for the 2000 Hz region, (iii) statistically significant better working memory for musicians (U = 123.00, p = 0.025) compared to non-musicians. Musical training enhances elements of auditory processing and may be used as an additional rehabilitation approach during auditory training, focusing on specific types of music for specific auditory processing deficits.

Auditory Processing in Musicians: a Basis for Auditory Training Optimization

Preprint

Full-text available

May 2023

Better auditory processing of musicians is observed in previous research. As musicians differentiate their practice method and performance environment, we aimed to assess auditory perception in Greek musicians with respect to their musical specialization. If there are differences, this may provide a basis for better shaping auditory training in individuals with auditory processing disorder. The auditory tests administered were speech in noise (Speech in Babble), with and without rhythmic advantage (Word Recognition—Rhythm Component), short-term and working memory (Digit Span - Forward and Backwards), temporal resolution (Gaps In Noise) and detection of frequency discrimination threshold (DFL). Groups consisted of classical musicians, Byzantine chanters, percussionists, and non-musicians (12 participants/group). Statistical analysis revealed significant difference in: (i) word recognition in noise with precursor synchronized pulse between classical musicians compared to Byzantine musicians, (ii) better frequency discrimination of Byzantine musicians compared to non-musicians for the 2000Hz region and (iii) working memory, an advantage detected in musicians. Considering all the above, we conclude that musicians have a superior auditory perception, regardless of musical specialization. Musical training enhances elements of auditory processing and may be used as an additional rehabilitation during auditory training, focusing on specific types of music for specific auditory processing deficits.

Music Training and Nonmusical Abilities

Article

Sep 2023

Music training is generally assumed to improve perceptual and cognitive abilities. Although correlational data highlight positive associations, experimental results are inconclusive, raising questions about causality. Does music training have far-transfer effects, or do preexisting factors determine who takes music lessons? All behavior reflects genetic and environmental influences, but differences in emphasis—nature versus nurture—have been a source of tension throughout the history of psychology. After reviewing the recent literature, we conclude that the evidence that music training causes nonmusical benefits is weak or nonexistent, and that researchers routinely overemphasize contributions from experience while neglecting those from nature. The literature is also largely exploratory rather than theory driven. It fails to explain mechanistically how music-training effects could occur and ignores evidence that far transfer is rare. Instead of focusing on elusive perceptual or cognitive benefits, we argue that it is more fruitful to examine the social-emotional effects of engaging with music, particularly in groups, and that music-based interventions may be effective mainly for clinical or atypical populations. Expected final online publication date for the Annual Review of Psychology, Volume 75 is January 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.

Individual differences in nonnative lexical tone perception: Effects of tone language repertoire and musical experience

Article

Full-text available

Sep 2022

This study sought to understand the effects of tone language repertoire and musical experience on nonnative lexical tone perception and production. Thirty-one participants completed a tone discrimination task, an imitation task, and a musical abilities task. Results showed that a larger tone language repertoire and musical experience both enhanced tone discrimination performance. However, the effects were not additive, as musical experience was associated with tone discrimination performance for single-tone language speakers, but such association was not seen for dual-tone language speakers. Furthermore, among single-tone language speakers, but not among dual-tone language speakers, musical experience and musical aptitude positively correlated with tone discrimination accuracy. It is thus concluded that individuals with varying extents of tone language experience may adopt different strategies when performing tone discrimination tasks; single-tone language speakers may draw on their musical expertise while dual-tone language speakers may rely on their extensive tone language experience instead.

What Is "Music" in Music-to-Language Transfer? Musical Ability But Not Musicianship Supports Cantonese Listeners' English Stress Perception

Article

Full-text available

Sep 2022
J SPEECH LANG HEAR R

William Choi

Purpose: This study investigates how Cantonese language experience influences the potential effects of (i) musicianship and (ii) musical ability on English stress perception. Method: The sample contained 124 participants, evenly split into Cantonese musician, Cantonese non-musician, English musician, and English non-musician groups. They completed the English stress discrimination task, English stress sequence recall task, Musical Ear Test, and non-verbal intelligence task. Following the musicianship-based analysis, 44 Cantonese and English listeners were re-assigned to four groups based on their musical ability—Cantonese high musical ability, Cantonese low musical ability, English high musical ability, and English low musical ability groups. Results: Musicianship-based analysis on English stress perception revealed a significant interaction between musicianship and language. Specifically, musicians outperformed non-musicians only among the English but not the Cantonese listeners. By contrast, ability-based analysis showed significant main effects of musical ability and language. For both Cantonese and English listeners, those with a high musical ability outperformed those with a low musical ability. Regardless of musical ability, Cantonese listeners outperformed English listeners. Correlational analyses yielded consistent findings. Conclusions: This study has found cross-sectional evidence that musical ability, but not musicianship, facilitates Cantonese English as a second language (ESL) listeners’ English stress perception. From a theoretical perspective, the current findings motivate two potential additions to the OPERA hypothesis for music-to-language transfer—unsaturation and utilization. Practically, the findings cast doubt on the application of non-perceptual based instrumental music training to enhance Cantonese ESL learners’ perceptual learning of English stress.

Does Music Training Enhance Auditory and Linguistic Processing? A Systematic Review and Meta-Analysis of Behavioral and Brain Evidence

Article

Full-text available

Jul 2022
NEUROSCI BIOBEHAV R

It is often claimed that music training improves auditory and linguistic skills. Results of individual studies are mixed, however, and most evidence is correlational, precluding inferences of causation. Here, we evaluated data from 62 longitudinal studies that examined whether music training programs affect behavioral and brain measures of auditory and linguistic processing (N = 3928). For the behavioral data, a multivariate meta-analysis revealed a small positive effect of music training on both auditory and linguistic measures, regardless of the type of assignment (random vs. non-random), training (instrumental vs. non-instrumental), and control group (active vs. passive). The trim-and-fill method provided suggestive evidence of publication bias, but meta-regression methods (PET-PEESE) did not. For the brain data, a narrative synthesis also documented benefits of music training, namely for measures of auditory processing and of speech and prosody processing. Thus, the available literature provides evidence that music training produces small neurobehavioral enhancements in auditory and linguistic processing, although future studies are needed to confirm that such enhancements are not due to publication bias.

Prosody training benefits in perception vs. production skills in simultaneous interpreting: An experimental study

Article

Full-text available

Oct 2021

Mahmood Yenkimaleki

The present study investigates the prosody training benefits for interpreter trainees in perception vs. production skills in simultaneous interpreting. Two groups of student interpreters were formed. Participants were assigned to groups at random. The control group received routine instruction in interpreting skills. The experimental group spent 20 minutes less time per session on the routine curriculum and instead received awareness training on prosodic features of English. The total instruction time was the same for the students in two groups, i.e., 15 hours. Students then took a posttest in interpretation skills. The results showed that the experimental group performed better than the control group in simultaneous interpretation performance. Moreover, the study revealed that prosody training enhances the students' perception skills more than that of the production skills. These results have pedagogical implications for curriculum designers, interpreter training programs, and all who are involved in language study and pedagogy.

Towards a Native OPERA Hypothesis: Musicianship and English Stress Perception

Article

Full-text available

Oct 2021

William Choi

Musical experience facilitates speech perception. French musicians, to whom stress is foreign, have been found to perceive English stress more accurately than French non-musicians. This study investigated whether this musical advantage also applies to native listeners. English musicians and non-musicians completed an English stress discrimination task and two control tasks. With age, non-verbal intelligence and short-term memory controlled, the musicians exhibited a perceptual advantage relative to the non-musicians. This perceptual advantage was equally potent to both trochaic and iambic stress patterns. In terms of perceptual strategy, the two groups showed differential use of acoustic cues for iambic but not trochaic stress. Collectively, the results could be taken to suggest that musical experience enhances stress discrimination even among native listeners. Remarkably, this musical advantage is highly consistent and does not particularly favour either stress pattern. For iambic stress, the musical advantage appears to stem from the differential use of acoustic cues by musicians. For trochaic stress, the musical advantage may be rooted in enhanced durational sensitivity.

Applying a Newly Learned Second Language Dimension to the Unknown: The Influence of Second Language Mandarin Tones on the Naïve Perception of Thai Tones

Article

Full-text available

Nov 2020
Psychol Lang Comm

This study investigates whether L2 Mandarin learners can generalize experience with Mandarin tones to unfamiliar tones (i.e., Thai). Three language groups – L1 English/ L2 Mandarin learners (n=18), L1 Mandarin speakers (n=30), L1 monolingual English speakers (n=23) – were tested on the perception of unfamiliar Thai tones on ABX tasks. L2 Mandarin learners and L1 Mandarin speakers perceived Thai tones more accurately than L1 English non-learners. Mandarin learners L1 speakers showed priming on Mandarin tones on a lexical decision task with repetition priming, suggesting L2 tones had been encoded within lexical representations of L2 Mandarin words. However, results must be interpreted cautiously, with an absence of expected priming and presence of unexpected priming. In sum, learners can transfer L2 tone experience to unfamiliar tones, expanding the Feature Hypothesis (McAllister, Flege, & Piske, 2002) to include L2 influence as well. In addition, results indicate a potential disconnect between perception and encoding. Keywords: tone, third language, perception, phonolexical encoding Address

Musicianship Enhances Perception But Not Feeling of Emotion From Others’ Social Interaction Through Speech Prosody

Article

Full-text available

Apr 2020
MUSIC PERCEPT

Music expertise has been shown to enhance emotion recognition from speech prosody. Yet, it is currently unclear whether music training enhances the recognition of emotions through other communicative modalities such as vision and whether it enhances the feeling of such emotions. Musicians and nonmusicians were presented with visual, auditory, and audiovisual clips consisting of the biological motion and speech prosody of two agents interacting. Participants judged as quickly as possible whether the expressed emotion was happiness or anger, and subsequently indicated whether they also felt the emotion they had perceived. Measures of accuracy and reaction time were collected from the emotion recognition judgements, while yes/no responses were collected as indication of felt emotions. Musicians were more accurate than nonmusicians at recognizing emotion in the auditory-only condition, but not in the visual-only or audiovisual conditions. Although music training enhanced recognition of emotion through sound, it did not affect the felt emotion. These findings indicate that emotional processing in music and language may use overlapping but also divergent resources, or that some aspects of emotional processing are less responsive to music training than others. Hence music training may be an effective rehabilitative device for interpreting others’ emotion through speech.

Prosody facilitates learning the word order in a new language

Article

Apr 2021

One of the prominent ideas developed by Jacques Mehler and his colleagues was that perceptual tuning, present from birth on, enables infants, and language learners in general, to extract regularities from speech input. Here we discuss language learners'' ability to extract basic word order (VO or OV) structure from prosodic regularities in a language. The two are closely related: in phonological phrases of VO languages, the most prominent word is the rightmost one, and in OV languages, it is the leftmost one. In speech, this prominence is realized as extended duration, or as elevated pitch, sometimes combined with changes in intensity. When learning the first (L1) or the second language (L2), exposure to relevant rhythmic structure elicits implicit learning about syntactic structure, including the basic word order. However, it remains unclear whether triggering the learning process requires a certain level of familiarity with the relevant rhythm. It is moreover unknown whether prosodic information can help L2 learners to extract and learn the vocabulary of a new language. We tested Spanish- and Italian-speaking adults' ability to learn words from an artificial language with either non-native OV or native VO word order. The results show that learners used prosodic information to identify the most prominent words in short utterances when the artificial language was similar to the native language, with duration-based prominence in prosody and a VO word order. In contrast, when the artificial language had a non-native prominence marked by pitch alternations and an OV word order, prominent words were learned only after a three-day exposure to the relevant rhythmic structure. Thus, for adult L2 learners, only repeated exposure to the relevant prosody elicited learning new words from an unknown language with non-native prosodic marking, indicating that, with familiarity, prosodic cues can facilitate learning in L2.

Musicians show enhanced perception, but not production, of native lexical tones

Article

Dec 2020

Many studies have reported a musical advantage in perceiving lexical tones among non-native listeners, but it is unclear whether this advantage also applies to native listeners, who are likely to show ceiling-like performance and thus mask any potential musical advantage. The ongoing tone merging phenomenon in Hong Kong Cantonese provides a unique opportunity to investigate this as merging tone pairs are reported to be difficult to differentiate even among native listeners. In the present study, native Cantonese musicians and non-musicians were compared based on discrimination and identification of merging Cantonese tone pairs to determine whether a musical advantage in perception will be observed, and if so, whether this is seen on the phonetic and/or phonological level. The tonal space of the subjects' lexical tone production was also compared. Results indicated that the musicians outperformed the non-musicians on the two perceptual tasks, as indexed by a higher accuracy and faster reaction time, particularly on the most difficult tone pair. In the production task, however, there was no group difference in various indices of tonal space. Taken together, musical experience appears to facilitate native listeners' perception, but not production, of lexical tones, which partially supports a music-to-language transfer effect.

Cup! Cup? Cup: Comprehension of intentional prosody in adults and children

Article

Nov 2020
COGNITIVE DEV

Prosody is the way something is spoken. Adults and children regularly use prosody in language comprehension and production, and much of this research focuses solely on emotion or on syntactic interpretation. The current study focuses on comprehension of intentional prosody devoid of semantic information as indicators of different types of speech acts. Adults (n = 72) and preschool children (n = 72) were asked to identify the referent of an isolated label (familiar or nonsense words) based solely on the varying prosodic information. Labels were spoken with intentional prosody (warning, doubting, naming) such that their prosody implied a particular intended referent. Adults and preschoolers selected the intended referent at above chance levels of performance for both familiar and nonsense words and for all three intentions, with adults performing faster and more accurately than children. Overall, this research demonstrates that preschool children and adults can use prosody to successfully determine the intended referent behind words and nonsense words, even if they are in isolation. This finding suggests that prosody alone conveys certain aspects of meaning during the preschool years.

Enduring musician advantage among former musicians in prosodic pitch perception

Abstract and Figures

Recommended publications

Individual differences in nonnative lexical tone perception: Effects of tone language repertoire and...

The impact of musical training and tone language experience on talker identification

Speech perception is similar for musicians and non-musicians across a wide range of conditions

Pitch and Time Processing in Speech and Tones: The Effects of Musical Training and Attention