Conference Paper

Evaluation of the Potential of Automatic Naming Latency Detection for Different Initial Phonemes during Picture Naming Task

Abstract

Naming latency (NL) is the time from the presentation of an image to speech onset. We recently developed an extended threshold-based algorithm for automatic NL (aNL) detection that operates on the envelope of the speech wave. The present study explores how the manner (e.g., "m" vs. "p") and position (e.g., "t" vs. "p") of articulation of the initial phoneme influence the differences between manual NL (mNL) and aNL detection.

Speech samples were collected from 123 healthy participants, who named 118 pictures in German whose names begin with different initial phonemes. NLs were determined manually (in Praat, from waveform and spectrogram) and automatically (with the developed algorithm). To assess the accuracy of automatic detection, correlations between mNLs and aNLs were analyzed for different initial phonemes.

aNLs and mNLs showed a strong positive correlation and similar tendencies across initial phoneme groups. Mean aNL values were shorter than mean mNL values. The difference was largest for nasals (e.g., /m/) and smallest for fricatives (e.g., /s/). However, for fricatives, 39% of NLs were detected later by the automatic than by the manual method, which reduced the mean difference from mNLs. The signal energy of the initial phoneme, i.e., whether it is voiced or voiceless, influences the shape of the speech envelope: high initial signal energy is often responsible for early detection by the algorithm.

Our study provides evidence of a similar tendency in mNL and aNL across positions of articulation within each initial phoneme group. aNL detection is highly sensitive to speech onsets across different initial phonemes. The dependency of the NL differences on the initial phoneme will lose importance in progress evaluations of patients with aphasia if the relative changes for each picture are considered separately. Nevertheless, the algorithm will be further optimized by adapting its parameters to each initial phoneme group individually.

Clinical Relevance: This underlines the feasibility of using automatic naming latency detection during picture naming to evaluate patients with aphasia in a clinical setting as well as for practice at home.
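
The abstract does not give the algorithm's details, so the following is only a minimal sketch of generic threshold-based onset detection on an amplitude envelope, not the authors' method. The function names, the moving-average envelope, the relative threshold of 10% of the peak, and the 20 ms minimum-duration rule are all assumptions chosen for illustration. A small Pearson correlation helper is included because the study compares mNLs and aNLs by correlation.

```python
import math


def envelope(signal, sample_rate, window_ms=10.0):
    """Amplitude envelope as a moving average of the rectified signal (assumed choice)."""
    win = max(1, int(sample_rate * window_ms / 1000.0))
    rectified = [abs(x) for x in signal]
    env = []
    running = 0.0
    for i, value in enumerate(rectified):
        running += value
        if i >= win:
            running -= rectified[i - win]  # drop the sample leaving the window
        env.append(running / min(i + 1, win))
    return env


def detect_onset_ms(signal, sample_rate, rel_threshold=0.1, min_duration_ms=20.0):
    """Return the onset time in ms, or None if no onset is found.

    The onset is the first sample at which the envelope rises above
    rel_threshold * peak and stays there for at least min_duration_ms.
    Both parameters are hypothetical defaults, not values from the paper.
    """
    env = envelope(signal, sample_rate)
    peak = max(env) if env else 0.0
    if peak == 0.0:
        return None
    threshold = rel_threshold * peak
    min_len = max(1, int(sample_rate * min_duration_ms / 1000.0))
    run_start = None
    for i, value in enumerate(env):
        if value >= threshold:
            if run_start is None:
                run_start = i
            if i - run_start + 1 >= min_len:
                return 1000.0 * run_start / sample_rate
        else:
            run_start = None  # a dip below threshold resets the candidate onset
    return None


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def synthetic_trial(sample_rate, silence_s, tone_s, freq_hz=200.0):
    """Synthetic stand-in for a recording: silence followed by a sine tone."""
    silence = [0.0] * int(sample_rate * silence_s)
    tone = [math.sin(2.0 * math.pi * freq_hz * k / sample_rate)
            for k in range(int(sample_rate * tone_s))]
    return silence + tone


if __name__ == "__main__":
    trial = synthetic_trial(8000, 0.5, 0.3)
    # The tone starts at 500 ms, so the detected onset should be close to that.
    print(detect_onset_ms(trial, 8000))
```

With a fixed threshold of this kind, a high-energy voiced onset crosses the threshold almost immediately, while a low-energy onset crosses it later. That is consistent with the dependence on the initial phoneme reported above, and it is one reason per-group parameters (here, `rel_threshold` and `min_duration_ms`) might be tuned separately.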
