
Spatiotemporal Eye-Tracking Feature Set for Improved Recognition of Dyslexic Reading Patterns in Children

June 2022
Sensors 22(13):4900


DOI:10.3390/s22134900

License
CC BY 4.0

Authors:

Ivan Vajs

Innovation Center of the School of Electrical Engineering in Belgrade

Ković Vanja

University of Belgrade

Tamara Jakovljevic

Jožef Stefan International Postgraduate School

Andrej Savić

University of Belgrade


Considering the detrimental effects of dyslexia on academic performance and its common occurrence, developing tools for dyslexia detection, monitoring, and treatment poses a task of significant priority. The research performed in this paper was focused on detecting and analyzing dyslexic tendencies in Serbian children based on eye-tracking measures. The group of 30 children (ages 7–13, 15 dyslexic and 15 non-dyslexic) read 13 different text segments on 13 different color configurations. For each text segment, the corresponding eye-tracking trail was recorded and then processed offline and represented by nine conventional features and five newly proposed features. The features were used for dyslexia recognition using several machine learning algorithms: logistic regression, support vector machine, k-nearest neighbor, and random forest. The highest accuracy of 94% was achieved using all the implemented features and leave-one-out subject cross-validation. Afterwards, the most important features for dyslexia detection (representing the complexity of fixation gaze) were used in a statistical analysis of the individual color effects on dyslexic tendencies within the dyslexic group. The statistical analysis has shown that the influence of color has high inter-subject variability. This paper is the first to introduce features that provide clear separability between a dyslexic and control group in the Serbian language (a language with a shallow orthographic system). Furthermore, the proposed features could be used for diagnosing and tracking dyslexia as biomarkers for objective quantification.


Citation: Vajs, I.; Ković, V.; Papić, T.; Savić, A.M.; Janković, M.M. Spatiotemporal Eye-Tracking Feature Set for Improved Recognition of Dyslexic Reading Patterns in Children. Sensors 2022, 22, 4900. https://doi.org/10.3390/s22134900

Academic Editors: Jordi Solé-Casals, César F. Caiafa, Zhe Sun, Pere Marti-Puig and Toshihisa Tanaka

Received: 3 June 2022

Accepted: 27 June 2022

Published: 29 June 2022

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

sensors

Article

Spatiotemporal Eye-Tracking Feature Set for Improved

Recognition of Dyslexic Reading Patterns in Children

Ivan Vajs 1,2,*, Vanja Ković 3, Tamara Papić 4, Andrej M. Savić 1 and Milica M. Janković 1

1 School of Electrical Engineering, University of Belgrade, Bulevar Kralja Aleksandra 73, 11120 Belgrade, Serbia; andrej_savic@etf.rs (A.M.S.); piperski@etf.rs (M.M.J.)

2 Innovation Center, School of Electrical Engineering in Belgrade, Bulevar Kralja Aleksandra 73, 11120 Belgrade, Serbia

3 Faculty of Philosophy, University of Belgrade, Čika-Ljubina 18-20, 11000 Belgrade, Serbia; vanja.kovic@f.bg.ac.rs

4 Faculty of Technical Sciences, University Singidunum, Danijelova 32, 11000 Belgrade, Serbia; tpapic@singidunum.ac.rs

* Correspondence: ivan.vajs@ic.etf.bg.ac.rs; Tel.: +381-11-3218-455


Keywords: developmental dyslexia; reading; screening; colored background; eye-tracking; feature extraction; machine learning; support vector machine; k-nearest neighbors; random forest; logistic regression

1. Introduction

Individual differences in learning to read originate from biological and environmental factors, which shape the development of the brain systems involved in the reading process [ ]. Dyslexia, a specific learning disorder with impairments in reading [ ], refers to a pattern of learning difficulties characterized by problems with accurate or fluent word recognition, poor decoding, and poor spelling abilities. Up to 20% of the general population may exhibit some degree of these difficulties [ ], while about 7% of people are affected heavily enough to qualify for a dyslexia diagnosis [ ]. Due to its nature, dyslexia is typically diagnosed only after children have started to learn to read, when it becomes evident that they are struggling to keep up with their peers [ ]. At this point, the pupils with dyslexia are already at risk of falling behind because reading is essential for school achievement in most subjects. Moreover, children with poor reading skills are also at an increased risk of social, emotional, and mental health problems, such as school dropout, attempted suicide, incarceration, anxiety, depression, and low self-concept [ ]. Therefore, it would be invaluable if children with dyslexia or at risk for dyslexia could be identified and involved in prevention and treatment programs as early as possible.

The diagnosis of dyslexia, especially at its early stages, has proven to be a complex task, largely because of the lack of a strict procedure for dyslexia screening [ ]. A tool that can objectively quantify certain dyslexic tendencies is therefore important for making the diagnostic process as objective and reliable as possible [8].

Recently, Carioti et al. have shown that in developmental dyslexia research (published from 2013 to 2018), 67.4% of studies were performed on languages that could be considered to have a deep orthographic system [ ]. Considering this, performing dyslexia research on languages that have a shallow orthographic system is important, not only because of their underrepresentation but also because reading in a shallow orthographic system is easier, making dyslexia even harder to diagnose. Dyslexia diagnosis is thus a challenging task in the Serbian language, which has one-to-one grapheme-phoneme pairs and belongs to the group of languages with a shallow orthography.

In this paper, a study performed on native dyslexic and non-dyslexic Serbian speakers is presented. Novel spatiotemporal eye-tracking features were introduced, and the classification results using various machine learning (ML) algorithms were compared with the results obtained using conventional eye-tracking features. The difference between the subject classes (dyslexic and non-dyslexic) was analyzed using statistical tests for different color configurations in order to examine the influence of the color configuration of the reading material on subject class separability. Statistical analysis was also performed within the dyslexic subject group in order to analyze the influence of color configuration on reading performance and to determine whether a given color could influence the eye movement features in a manner indicating facilitation or aggravation of the reading task in subjects with dyslexia.

The contributions of the performed research are as follows:

• Development of a novel feature set for describing and quantifying dyslexic tendencies in the Serbian language;

• Statistical and classification analysis, showing the potential of the proposed features to be used as indicators of dyslexic tendencies;

• An analysis of the influence of colored backgrounds and overlays on reading patterns using a selection of the proposed features that have been shown to be the most indicative of dyslexic reading patterns.

2. Related Work

Dyslexia is diagnosed by tests that include reading and writing assessments, among other evaluations, and are standardized by experts on a large number of subjects [ ]. The advancement of technology has made the digitalization of these tests possible, and it has also contributed to the objectivity of the testing, as certain quantifiable metrics can be obtained from digitalized dyslexia tests [12–14].

Different screening methodologies can be performed to distinguish dyslexic from non-dyslexic subjects. Brain-imaging methodology most prominently focuses on functional magnetic resonance imaging during reading [ ] and diffusion tensor imaging [ ], which show, respectively, the functional and morphological differences between the dyslexic and the control group. Brain activity can be monitored using electroencephalography (EEG) as well, either on its own [ – ] or in combination with other biometric signals, such as heart rate, electrodermal activity (EDA), and eye tracking [21–24].

The analysis of reading and eye movement patterns is often performed in dyslexia research. Temelturk et al. in [ ] performed a systematic review of 25 papers that include binocular eye-tracking during linguistic and non-linguistic tasks in children aged 5–17 years with dyslexia and with typical development. The review aimed to combine the knowledge from the existing literature that observed the binocular coordination in children with dyslexia by describing the normative development of stable binocular control. The findings of the review indicate clearly that there is poor binocular coordination in children with dyslexia but that the results associated with different task characteristics were not as consistent. Another study focused on detecting dyslexia based on reading patterns was presented by Wang et al. in [ ]. A neural network was developed to predict whether or not the subject had developmental dyslexia, based on the data gathered from 399 Chinese children. The dataset included children aged 7–13, 187 with dyslexia and 212 controls. The authors report an achieved accuracy of 94%, claiming that the reading accuracy was the feature that contributed most strongly to detecting dyslexia, but the phonological awareness, the accuracy rate of pseudo characters, the morphological awareness, the reading fluency, the rapid digit naming, and the reaction times of noncharacters made important contributions to the classification as well.

Eye tracking is often used in the practical diagnosis of dyslexia as it provides a direct

insight into the visual sampling strategy. The eye movements of subjects with dyslexia

show an erratic gaze pattern that can be quantitatively described by features and used for

further development of the algorithms for automatic dyslexia recognition [27].

Rello et al. in [ ] claim to be the first to attempt classifying dyslexia based on eye-tracking features using machine learning. The language of the text used in the experiment was Spanish, and 97 subjects were included (48 with dyslexia), with the subject age ranging from 11 to 54. Each subject read 12 different texts, each presented in a different font type, on white paper with black letters. A support vector machine (SVM) classifier was implemented, and the features used as inputs were the age of the participant, mean duration and the total number of fixations, total reading time, etc. The model was evaluated using 10-fold cross-validation, and an accuracy of 80.18% was achieved.

A study with a larger number of participants and a more in-depth feature analysis was performed in [ ]. The data were gathered from 185 subjects (97 with dyslexia), with ages ranging from 9 to 10, who read a single text written in the Swedish language. The text was presented on white paper with black letters, and a total of 168 eye-tracking features were considered. The features were derived from both version and vergence [ ], the regressive and progressive movements, the saccades, the fixations, the duration of the event, the distance spanning the event, the accumulated distance of an event, the accumulated distance over all subsequent positions, etc. Considering the large number of features, a recursive feature elimination (RFE) algorithm was implemented to reduce the number of features. An SVM classifier was used, and it was evaluated using 10-fold cross-validation, which was repeated 100 times to ensure the stability of results in terms of dataset splitting. The highest achieved accuracy was 95.6%, and it indicates that a large number of subjects in combination with a wide range of observed features enables a reliable classification. This paper also effectively performed subject-wise evaluation, where the data from a given subject are either in the training or the test set, creating an evaluation scenario similar to a real use case [31].

Prabha et al. [ ] analyzed the dataset introduced in [ ] using several ML algorithms. Only the features extracted from fixations, in combination with an RFE feature selection algorithm, were used for the classification. The authors implemented an SVM classifier (with four different kernel configurations), a k-nearest neighbors (KNN), and a random forest (RF) algorithm and achieved the highest accuracy of 95% by KNN. In their further work [ ], Prabha et al. focused on analyzing the same dataset, but with new ML algorithms, such as particle swarm optimization (PSO)-based SVM hybrid kernel (hybrid SVM–PSO), SVM, RF, logistic regression (LR), and KNN. They also observed features extracted from both saccades and fixations and obtained an accuracy of 95.6% with the hybrid SVM–PSO model. Prabha et al. also focused on observing eye-tracking feature sets and several other ML algorithms in their work performed on the same dataset [ ], obtaining similar results, although a slightly higher accuracy of 96% in [ ] using a hybrid SVM–PSO model.


A study including 69 children (32 with dyslexia) was conducted in [ ]. The children were aged 8.5–12.5 and read two text paragraphs in Greek. The authors implemented several ML algorithms (KNN, SVM, and naïve Bayes) and observed a wide range of eye-tracking features. The best accuracy of 97% was achieved using only three features: saccade length, the number of short forward movements, and the number of repeatedly fixated words.

A holistic approach for dyslexia detection based on a convolutional neural network (CNN) was implemented in [ ]. The authors used the dataset from [ ], but rather than extracting features, they used gaze coordinate data as a direct input to the CNN and implemented several padding algorithms to make the data sequences the same length. The achieved accuracy of 96.6% (obtained with a modified cross-validation evaluation) shows that, given the right data encoding, deep learning algorithms can provide very reliable dyslexia detection based on eye movement data.

Weiss et al. [ ] analyzed the lateralization of early orthographic processing during natural reading in subjects with dyslexia. The authors recorded the eye-tracking and EEG activity of 24 subjects with dyslexia (mean age 24.8) and 24 control subjects (mean age 23) during the reading of isolated sentences in their native (Hungarian) language, with various spacing between letters. The statistical analysis of the EEG and the eye-tracking parameters yielded several findings. Increased spacing between letters was shown to reduce the silent reading speed in both subject groups, in contrast to the beneficial effects on oral reading found in previous work. Furthermore, the authors found that the early left-hemispheric lateralization of orthographic processing during natural reading depends on the rank of fixations, that it is most prominent when reading with the default letter spacing in control readers, and that it deteriorates in subjects with dyslexia.

The detection of developmental dyslexia using machine learning and eye movement data was performed in [ ]. The authors observed a group of 165 subjects with an average age of 12.5. Of the chosen subjects, 30 met the criteria for a reading disorder based on the 10th worst percentile of the reading fluency performance score, which was used to label them as dyslexic. The language used in the reading experiment was Finnish (the subjects’ native language), and a variety of eye-tracking features were observed. An RF algorithm was used for feature ranking, and an SVM was used for subject classification based on the selected features. An overall accuracy of 89.7% was achieved using five-fold cross-validation.

El Hmimdi et al. [ ] performed research on predicting a dyslexia diagnosis as well as reading speed from eye movement data in both reading and non-reading tasks. The authors used eye movement measures from four different setups, gathered from 46 dyslexic subjects (average age 15.5) and 41 control subjects (average age 14.8), recruited from schools in Paris. A vergence test, a saccade test, and two reading tests were performed by each subject, and several eye-tracking measures were derived from the obtained data. Based on the obtained features, a variety of ML algorithms were implemented, and the findings showed an accuracy of 81.25% when using the data from the reading tests and 81.25% and 77.3% accuracy from the two non-reading tests, respectively. The prediction of reading speed was also performed on each of the feature sets from the two reading tests and two non-reading tests, showing that the reading speed can be predicted more accurately from one non-reading task than from the two reading tasks.

Vajs et al. [

] presented a CNN solution for dyslexia detection based on the VGG16

neural network architecture. The eye-tracking data were gathered from 30 subjects (ages

ranging from 7–13), 15 with dyslexia, and 15 controls. The subjects read the text in their

native language (Serbian) on different colored backgrounds and overlays, and the raw eye-

tracking data were segmented, visualized, and used in the form of colored images as inputs

to the CNN model. The model was evaluated using leave-one-out subject cross-validation,

and an accuracy of 87% was achieved.


3. Materials and Methods

3.1. Dataset and Experiment Description

The data analysis in this paper was conducted on the dataset described in our previous research [ ]. The data were gathered from 30 subjects, 15 diagnosed with dyslexia and 15 control subjects (age: 7–13, gender: 19 female, 11 male), during a study approved by the ethical committee of the Psychology Department of the University of Niš (a branch of the Serbian Psychology Association), experimental procedure No. 9/2019. The subjects could withdraw from the test at any time. In consultation with a certified speech therapist, the subjects with dyslexia were selected from several elementary schools in Belgrade. The control subjects were selected randomly from three elementary schools in Belgrade. The age range in the group with dyslexia was 7–13 years (average age 9.93), of which 4 were male and 11 were female. The age range in the group without dyslexia was 7–13 years (average age 9.67), of which 7 were male and 8 were female. All of the children in the sample, both dyslexic and non-dyslexic, had normal (or corrected to normal) vision. The children took part in the study in the morning hours during the regular school schedule.

During the experiment, the children were alone in an isolated, quiet, and bright room with the experimenter, sitting on a chair at a table in front of a computer monitor and keyboard. The screen size was 48 cm × 27 cm, the brightness was set to 90%, and the distance from the screen was 62 cm; these settings were the same for all the participants. Additionally, a chin-rest was used so that the position of the head and eyes relative to the monitor was the same. During the experiment, each subject read 13 segments of the text extracted from a standardized story for elementary school called “Saint Sava and the villager without happiness”. At the beginning of the experiment, the subjects were instructed to read the text silently from the stimuli presentation shown and to press the space button for the next slide of the stimuli presentation. The experiment was run applying pseudo-randomization of the color background/overlay order, always starting with a reference slide (black text on white background). No other color was fixed/related to a certain text. In this way, any factors other than the actual color (paragraph complexity such as vocabulary, syntax, etc., as well as semantic/affective content) would be averaged out. The text was prepared and presented within the SMI Experiment Center software 3.7, keeping the same size/font for each slide, centrally presented with approximately the same length. All the colors (color shades) used for designing the slides (stimuli) were defined within the RGB color model, and each individual color was expressed as an RGB triplet ([R,G,B]), where the value of each additive primary color component can vary from 0 to 255. A list of background shades in the slides with colored backgrounds (and black text) with the associated numerical values of their RGB triplet is stated in Section 2.3 Experiment Design of our previous study [ ]. An example of the test boards used in the experiment is attached in the Supplementary Materials. The reading of each text segment, for one subject, will be called “a trial” in the further text; 30 subjects were included in this study, each with 13 trials (390 trials in total). The trials with insufficient focus on the displayed text (reading time less than 5 s) were excluded, resulting in a total of 378 trials used for further analysis.

Several biometric parameters of the subjects were monitored during the reading task using a multimodal sensor hub [ ]. The hub performed heart rate monitoring, EEG, EDA monitoring, and eye tracking. This study, however, was focused on the eye-tracking aspect of dyslexia, recorded by an SMI RED-m 120 Hz portable remote eye tracker (iMotions, Copenhagen, Denmark). Eye-tracker calibration was conducted in the SMI BeGaze software 3.7 (SensoMotoric Instruments, Teltow, Germany), and the experiment could be initiated only if the calibration had been successfully conducted. The acceptable accuracy for the 5-point calibration and validation was 0.5 degrees for both axes. Data validation in the form of visual inspection was performed immediately after each recording session, using the BeGaze software. Each trial was characterized by 3 data sequences, one representing the $x$ coordinates of the gaze, another representing the $y$ coordinates of the gaze, and the final one representing the event status signal of the recording (indicating the following events: fixation, saccade, blink/missing data).
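The viewing geometry reported in this section (62 cm viewing distance, 48 cm × 27 cm screen, 0.5 degree calibration tolerance) can be related to on-screen distances with basic trigonometry. The following is only an illustrative sketch: the function names are hypothetical, the gaze is assumed to be centered on the screen, and no pixel resolution is used because the text does not state one.

```python
import math

VIEWING_DISTANCE_CM = 62.0  # reported distance from the screen
SCREEN_WIDTH_CM = 48.0      # reported screen width
SCREEN_HEIGHT_CM = 27.0     # reported screen height


def angle_to_screen_cm(angle_deg, distance_cm=VIEWING_DISTANCE_CM):
    """On-screen length subtended by a visual angle centered on the line of sight."""
    return 2.0 * distance_cm * math.tan(math.radians(angle_deg) / 2.0)


def screen_extent_deg(size_cm, distance_cm=VIEWING_DISTANCE_CM):
    """Visual angle subtended by a screen dimension centered on the line of sight."""
    return 2.0 * math.degrees(math.atan(size_cm / (2.0 * distance_cm)))


if __name__ == "__main__":
    print("0.5 deg tolerance on screen [cm]:", angle_to_screen_cm(0.5))
    print("horizontal extent [deg]:", screen_extent_deg(SCREEN_WIDTH_CM))
    print("vertical extent [deg]:", screen_extent_deg(SCREEN_HEIGHT_CM))
```

Under these assumptions, the 0.5 degree tolerance corresponds to roughly half a centimeter on the screen, which gives a sense of scale for the gaze coordinates analyzed later.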


3.2. Data Visualization and Feature Extraction

An original visualization technique was implemented so that the gaze data could be analyzed more easily and displayed in a more intuitive manner. The $x$ and $y$ gaze coordinates of a given trial are plotted in an $x$–$y$ plane, following several rules:

• the color of the gaze line plotted between points $p_{k-1} = (x_{k-1}, y_{k-1})$ and $p_k = (x_k, y_k)$ is calculated based on the distance between points $p_{k-1}$ and $p_k$, using a jet color map (the color map covers the line length range from 0 to 200 pixels, where 200 pixels is the maximal length of a saccade in the experiment, excluding saccades that occur between two lines of text);

• lines that connect the gaze points that belong to fixations are drawn solid, while the lines that connect the points belonging to saccades are dashed;

• the last recorded gaze coordinates before and after a detected blink state are marked with red stars;

• the opacity of the line connecting the gaze points decreases over the course of the trial (time $t$) according to the following equation

$$\mathrm{opacity}_t = 0.9 - 0.8 \cdot \min\left(1, \frac{t}{MRT \times T_s}\right) \qquad (1)$$

where $T_s$ represents the sampling frequency ($T_s = 60$ Hz), and opacity is expressed on a scale from 1 (completely opaque) to 0 (completely transparent). The opacity is calculated so that it linearly decreases over time, up to the $MRT$, which represents the maximum reading time in this study (40 s).
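Equation (1) and the length-based coloring rule can be expressed in a few lines. This is a minimal sketch, not the authors' implementation: it assumes that $t$ is a sample index (so that $t / (MRT \times T_s)$ is dimensionless), and the function names are hypothetical.

```python
import math

MRT_SECONDS = 40.0      # maximum reading time stated in the text
SAMPLING_HZ = 60.0      # T_s as stated alongside Equation (1)
MAX_LENGTH_PX = 200.0   # upper end of the color-map range stated in the text


def line_opacity(sample_index, mrt_seconds=MRT_SECONDS, sampling_hz=SAMPLING_HZ):
    """Equation (1): opacity falls linearly from 0.9 and is held at 0.1 after MRT."""
    progress = min(1.0, sample_index / (mrt_seconds * sampling_hz))
    return 0.9 - 0.8 * progress


def normalized_length(p_prev, p_curr, max_length_px=MAX_LENGTH_PX):
    """Segment length mapped to [0, 1], ready to be passed to a jet-like color map."""
    length = math.hypot(p_curr[0] - p_prev[0], p_curr[1] - p_prev[1])
    return min(length, max_length_px) / max_length_px


if __name__ == "__main__":
    for index in (0, 1200, 2400, 3000):
        print(index, line_opacity(index))
```

With these assumptions the first gaze segment is drawn at opacity 0.9 and every segment recorded after 40 s is drawn at opacity 0.1, so late parts of a trial remain faintly visible rather than disappearing.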

An example of trial visualization is given in Figure 1.


Figure 1. Trial visualization example for (A) a control subject and (B) a dyslexic subject. The color

of the line represents the line length in pixels according to the presented color scale and the red

stars represent blink events.


By analyzing the visualized trials, the global tendencies of the dyslexic subjects could be observed, which were then quantified by signal features.

The nine conventional eye-tracking metrics used as features for classification were: Fixation count, Saccade count, Fixation frequency, Saccade frequency, Fixation average duration, Saccade average duration, Fixation total duration, Saccade total duration, and Total reading time [42].
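The nine conventional metrics can be derived from the per-sample event status signal by collapsing consecutive identical samples into events. The sketch below is a hedged illustration: the event labels ("fixation", "saccade", "blink"), the function name, and the definition of frequency as event count divided by total reading time are assumptions, since the text defers the exact definitions to the cited reference.

```python
from itertools import groupby


def conventional_features(events, sampling_hz=60.0):
    """Nine conventional metrics from a per-sample event-status sequence."""
    # Collapse consecutive identical samples into (label, duration in seconds) runs.
    runs = [(label, len(list(group)) / sampling_hz) for label, group in groupby(events)]
    fixations = [duration for label, duration in runs if label == "fixation"]
    saccades = [duration for label, duration in runs if label == "saccade"]
    total_time = len(events) / sampling_hz

    def average(durations):
        return sum(durations) / len(durations) if durations else 0.0

    def per_second(count):
        # Assumed definition: number of events per second of total reading time.
        return count / total_time if total_time else 0.0

    return {
        "fixation_count": len(fixations),
        "saccade_count": len(saccades),
        "fixation_frequency": per_second(len(fixations)),
        "saccade_frequency": per_second(len(saccades)),
        "fixation_average_duration": average(fixations),
        "saccade_average_duration": average(saccades),
        "fixation_total_duration": sum(fixations),
        "saccade_total_duration": sum(saccades),
        "total_reading_time": total_time,
    }
```

For example, calling `conventional_features(["fixation"] * 6 + ["saccade"] * 2 + ["fixation"] * 4, sampling_hz=10.0)` treats the sequence as two fixations separated by one saccade.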

Aside from the commonly analyzed eye-tracking metrics, three new spatial features were introduced, as well as two new temporal ones. The three spatial features are related to fixation events and do not directly rely on the fixation duration or number of fixations. Rather, they are focused on quantifying the irregularity and complexity of the gaze during fixation events in the $x$–$y$ coordinate plane. The first proposed spatial feature is called the Fixation intersection coefficient (FIC), and it is calculated per trial as

$$FIC = \frac{1}{N} \sum_{j=1}^{N} FI_j \qquad (2)$$

where $N$ represents the number of fixations in a trial, and $FI_j$ represents the number of self-intersections of the lines belonging to the fixation $j$ in the $x$–$y$ plane. This feature was introduced because a higher number of self-intersections of gaze lines during fixations was observed in dyslexic subjects when compared to the control ones. The second spatial feature is called Fixation intersection variability, and it represents the standard deviation of the previously described $FI_j$ array. This was introduced because the number of self-intersections in fixation gaze lines varied more within a single trial for dyslexic subjects when compared to the control subjects. The third spatial feature is called the Fixation fractal dimension (FFD), and it is calculated per trial as

$$FFD = \frac{1}{N} \sum_{j=1}^{N} FD_j \qquad (3)$$

where $FD_j$ represents the fractal dimension of the figure created by the lines belonging to the fixation $j$ in the $x$–$y$ plane, estimated by the box-counting method [ ]. This feature was introduced to directly quantify the complexity of the fixation gaze lines.
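The three spatial features can be sketched as follows. This is an illustration under stated assumptions rather than the authors' code: each fixation is assumed to be given as a list of at least two (x, y) gaze points in pixels, only proper crossings between non-adjacent segments are counted, the population standard deviation is used, and the box sizes for the box-counting estimate are hypothetical choices. All function names are introduced here for illustration.

```python
import math
from statistics import mean, pstdev


def _orient(a, b, c):
    """Sign of the cross product (b - a) x (c - a)."""
    return (b[0] - a[0]) * (c[1] - a[1]) - (b[1] - a[1]) * (c[0] - a[0])


def _segments_cross(p1, p2, p3, p4):
    """True when segments p1-p2 and p3-p4 cross at an interior point of both."""
    return (_orient(p3, p4, p1) * _orient(p3, p4, p2) < 0
            and _orient(p1, p2, p3) * _orient(p1, p2, p4) < 0)


def self_intersections(points):
    """FI_j: crossings between non-adjacent segments of one fixation path."""
    segs = list(zip(points[:-1], points[1:]))
    return sum(
        _segments_cross(*segs[i], *segs[j])
        for i in range(len(segs))
        for j in range(i + 2, len(segs))
    )


def box_counting_dimension(points, box_sizes=(2, 4, 8, 16, 32)):
    """FD_j: slope of log(box count) against log(1 / box size)."""
    step = min(box_sizes) / 2.0
    samples = []
    # Sample each segment densely so that every box it passes through is hit.
    for (x0, y0), (x1, y1) in zip(points[:-1], points[1:]):
        n = max(1, math.ceil(math.hypot(x1 - x0, y1 - y0) / step))
        samples += [(x0 + (x1 - x0) * k / n, y0 + (y1 - y0) * k / n)
                    for k in range(n + 1)]
    xs, ys = [], []
    for size in box_sizes:
        boxes = {(math.floor(x / size), math.floor(y / size)) for x, y in samples}
        xs.append(math.log(1.0 / size))
        ys.append(math.log(len(boxes)))
    mx, my = mean(xs), mean(ys)
    # Ordinary least-squares slope of the log-log relation.
    return (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
            / sum((x - mx) ** 2 for x in xs))


def spatial_features(fixation_paths):
    """FIC (Equation (2)), Fixation intersection variability, FFD (Equation (3))."""
    fi = [self_intersections(path) for path in fixation_paths]
    fd = [box_counting_dimension(path) for path in fixation_paths]
    return {"FIC": mean(fi), "FIV": pstdev(fi), "FFD": mean(fd)}
```

A straight gaze path yields a box-counting estimate close to 1, while a path that repeatedly folds back over itself yields more self-intersections and a larger estimate, which is the behavior the features are meant to capture.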

In addition to the spatial features, two temporal features were introduced, named Active reading time and Saccade variability. The Active reading time is calculated as the time spent in the fixation and saccade states, effectively excluding the time spent in the blink state or the intervals where the gaze was not detected. It was introduced with the goal of observing only the time spent actively reading the displayed text. Finally, Saccade variability was calculated as the standard deviation of the time intervals between two succeeding saccades. This feature was introduced by focusing on the observed tendency of the saccadic events to be more equally spaced out in the control subjects, as opposed to the dyslexic ones.

The data visualization and feature extraction were implemented in the Python 3.8.1

environment.
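The two temporal features described above can be sketched from the same per-sample event status signal. The following is a hedged illustration: the event labels and function name are hypothetical, the interval between two succeeding saccades is assumed to be measured from onset to onset, and the population standard deviation is assumed.

```python
from itertools import groupby
from statistics import pstdev


def temporal_features(events, sampling_hz=60.0):
    """Active reading time and Saccade variability for one trial."""
    # Active reading time: samples spent in the fixation or saccade state.
    active_samples = sum(1 for event in events if event in ("fixation", "saccade"))
    active_reading_time = active_samples / sampling_hz

    # Collect the onset time of every saccade event (run of saccade samples).
    onsets, index = [], 0
    for label, group in groupby(events):
        if label == "saccade":
            onsets.append(index / sampling_hz)
        index += len(list(group))

    # Intervals between succeeding saccade onsets; their spread is the feature.
    intervals = [later - earlier for earlier, later in zip(onsets, onsets[1:])]
    saccade_variability = pstdev(intervals) if intervals else 0.0
    return {
        "active_reading_time": active_reading_time,
        "saccade_variability": saccade_variability,
    }
```

Evenly spaced saccades give a variability near zero, whereas irregular spacing increases it, matching the tendency the feature was designed to quantify.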

3.3. Machine Learning and Statistical Analysis

After the feature extraction, each trial was represented by a set of 14 (9 conventional

and 5 proposed) features and its appropriate label (control or dyslexic). The obtained

dataset was used to train four ML algorithms as well as to perform statistical analysis.

The selected ML algorithms were LR, SVM, KNN, and RF. They were implemented in the Python 3.8.1 programming language using the sklearn library [ ]. When training each of the algorithms, the training set was standardized (made to have a mean value of 0 and standard deviation of 1), and the standardization parameters computed on the training set were later applied to the test set. The hyperparameters of the models were kept at their default values from the sklearn library (aside from the probability parameter used in the SVM implementation) and are as follows:

•LR: penalty = l2; C = 1; solver = lbfgs; maximum iteration number = 100;

•SVM: C = 1; kernel = rbf; probability = True;

•KNN: number of neighbors = 5; algorithm = auto; distance = Euclidean;

•RF: number of estimators = 100; criterion for split = Gini impurity; no max depth; max features = √(number of features); using bootstrap.
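The listed settings map onto scikit-learn constructor keywords roughly as follows. This is a sketch, not the authors' code: the dictionary and the `build_models` helper are hypothetical names, `metric="euclidean"` is written out explicitly (it matches the library's default Minkowski distance with p = 2), and `max_features="sqrt"` spells out the square-root rule. Wrapping each classifier in a pipeline with `StandardScaler` is one way to ensure that the standardization parameters come from the training data only.

```python
# Keyword names follow scikit-learn's estimator constructors.
MODEL_PARAMS = {
    "LR": {"penalty": "l2", "C": 1.0, "solver": "lbfgs", "max_iter": 100},
    "SVM": {"C": 1.0, "kernel": "rbf", "probability": True},
    "KNN": {"n_neighbors": 5, "algorithm": "auto", "metric": "euclidean"},
    "RF": {"n_estimators": 100, "criterion": "gini", "max_depth": None,
           "max_features": "sqrt", "bootstrap": True},
}

def build_models():
    """Return one scaler + classifier pipeline per algorithm (requires scikit-learn)."""
    from sklearn.ensemble import RandomForestClassifier
    from sklearn.linear_model import LogisticRegression
    from sklearn.neighbors import KNeighborsClassifier
    from sklearn.pipeline import make_pipeline
    from sklearn.preprocessing import StandardScaler
    from sklearn.svm import SVC

    classes = {"LR": LogisticRegression, "SVM": SVC,
               "KNN": KNeighborsClassifier, "RF": RandomForestClassifier}
    # The scaler is fitted inside fit(), i.e. on the training fold only.
    return {name: make_pipeline(StandardScaler(), classes[name](**params))
            for name, params in MODEL_PARAMS.items()}
```

Calling `build_models()["LR"].fit(X_train, y_train)` would then standardize with the training-set mean and standard deviation and reuse those values when predicting on the test set.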

Each ML algorithm was trained and evaluated for each individual feature (1 input);

for the conventional features (9 inputs); for the proposed features (5 inputs); and for all

the features (14 inputs). The summary of all 17 possible input options for each of the ML

algorithms is given in Table 1. Each of the algorithm and input feature combinations was

evaluated using a subject-wise leave-one-out cross-validation, where the trial data from

each subject belonged to a single fold (30 folds in total), and in each iteration, one fold

was used for testing and the remaining ones for training. The prediction value, label, and

prediction probability for each instance of a test fold were saved and concatenated so that

after the cross-validation was ﬁnished, the evaluation metrics (accuracy, ACC; sensitivity,

Se; speciﬁcity, Sp; F1 score; area under the receiver operating characteristic curve, AUROC)

could be calculated on the entirety of the test folds.
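The evaluation scheme described above (all trials of one subject held out per fold, standardization fitted on the training folds, metrics computed once on the pooled predictions) can be sketched without any external library. The classifier here is a deliberately simple stand-in, a threshold at the midpoint between the class means of a single feature, and is not one of the four algorithms used in the paper; the function names and the toy data are likewise assumptions.

```python
from statistics import mean, pstdev

def leave_one_subject_out(subjects):
    """Yield (train_idx, test_idx); each fold holds out all trials of one subject."""
    for held_out in sorted(set(subjects)):
        train = [i for i, s in enumerate(subjects) if s != held_out]
        test = [i for i, s in enumerate(subjects) if s == held_out]
        yield train, test

def fit_threshold(x, y):
    """Stand-in classifier: threshold at the midpoint between the class means."""
    pos = mean(v for v, lab in zip(x, y) if lab == 1)
    neg = mean(v for v, lab in zip(x, y) if lab == 0)
    return (pos + neg) / 2, pos >= neg  # threshold, whether class 1 lies above it

def cross_validate(x, y, subjects):
    """Pool the predictions of all folds, then compute the metrics once."""
    y_true, y_pred = [], []
    for train, test in leave_one_subject_out(subjects):
        mu = mean(x[i] for i in train)
        sd = pstdev([x[i] for i in train]) or 1.0
        scale = lambda v: (v - mu) / sd  # training-set statistics only
        thr, pos_high = fit_threshold([scale(x[i]) for i in train],
                                      [y[i] for i in train])
        for i in test:
            y_true.append(y[i])
            y_pred.append(int((scale(x[i]) > thr) == pos_high))
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == 1 and p == 1 for t, p in pairs)
    tn = sum(t == 0 and p == 0 for t, p in pairs)
    fp = sum(t == 0 and p == 1 for t, p in pairs)
    fn = sum(t == 1 and p == 0 for t, p in pairs)
    return {"ACC": (tp + tn) / len(pairs), "Se": tp / (tp + fn),
            "Sp": tn / (tn + fp), "F1": 2 * tp / (2 * tp + fp + fn)}

# Toy data: two control and two dyslexic subjects with two trials each (label 1 = dyslexic).
x = [1.0, 1.2, 0.9, 1.1, 3.0, 3.2, 2.9, 3.1]
y = [0, 0, 0, 0, 1, 1, 1, 1]
subjects = ["c1", "c1", "c2", "c2", "d1", "d1", "d2", "d2"]
result = cross_validate(x, y, subjects)
```

Keeping all trials of a subject in the same fold prevents the model from being tested on a reader it has already seen, which would otherwise inflate the reported metrics.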

Table 1. ML algorithm input feature options.

Feature set input options:

1. Conventional features (9 inputs): Fixation count, Fixation total duration, Fixation frequency, Fixation average duration, Saccade count, Saccade total duration, Saccade frequency, Saccade average duration, Total reading time

2. Proposed features (5 inputs): Active reading time, Fixation intersection coefficient, Saccade variability, Fixation intersection variability, Fixation fractal dimension

3. Conventional and Proposed features (14 inputs)

Single feature input options (1 input):

4. Active reading time

5. Fixation intersection coefficient

6. Saccade variability

7. Fixation intersection variability

8. Fixation fractal dimension

9. Fixation count

10. Fixation total duration

11. Fixation frequency

12. Fixation average duration

13. Saccade count

14. Saccade total duration

15. Saccade frequency

16. Saccade average duration

17. Total reading time
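The 17 input options in Table 1 follow mechanically from the two feature lists: three feature sets plus one option per individual feature. A short sketch (variable names are illustrative only) makes the count explicit.

```python
CONVENTIONAL = [
    "Fixation count", "Fixation total duration", "Fixation frequency",
    "Fixation average duration", "Saccade count", "Saccade total duration",
    "Saccade frequency", "Saccade average duration", "Total reading time",
]
PROPOSED = [
    "Active reading time", "Fixation intersection coefficient",
    "Saccade variability", "Fixation intersection variability",
    "Fixation fractal dimension",
]

# Options 1-3: the three feature sets; options 4-17: each feature on its own,
# listed in the same order as in Table 1 (proposed first, then conventional).
input_options = [CONVENTIONAL, PROPOSED, CONVENTIONAL + PROPOSED]
input_options += [[name] for name in PROPOSED + CONVENTIONAL]
```

With four algorithms evaluated on each option, this gives 4 × 17 = 68 algorithm and input combinations in total.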

Other forms of ML evaluation, such as stratiﬁed 5-fold or stratiﬁed 3-fold subject-

wise evaluations, were attempted (a single fold having 30/5 = 6 or 30/3 = 10 subjects,

respectively) but showed a negligible difference in terms of the evaluation metrics when

compared to the leave-one-out method.

Feature ranking was performed in order to sort the features in terms of their importance with regard to dyslexia classification, and it was based on the decrease in impurity in the RF algorithm [45]. The statistical analysis was then performed in the SPSS software (16.0, IBM Corp., New York, NY, USA) for each of the features that were shown to be indicative of dyslexic behavior by the feature ranking. First, the Mann–Whitney test was performed to compare the feature values between the two subject groups (dyslexic and control), separately for each color configuration. Second, the Levene test of homogeneity of variances was performed to compare the dispersion of the observed feature between the two subject groups, again separately for each color configuration. The final part of the statistical analysis included a Wilcoxon signed ranks test performed within the dyslexic subject group, comparing the feature values between different color configurations. This analysis was performed for each pair of color configurations to determine whether a given color configuration was more favorable for the dyslexic subjects.
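To give a sense of the size of the pairwise within-group analysis, the sketch below enumerates all pairs of the 13 color configurations (names taken from the supplementary figure list) and computes the Wilcoxon signed-rank statistic for one pair of paired samples. The paper's analysis was run in SPSS; this pure-Python version only computes the test statistic, not the p-value, and its function name and tie handling (average ranks, zero differences dropped) are assumptions of this example.

```python
from itertools import combinations

CONFIGURATIONS = [
    "white background", "yellow background", "red overlay", "orange background",
    "yellow overlay", "orange overlay", "blue overlay", "purple background",
    "purple overlay", "red background", "turquoise overlay", "blue background",
    "turquoise background",
]

def signed_rank_statistic(a, b):
    """Wilcoxon signed-rank statistic min(W+, W-) for paired samples a and b."""
    diffs = [x - y for x, y in zip(a, b) if x != y]  # zero differences are dropped
    ordered = sorted(diffs, key=abs)
    ranks = [0.0] * len(ordered)
    i = 0
    while i < len(ordered):
        j = i
        while j < len(ordered) and abs(ordered[j]) == abs(ordered[i]):
            j += 1
        for k in range(i, j):
            ranks[k] = (i + 1 + j) / 2  # average of the tied ranks i+1 .. j
        i = j
    w_plus = sum(r for d, r in zip(ordered, ranks) if d > 0)
    w_minus = sum(r for d, r in zip(ordered, ranks) if d < 0)
    return min(w_plus, w_minus)

# Every unordered pair of configurations compared within the dyslexic group.
pairs = list(combinations(CONFIGURATIONS, 2))
```

Thirteen configurations give 78 unordered pairs, so each feature is tested 78 times within the dyslexic group.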

The analysis pipeline performed in this paper is given in Figure 2.


Figure 2. The analysis pipeline. LR—logistic regression; SVM—support vector machine; KNN—k-

nearest neighbors; RF—random forest.

4. Results

The average metrics achieved on the test sets for the four ML algorithms (LR, SVM,

KNN, RF), using three different feature sets (conventional, proposed, and all features) as

inputs, are given in Table 2.


The achieved results show an overall high accuracy and consistently better results when using either the proposed features or all features as inputs than when using the conventional ones. For both the proposed features and all features, the best accuracy (94%) was obtained by the LR algorithm (matched by RF for all features), and it clearly surpassed the best accuracy of 85% obtained for the conventional features by the SVM algorithm.

The average test set accuracy achieved when each individual feature is used as the

ML input is shown in Table 3. The other metrics for single feature evaluation are presented

in Appendix A.

The best accuracy was achieved for the Fixation intersection variability feature. The

second and third best accuracies were achieved for the Fixation intersection coefﬁcient and

the Fixation fractal dimension. The accuracies achieved for these three features for all the

ML algorithms were higher than the accuracies achieved when using all the conventional

features as inputs.

The importance of each individual feature was also ranked using the decrease in

impurity in the RF algorithm [45], and the results are shown in Figure 3.

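Table 2. Feature group classification evaluation metrics. ACC: accuracy; Se: sensitivity; Sp: specificity; AUROC: area under the receiver operating characteristic curve.

| Feature group | Metric | LR | SVM | KNN | RF |
|---|---|---|---|---|---|
| Conventional features | ACC | 0.84 | 0.85 | 0.81 | 0.82 |
| Conventional features | Se | 0.78 | 0.72 | 0.66 | 0.75 |
| Conventional features | Sp | 0.90 | 0.97 | 0.94 | 0.92 |
| Conventional features | F1 score | 0.83 | 0.82 | 0.77 | 0.81 |
| Conventional features | AUROC | 0.88 | 0.89 | 0.87 | 0.86 |
| Proposed features | ACC | 0.94 | 0.93 | 0.88 | 0.93 |
| Proposed features | Se | 0.89 | 0.88 | 0.78 | 0.89 |
| Proposed features | Sp | 0.98 | 0.98 | 0.98 | 0.97 |
| Proposed features | F1 score | 0.93 | 0.93 | 0.86 | 0.93 |
| Proposed features | AUROC | 0.96 | 0.98 | 0.94 | 0.95 |
| All features | ACC | 0.94 | 0.93 | 0.87 | 0.94 |
| All features | Se | 0.89 | 0.87 | 0.75 | 0.86 |
| All features | Sp | 0.98 | 0.98 | 0.98 | 0.97 |
| All features | F1 score | 0.93 | 0.92 | 0.84 | 0.91 |
| All features | AUROC | 0.96 | 0.97 | 0.94 | 0.94 |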
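The metrics in Table 2 are tied to one another through the confusion matrix, which allows a quick consistency check. The sketch below assumes equally sized classes (the paper describes the dataset as balanced, but the exact per-class trial counts are not given in this section) and recomputes accuracy and F1 score from the reported sensitivity and specificity; the function name is illustrative.

```python
def metrics_from_rates(se, sp, n_pos=1.0, n_neg=1.0):
    """Accuracy and F1 score implied by sensitivity, specificity, and class sizes."""
    tp = se * n_pos          # dyslexic trials classified as dyslexic
    fn = n_pos - tp
    tn = sp * n_neg          # control trials classified as control
    fp = n_neg - tn
    acc = (tp + tn) / (n_pos + n_neg)
    f1 = 2 * tp / (2 * tp + fp + fn)
    return acc, f1

# LR with the proposed features: Se = 0.89, Sp = 0.98 (Table 2).
acc, f1 = metrics_from_rates(0.89, 0.98)
```

Under the equal-class-size assumption this gives an accuracy of about 0.935 and an F1 score of about 0.932, in line with the reported 0.94 and 0.93 once the rounding of the inputs is taken into account.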

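Table 3. Classification accuracies for single feature inputs.

| Feature group | Feature | SVM | LR | RF | KNN |
|---|---|---|---|---|---|
| Proposed | Active reading time | 0.78 | 0.75 | 0.74 | 0.76 |
| Proposed | Fixation intersection coefficient | 0.90 | 0.90 | 0.89 | 0.89 |
| Proposed | Saccade variability | 0.74 | 0.74 | 0.76 | 0.73 |
| Proposed | Fixation intersection variability | 0.91 | 0.90 | 0.91 | 0.91 |
| Proposed | Fixation fractal dimension | 0.89 | 0.90 | 0.89 | 0.89 |
| Conventional | Fixation count | 0.84 | 0.85 | 0.84 | 0.84 |
| Conventional | Fixation total duration | 0.78 | 0.74 | 0.77 | 0.76 |
| Conventional | Fixation frequency | 0.35 | 0.30 | 0.52 | 0.63 |
| Conventional | Fixation average duration | 0.46 | 0.49 | 0.48 | 0.63 |
| Conventional | Saccade count | 0.81 | 0.81 | 0.83 | 0.82 |
| Conventional | Saccade total duration | 0.78 | 0.74 | 0.76 | 0.76 |
| Conventional | Saccade frequency | 0.57 | 0.47 | 0.63 | 0.57 |
| Conventional | Saccade average duration | 0.48 | 0.56 | 0.60 | 0.56 |
| Conventional | Total reading time | 0.80 | 0.77 | 0.74 | 0.75 |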


Figure 3. Feature importance of the eye-tracking features based on the decrease in the impurity of

the random forest algorithm.

The feature importance ranking indicates that the three proposed spatial features

(Fixation intersection coefficient, Fixation fractal dimension, and Fixation intersection variability)

that achieved the highest individual accuracy do indeed contribute to a high classification

accuracy when observed as part of a feature set. Considering this, the three proposed

features were used for further statistical analysis.

The boxplots of the Fixation intersection coefficient, Fixation fractal dimension, and

Fixation intersection variability for each color configuration and each subject group (dyslexic

and control) are shown in Figure 4.



Figure 4. The boxplots of (A) Fixation intersection coefficient, (B) Fixation fractal dimension, (C) Fixation

intersection variability for each color configuration and two subject groups (dyslexic and control).


The boxplots show that there is a clear difference between the dyslexic and control classes for each color configuration (the control group has much lower feature values than the dyslexic group). This was confirmed by the statistical analysis. For the three most important features, for each color configuration, a statistically significant difference was found between the subject classes (p < 0.001) using the Mann–Whitney test. Furthermore, the Levene test showed a statistically significant difference in dispersion between the subject groups for each of the three fixation complexity features (Fixation intersection coefficient, Fixation fractal dimension, Fixation intersection variability) for every color configuration (p < 0.01). In other words, the feature values differ significantly between the groups, and for each color configuration the values of the dyslexic group are considerably more dispersed than those of the control group.

In order to determine whether there was a color that had a more positive influence

on dyslexic subjects (the color that would produce the lowest feature values, as close as

possible to the values of the control group), a statistical analysis was performed within

the dyslexic subject group, comparing each pair of color configurations. The Wilcoxon

signed ranks test showed that there was a statistically significant difference (p < 0.01) only

for three pairs of color configurations and only for a single feature (Fixation fractal

dimension): (1) yellow overlay and orange overlay, (2) orange background and yellow

background, and (3) turquoise background and yellow background. The visualization of

the configuration pairs for which there was a statistically significant difference, as well as

for three arbitrary configurations for which there was no significant difference, is shown

in Figure 5.

Figure 5. The visualization of data for all dyslexic subjects, for the three color configurations that

(A) show a statistically significant difference and (B) show no statistical difference. Dots represent

the background color configurations, and circles represent the overlay color configurations.


5. Discussion

In this paper, several ML algorithms and statistical tests were applied with the goal of analyzing the dyslexic tendencies in a group of 30 children (15 dyslexic and 15 control). The text was written in the subjects' native language, Serbian, which has a perfect matching between letters and phonemes. Considering that dyslexia detection in such languages (the ones with a shallow orthographic system) is often quite difficult, the accuracy of 94% achieved on the balanced dataset used in this paper (F1 score 0.93 and AUROC 0.96) (Table 2) is a promising result, comparable to the ones reported in the literature [29,30,32,33,35–37,39–41], which were obtained for languages with deeper orthographic systems. As the Serbian language has a shallow orthographic system, making dyslexia harder to diagnose, we consider the observed subject pool relevant for the performed research purposes for a language such as Serbian. Although the number of participants in this study is lower than in the subject groups found in the literature [ ], the total number of trials used (378 trials, explained in Section 3.1) provided enough data for the performed type of machine learning analysis.

The three most important features (Fixation intersection coefficient, Fixation fractal dimension, and Fixation intersection variability, Figure 3), which describe the fixation gaze complexity, achieved a high accuracy (89% or higher, Table 3) even when used as the single input feature for the ML algorithms. Feature design and data interpretation proved to be important, as a single spatial feature describing fixation gaze complexity achieved a better accuracy (91% for Fixation intersection variability, Table 3) than all of the observed conventional features combined (85%). It is important to note that the fixation complexity features clearly have lower values for the control subjects and higher values for the dyslexic ones. The fixation complexity features, and consequently the gaze pattern complexity, could therefore be considered an indication of the reading difficulties observed in dyslexic subjects.

The proposed features should also be of use in dyslexia analysis for languages besides Serbian, as struggling to focus on words could yield similar chaotic fixation movements in other languages. The drawback of the features is that they require a sufficiently high sampling frequency and eye-tracker precision, as the characterization of fixations used in this work relies on detecting fine eye movements. The field of view of the reader can also influence the quality of the features, as reading from a greater or shorter distance from the screen or paper could change the number of words that fall within a single focus point. This can, in turn, make the chaotic movement of the gaze either harder to detect or perhaps more saccadic, which might influence the separability of the classes.

The statistical analysis showed that the spatial features provide clear class separability

regardless of color conﬁguration, as seen in Figure 4. The statistical differences between the

subject groups for all the color conﬁgurations show that a single color cannot be used to

make reading easier, to the degree that the dyslexic and control groups are not separable.

The comparison between color configurations for dyslexic subjects shows that there could be color configurations that are more favorable than others. However, the analysis within the dyslexic group showed a statistically significant difference only between three pairs of configurations, as seen in Figure 5, indicating that none of the colors universally makes reading easier or harder when compared to the other ones. The lack of a consistently superior configuration indicates that the colors have a different effect on each subject and that, in order to make reading easier for children with dyslexia, an individualized approach would most likely be the best solution. The same conclusion could be reached by observing the statistical analysis between subject groups, as the statistical significance was prominent for each color configuration, indicating that none of the colors stands out in the sense of making dyslexic and control subjects more similar in their reading patterns.

6. Conclusions

The paper introduced a novel spatiotemporal feature set for recognition of gaze pat-

terns in dyslexic native Serbian speakers. The proposed feature set has shown a signiﬁcant

classiﬁcation improvement in comparison to conventional eye-tracking features (94% vs.

85%). The statistical analysis between subject classes (dyslexic and control) found high

class separability, independent of color conﬁguration. A statistical analysis related to the

color impact on reading performance was accomplished within the dyslexic subject group

and showed high inter-subject variability.

The performed study was limited by the number of participants and by the usage

of a high-precision eye tracker. However, the obtained results are promising in the ﬁeld

of dyslexia detection, and further work could include an introduction of the features

measured from other sensor systems (including low-cost systems), analyzing a larger

number of subjects, or a subject base of broader age distribution. Analyzing the data from

different eye trackers and combining the obtained dataset with other datasets (possibly in

different languages) would also be of interest for future work.

Supplementary Materials: The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/s22134900/s1, Figure S1: White background test board example; Figure S2: Yellow background test board example; Figure S3: Red overlay test board example; Figure S4: Orange background test board example; Figure S5: Yellow overlay test board example; Figure S6: Orange overlay test board example; Figure S7: Blue overlay test board example; Figure S8: Purple background test board example; Figure S9: Purple overlay test board example; Figure S10: Red background test board example; Figure S11: Turquoise overlay test board example; Figure S12: Blue background test board example; Figure S13: Turquoise background test board example.

Author Contributions:

Conceptualization, I.V., V.K., T.P., A.M.S. and M.M.J.; methodology, I.V., V.K.,

A.M.S. and M.M.J.; software, I.V.; formal analysis, I.V., V.K. and M.M.J.; data acquisition, T.P. and

M.M.J.; resources, V.K., T.P. and M.M.J.; data curation, I.V., V.K., T.P., A.M.S. and M.M.J.; writing—

original draft preparation, I.V., V.K. and M.M.J.; writing—review and editing, I.V., V.K., T.P., A.M.S.

and M.M.J.; visualization, I.V.; project administration, M.M.J. All authors have read and agreed to the

published version of the manuscript.

Funding:

This research was supported by the Ministry of Education, Science and Technology Develop-

ment of Serbia, Belgrade, Serbia (contracts 451-03-68/2022-14/200103 and 451-03-68/2022-14/200223).


Institutional Review Board Statement:

The study was conducted in accordance with the Declaration

of Helsinki, and approved by the ethical committee of the Psychology Department of the Univer-

sity of Niš (a branch of the Serbian Psychology Association), experimental procedure No. 9/2019

(04.09.2019).

Informed Consent Statement:

Informed consent was obtained from all subjects involved in the study.

Conﬂicts of Interest: The authors declare no conﬂict of interest.

Appendix A

Table A1. Sensitivity for single feature inputs.

Feature

ML Algorithm

SVM LR RF KNN

Proposed

Active reading time 0.59 0.65 0.60 0.61

Fixation intersection coefﬁcient 0.85 0.84 0.83 0.84

Saccade variability 0.54 0.61 0.62 0.58

Fixation intersection variability 0.85 0.82 0.85 0.85

Fixation fractal dimension 0.84 0.87 0.83 0.84

Conventional

Fixation count 0.74 0.78 0.76 0.77

Fixation total duration 0.59 0.64 0.60 0.59

Fixation frequency 0.31 0.29 0.48 0.50

Fixation average duration 0.32 0.33 0.43 0.50

Saccade count 0.74 0.75 0.82 0.83

Saccade total duration 0.59 0.64 0.61 0.59

Saccade frequency 0.44 0.46 0.42 0.45

Saccade average duration 0.28 0.46 0.44 0.44

Total reading time 0.61 0.65 0.59 0.62

Table A2. Speciﬁcity for single feature inputs.

Feature

ML Algorithm

SVM LR RF KNN

Proposed

Active reading time 0.95 0.83 0.89 0.90

Fixation intersection coefﬁcient 0.95 0.96 0.93 0.94

Saccade variability 0.94 0.85 0.87 0.86

Fixation intersection variability 0.96 0.97 0.94 0.96

Fixation fractal dimension 0.93 0.92 0.93 0.93

Conventional

Fixation count 0.93 0.91 0.91 0.91

Fixation total duration 0.95 0.83 0.92 0.91

Fixation frequency 0.39 0.34 0.59 0.76

Fixation average duration 0.60 0.64 0.60 0.77

Saccade count 0.87 0.85 0.82 0.83

Saccade total duration 0.95 0.83 0.89 0.91

Saccade frequency 0.70 0.47 0.73 0.70

Saccade average duration 0.69 0.66 0.81 0.70

Total reading time 0.97 0.86 0.83 0.87

Table A3. F1 score for single feature inputs.

Feature

ML Algorithm

SVM LR RF KNN

Proposed

Active reading time 0.72 0.71 0.70 0.71

Fixation intersection coefﬁcient 0.89 0.89 0.87 0.88

Saccade variability 0.67 0.69 0.70 0.68

Fixation intersection variability 0.90 0.89 0.89 0.90

Fixation fractal dimension 0.88 0.89 0.87 0.88


Conventional

Fixation count 0.81 0.83 0.82 0.83

Fixation total duration 0.72 0.71 0.71 0.70

Fixation frequency 0.32 0.28 0.50 0.57

Fixation average duration 0.37 0.39 0.47 0.58

Saccade count 0.78 0.79 0.82 0.83

Saccade total duration 0.72 0.71 0.71 0.70

Saccade frequency 0.50 0.46 0.50 0.51

Saccade average duration 0.34 0.51 0.53 0.50

Total reading time 0.74 0.73 0.67 0.71

Table A4. Area under the receiver operating characteristic curve for single feature inputs.

Feature

ML Algorithm

SVM LR RF KNN

Proposed

Active reading time 0.67 0.79 0.73 0.75

Fixation intersection coefﬁcient 0.94 0.95 0.92 0.94

Saccade variability 0.74 0.73 0.77 0.77

Fixation intersection variability 0.94 0.95 0.93 0.94

Fixation fractal dimension 0.93 0.96 0.92 0.94

Conventional

Fixation count 0.87 0.89 0.86 0.87

Fixation total duration 0.67 0.78 0.72 0.72

Fixation frequency 0.37 0.32 0.59 0.61

Fixation average duration 0.51 0.38 0.58 0.62

Saccade count 0.85 0.87 0.85 0.87

Saccade total duration 0.67 0.78 0.72 0.72

Saccade frequency 0.53 0.34 0.59 0.58

Saccade average duration 0.40 0.54 0.60 0.58

Total reading time 0.68 0.79 0.73 0.76

References

1. Hulme, C.; Snowling, M.J. Learning to Read: What We Know and What We Need to Understand Better. Child Dev. Perspect. 2013, 7, 1–5. [CrossRef] [PubMed]
2. Snowling, M.J.; Hulme, C.; Nation, K. Defining and Understanding Dyslexia: Past, Present and Future. Oxf. Rev. Educ. 2020, 46, 501–513. [CrossRef] [PubMed]
3. Wagner, R.K.; Zirps, F.A.; Edwards, A.A.; Wood, S.G.; Joyner, R.E.; Becker, B.J.; Liu, G.; Beal, B. The Prevalence of Dyslexia: A New Approach to Its Estimation. J. Learn. Disabil. 2020, 53, 354–365. [CrossRef] [PubMed]
4. Peterson, R.L.; Pennington, B.F. Developmental Dyslexia. Lancet 2012, 379, 1997–2007. [CrossRef]
5. Christo, C.; Davis, J.M.; Brock, S.E. Identifying, Assessing, and Treating Dyslexia at School; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2009; ISBN 0387886001.
6. Huc-Chabrolle, M.; Barthez, M.-A.; Tripi, G.; Barthélémy, C.; Bonnet-Brilhault, F. Psychocognitive and psychiatric disorders associated with developmental dyslexia: A clinical and scientific issue. Encephale 2010, 36, 172–179. [CrossRef]
7. Rice, M.; Gilson, C.B. Dyslexia Identification: Tackling Current Issues in Schools. Interv. Sch. Clin. 2022, 10534512221081278. [CrossRef]
8. Eikerling, M.; Secco, M.; Marchesi, G.; Guasti, M.T.; Vona, F.; Garzotto, F.; Lorusso, M.L. Remote Dyslexia Screening for Bilingual Children. Multimodal Technol. Interact. 2022, 6, 7. [CrossRef]
9. Carioti, D.; Masia, M.F.; Travellini, S.; Berlingeri, M. Orthographic Depth and Developmental Dyslexia: A Meta-Analytic Study. Ann. Dyslexia 2021, 71, 399–438. [CrossRef]
10. Roitsch, J.; Watson, S. An Overview of Dyslexia: Definition, Characteristics, Assessment, Identification, and Intervention. Sci. J. Educ. 2019, 7, 86. [CrossRef]
11. Capin, P.; Gillam, S.L.; Fall, A.-M.; Roberts, G.; Dille, J.T.; Gillam, R.B. Understanding the Nature and Severity of Reading Difficulties among Students with Language and Reading Comprehension Difficulties. Ann. Dyslexia 2022, 72, 249–275. [CrossRef]
12. Drigas, A.S.; Politi-Georgousi, S. ICTs as a Distinct Detection Approach for Dyslexia Screening: A Contemporary View. Int. J. Online Biomed. Eng. 2019, 15, 46–60. [CrossRef]

13. Sood, M.R.; Toornstra, A.; Sereno, M.I.; Boland, M.; Filaretti, D.; Sood, A. A Digital App to Aid Detection, Monitoring, and Management of Dyslexia in Young Children (DIMMAND): Protocol for a Digital Health and Education Solution. JMIR Res. Protoc. 2018, 7, e135. [CrossRef] [PubMed]
14. Costa, M.; Zavaleta, J.; da Cruz, S.M.S.; Manhães, M.; Cerceau, R.; Carvalho, L.A.; Mousinho, R. A Computational Approach for Screening Dyslexia. In Proceedings of the 26th IEEE International Symposium on Computer-Based Medical Systems, Porto, Portugal, 20–22 June 2013; pp. 565–566.
15. Lobier, M.A.; Peyrin, C.; Pichat, C.; Le Bas, J.-F.; Valdois, S. Visual Processing of Multiple Elements in the Dyslexic Brain: Evidence for a Superior Parietal Dysfunction. Front. Hum. Neurosci. 2014, 8, 479. [CrossRef] [PubMed]
16. García Chimeno, Y.; García Zapirain, B.; Saralegui Prieto, I.; Fernandez-Ruanova, B. Automatic Classification of Dyslexic Children by Applying Machine Learning to FMRI Images. Biomed. Mater. Eng. 2014, 24, 2995–3002. [CrossRef] [PubMed]
17. Vandermosten, M.; Cuynen, L.; Vanderauwera, J.; Wouters, J.; Ghesquière, P. White Matter Pathways Mediate Parental Effects on Children's Reading Precursors. Brain Lang. 2017, 173, 10–19. [CrossRef] [PubMed]
18. Arns, M.; Peters, S.; Breteler, R.; Verhoeven, L. Different Brain Activation Patterns in Dyslexic Children: Evidence from EEG Power and Coherence Patterns for the Double-Deficit Theory of Dyslexia. J. Integr. Neurosci. 2007, 6, 175–190. [CrossRef] [PubMed]
19. Fraga González, G.; Van der Molen, M.J.W.; Žarić, G.; Bonte, M.; Tijms, J.; Blomert, L.; Stam, C.J.; Van der Molen, M.W. Graph Analysis of EEG Resting State Functional Networks in Dyslexic Readers. Clin. Neurophysiol. 2016, 127, 3165–3175. [CrossRef] [PubMed]
20. Spironelli, C.; Penolazzi, B.; Angrilli, A. Dysfunctional Hemispheric Asymmetry of Theta and Beta EEG Activity during Linguistic Tasks in Developmental Dyslexia. Biol. Psychol. 2008, 77, 123–131. [CrossRef]
21. Jakovljević, T.; Janković, M.M.; Savić, A.M.; Soldatović, I.; Todorović, P.; Jere Jakulin, T.; Papa, G.; Ković, V. The Sensor Hub for Detecting the Developmental Characteristics in Reading in Children on a White vs. Colored Background/Colored Overlays. Sensors 2021, 21, 406. [CrossRef]
22. Jakovljević, T.; Janković, M.M.; Savić, A.M.; Soldatović, I.; Čolić, G.; Jakulin, T.J.; Papa, G.; Ković, V. The Relation between Physiological Parameters and Colour Modifications in Text Background and Overlay during Reading in Children with and without Dyslexia. Brain Sci. 2021, 11, 539. [CrossRef]
23. Janković, M.M. Biomarker-Based Approaches for Dyslexia Screening: A Review. In Proceedings of the 2022 ZINC IEEE, Novi Sad, Serbia, 25–26 May 2022.
24. Christoforou, C.; Fella, A.; Leppänen, P.H.T.; Georgiou, G.K.; Papadopoulos, T.C. Fixation-Related Potentials in Naming Speed: A Combined EEG and Eye-Tracking Study on Children with Dyslexia. Clin. Neurophysiol. 2021, 132, 2798–2807. [CrossRef] [PubMed]
25. Temelturk, R.D.; Ozer, E. Binocular Coordination of Children with Dyslexia and Typically Developing Children in Linguistic and Non-Linguistic Tasks: Evidence from Eye Movements. Ann. Dyslexia 2022. [CrossRef] [PubMed]
26. Wang, R.; Bi, H.-Y. A Predictive Model for Chinese Children with Developmental Dyslexia—Based on a Genetic Algorithm Optimized Back-Propagation Neural Network. Expert Syst. Appl. 2022, 187, 115949. [CrossRef]
27. Pavlidis, G.T. Eye Movements in Dyslexia: Their Diagnostic Significance. J. Learn. Disabil. 1985, 18, 42–50. [CrossRef]
28. Rello, L.; Ballesteros, M. Detecting Readers with Dyslexia Using Machine Learning with Eye Tracking Measures. In Proceedings of the 12th International Web for All Conference, Florence, Italy, 18–20 May 2015; Association for Computing Machinery: New York, NY, USA, 2015.

29.

Nilsson Benfatto, M.; Öqvist Seimyr, G.; Ygge, J.; Pansell, T.; Rydberg, A.; Jacobson, C. Screening for Dyslexia Using Eye Tracking

during Reading. PLoS ONE 2016,11, e0165508. [CrossRef]

30.

Masson, G.S.; Yang, D.-S.; Miles, F.A. Version and Vergence Eye Movements in Humans: Open-Loop Dynamics Determined by

Monocular Rather than Binocular Image Speed. Vision Res. 2002,42, 2853–2867. [CrossRef]

31.

Saeb, S.; Lonini, L.; Jayaraman, A.; Mohr, D.C.; Kording, K.P. The Need to Approximate the Use-Case in Clinical Machine

Learning. Gigascience 2017,6, gix019. [CrossRef]

32.

Prabha, J.A.; Bhargavi, R.; Harish, B. Predictive Model for Dyslexia from Eye Fixation Events. Int. J. Eng. Adv. Technol.

2019

,9, 20.

33.

Jothi Prabha, A.; Bhargavi, R. Predictive Model for Dyslexia from Fixations and Saccadic Eye Movement Events. Comput. Methods

Programs Biomed. 2020,195, 105538. [CrossRef]

34. Prabha, A.J.; Bhargavi, R.; Harish, B. An Efﬁcient Machine Learning Model for Prediction of Dyslexia from Eye Fixation Events.

New Approaches Eng. Res. 2021,10, 171–179. [CrossRef]

35.

Appadurai, J.P.; Bhargavi, R. Eye Movement Feature Set and Predictive Model for Dyslexia: Feature Set and Predictive Model for

Dyslexia. Int. J. Cogn. Inform. Nat. Intell. 2021,15, 1–22. [CrossRef]

36.

Asvestopoulou, T.; Manousaki, V.; Psistakis, A.; Smyrnakis, I.; Andreadakis, V.; Aslanides, I.M.; Papadopouli, M. Dyslexml:

Screening Tool for Dyslexia Using Machine Learning. arXiv 2019, arXiv:1903.06274.

37.

Nerušil, B.; Polec, J.; Škunda, J.; Kaˇcur, J. Eye Tracking Based Dyslexia Detection Using a Holistic Approach. Sci. Rep.

2021

11, 15687. [CrossRef] [PubMed]

38.

Weiss, B.; Nárai, Á.; Vidnyánszky, Z. Lateralization of Early Orthographic Processing during Natural Reading Is Impaired in

Developmental Dyslexia. Neuroimage 2022,258, 119383. [CrossRef] [PubMed]

39.

Raatikainen, P.; Hautala, J.; Loberg, O.; Kärkkäinen, T.; Leppänen, P.; Nieminen, P. Detection of Developmental Dyslexia with

Machine Learning Using Eye Movement Data. Array 2021,12, 100087. [CrossRef]

Sensors 2022,22, 4900 18 of 18

40.

El Hmimdi, A.E.; Ward, L.M.; Palpanas, T.; Kapoula, Z. Predicting Dyslexia and Reading Speed in Adolescents from Eye

Movements in Reading and Non-Reading Tasks: A Machine Learning Approach. Brain Sci. 2021,11, 1337. [CrossRef]

41.

Vajs, I.; Kovi´c, V.; Papi´c, T.; Savi´c, A.M.; Jankovi´c, M.M. Dyslexia Detection in Children Using Eye Tracking Data Based

on VGG16 Network. In Proceedings of the 30th European Signal Processing Conference (EUSIPCO), Belgrade, Serbia,

29 August–2 September 2022.

42.

Lai, M.-L.; Tsai, M.-J.; Yang, F.-Y.; Hsu, C.-Y.; Liu, T.-C.; Lee, S.W.-Y.; Lee, M.-H.; Chiou, G.-L.; Liang, J.-C.; Tsai, C.-C. A Review of

Using Eye-Tracking Technology in Exploring Learning from 2000 to 2012. Educ. Res. Rev. 2013,10, 90–115. [CrossRef]

43. Theiler, J. Estimating Fractal Dimension. J. Opt. Soc. Am. A 1990,7, 1055–1073. [CrossRef]

44.

Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.;

et al. Scikit-Learn: Machine Learning in Python. J. Mach. Learn. Res. 2011,12, 2825–2830.

45. Nembrini, S.; König, I.R.; Wright, M.N. The Revival of the Gini Importance? Bioinformatics 2018,34, 3711–3718. [CrossRef]

