Figure 1 - uploaded by Rajesh P. N. Rao
Content may be subject to copyright.
Measuring working memory load with an electroencephalograph. Shot taken from experiment. dynamically adapt and support users’ goals [4]. Since we have traditionally interacted with computers through our physical bodies, most of these techniques have been based on observations of user actions and behavior (e.g. [23]). Less frequently, other techniques have utilized physiological signals as indicators of user state [14,21]. While these measures have been reasonably successful, they are rather indirect, especially when the user state in question is of a cognitive nature. Fortunately, advances in cognitive neuro- science and brain-sensing technologies provide us with the ability to interface more directly with the human brain. This is possible through the use of sensors that monitor the electrical and chemical changes within the brain that correspond with certain forms of thought. While using these technologies in HCI research has been previously articulated [12,28], we believe there is an opportunity to further explore practical issues with their use in HCI applications. In our work, we explore using one of these technologies, an electroencephalograph (EEG), to estimate or classify working memory load, or the cognitive effort dedicated to hold- ing information in the mind for short periods of time while performing a cognitive task [1]. Working memory has been shown to be a key component of cognitive load, and is a 

Measuring working memory load with an electroencephalograph. Shot taken from experiment. dynamically adapt and support users’ goals [4]. Since we have traditionally interacted with computers through our physical bodies, most of these techniques have been based on observations of user actions and behavior (e.g. [23]). Less frequently, other techniques have utilized physiological signals as indicators of user state [14,21]. While these measures have been reasonably successful, they are rather indirect, especially when the user state in question is of a cognitive nature. Fortunately, advances in cognitive neuro- science and brain-sensing technologies provide us with the ability to interface more directly with the human brain. This is possible through the use of sensors that monitor the electrical and chemical changes within the brain that correspond with certain forms of thought. While using these technologies in HCI research has been previously articulated [12,28], we believe there is an opportunity to further explore practical issues with their use in HCI applications. In our work, we explore using one of these technologies, an electroencephalograph (EEG), to estimate or classify working memory load, or the cognitive effort dedicated to hold- ing information in the mind for short periods of time while performing a cognitive task [1]. Working memory has been shown to be a key component of cognitive load, and is a 

Source publication
Conference Paper
Full-text available
A reliable and unobtrusive measurement of working mem- ory load could be used to evaluate the efficacy of interfaces and to provide real-time user-state information to adaptive systems. In this paper, we describe an experiment we con- ducted to explore some of the issues around using an elec- troencephalograph (EEG) for classifying working memory l...

Context in source publication

Context 1
... reliable and unobtrusive measurement of working memory load could be used to evaluate the efficacy of interfaces and to provide real-time user-state information to adaptive systems. In this paper, we describe an experiment we conducted to explore some of the issues around using an electroencephalograph (EEG) for classifying working memory load. Within this experiment, we present our classification methodology, including a novel feature selection scheme that seems to alleviate the need for complex drift modeling and artifact rejection. We demonstrate classification accuracies of up to 99% for 2 memory load levels and up to 88% for 4 levels. We also present results suggesting that we can do this with shorter windows, much less training data, and a smaller number of EEG channels, than reported previously. Finally, we show results suggesting that the models we construct transfer across variants of the task, implying some level of generality. We believe these findings extend prior work and bring us a step closer to the use of such technologies in HCI research. Author Keywords: Brain-Computer Interface (BCI), electroencephalogram (EEG), cognitive load, memory load, machine-learning, feature selection, classification. ACM Classification Keywords : H.1.2 [User/Machine Systems]; H.5.2 [User Interfaces]: Input devices and strate- gies; B.4.2 [Input/Output Devices]: Channels and control- lers; J.3 [Life and Medical Sciences]. Human-computer interaction (HCI) researchers continually work on techniques that allow us to measure user states such as cognitive and memory workload, task engagement, surprise, satisfaction, or frustration. Such measures are useful not only for evaluating the efficacy of interfaces, but also for providing real-time information to systems that reasonable measure of how hard a user is working to solve a problem or use an interface. For example, working memory load has long been recognized in HCI to be an important indicator of potential errors as well as a predictive feature of procedural skill acquisition [3]. Given this evidence, interface designers often try to minimize the working memory load required to perform a task, and reliable real-time measures would benefit them greatly. While various researchers have worked on classifying working memory with EEG (e.g. [6,7,20,21,27]), previous work has typically relied on costly equipment and techniques that make it difficult for non-EEG-experts to replicate and use this work. Additionally, this work has often required experimenters to collect large amounts of classifier training data (sometimes on the order of days), a process that is often prohibitively expensive. While we believe that EEG is complementary to many of the other measures of memory and cognitive load, it is outside the scope of this paper to explore the detailed relationships between these measures. We leave this for future work. The contributions of this paper are three-fold: • First, we present our methodology within an experiment we ran to measure working memory load using only EEG signals. The innovation within this methodology is an automatic feature selection scheme that eliminates the need for procedures used in most previous work, such as complex device and physiological drift modeling as well as manual artifact rejection. • Second, using this methodology, we present classification results using machine learning techniques that replicate and extend prior work in the area. Specifically, we show classification accuracies of up to 99.0% between two load levels, and up to 88.0% between four levels, all with just 8 channels of EEG data. More importantly, we present results showing how classification accuracy varies with different temporal window sizes, amounts of training, and number of EEG channels. Specifically, the results suggest that our techniques allow us to attain accurate classification with less lag, much less training data, and simpler equipment. • Third, we show how our models work across variants of the memory task, providing encouraging evidence that it might be possible to develop canonical training tasks and to perform general classification of memory load. In this paper, we use an Electroencephalograph (EEG), a sensing technology that uses electrodes placed on the scalp to measure electrical potentials related to brain activity (see Figure 1). Each electrode typically consists of a wire leading to a conductive disk that is electrically connected to the scalp using conductive paste or gel. The EEG device re- cords the voltage at each of these electrodes relative to a reference point, which is often another electrode on the scalp. Because EEG is a non-invasive, passive measuring device, it is safe for extended and repeated use, a character- istic crucial for adoption in HCI research. Additionally, it does not require a highly skilled operator or medical procedure to use. For more information about electrical signals generated by the brain as well as EEG, see [5]. The signal provided by an EEG is, at best, a crude representation of brain activity due to the nature of the detector. Scalp electrodes are only sensitive to macroscopic and co- ordinated firing of large groups of neurons near the surface of the brain, and then only when they are directed along a perpendicular vector relative to the scalp. Additionally, because of the fluid, bone, and skin that separate the electrodes from the actual electrical activity, the already small signals are scattered and attenuated before reaching the electrodes. EEG data is typically analyzed by looking at the spectral power of the signal in a set of frequency bands, which have been observed to correspond with certain types of neural activity [5]. These frequency bands are commonly defined as 1-4 Hz (delta), 4-8 Hz (theta), 8-12 Hz (alpha), 12-20 Hz (beta-low), 20-30 Hz (beta-high), and >30 Hz (gamma). Early researchers observed the sensitivity of EEG to changes in mental effort. For example, Hans Berger [2] and others [11] report observing a decrease in the amplitude of the alpha (8-12 Hz) rhythm during mental arithmetic tasks Other researchers have shown that higher memory loads cause increases in theta (4-8 Hz) and low-beta (12-15 Hz) power in the frontal midline regions of the scalp [17], gamma (>30 Hz) oscillations [8], as well as inter-electrode correlation, coherence, cross phase, and cross power [24]. To test if alpha and theta bands were predictive of memory and cognitive loads in real world computing tasks, Smith et al. [27] compared EEG data when task difficulty was manipulated within a multi-attribute task battery (MATB) mul- titasking environment. They report successfully creating a user-specific index of task load, the average values of which increase with increasing task difficulty and differed significantly between the difficulty manipulations. Given this evidence of the existence of reliable indicators of memory load, researchers have attempted to build techniques that utilize these features to measure and classify memory load. Unfortunately, while these indicators may appear to be reliable when data is averaged over large time periods and many users, there is large variability within the signal for any given user at any given point in time. This makes using the features to classify memory loads an extremely difficult task. While it is reasonable to average the data when trying to make statements about the various rhythms, it is less useful when trying to classify user state in real time. For example, Jensen et al. found the increased theta power in only one of their ten subjects, and rather than an alpha decrease, they found that alpha power actually collected data from 8 users over three 6-8 hour sessions and present results showing ~95% classification accuracy between two levels of memory load [7]. They also showed relatively high cross-task and cross-session accuracies. However, subtle decisions made in their procedure leaves room for improvement. First, collecting 24 hours worth of training data from each user can be prohibitively high for some work. Second, they perform a Laplacian spatial en- hancement that requires accurate per-subject head measurements to filter noise from the signal. Third, they manu- ally inspect the data and throw out periods where there are artifacts in the data even after performing an automatic artifact rejection. This is tedious and requires expertise in read- ing the EEG signals. They report throwing away up to 20% of the data, which is not desirable in our targeted settings where data may be scarce. Furthermore, having to perform this manual step between training and classification has implications on real-time usability of the system. Finally, since their design interleaved different tasks, and used random hold out cross validation, they were training on data that was temporally fairly close to test data and we cannot be certain how well the models would generalize when applied to new data. In our work, we aim to replicate their high classification results and extend their work to further explore the space. We also set out to explore how various parameters such as temporal window size, amount of training data, and number of channels affect the classification. These factors are important to understand if EEG classification is to be used in HCI ...

Similar publications

Conference Paper
Most previous neurolinguistic experiments on scalar implicature have focused on the <some,all> scale. We examined the processing of the <maybe,definitely> scale using EEG and MEG. Participants read the word "maybe" in correct contexts, semantically incorrect contexts (where only "definitely not" would have been true), and pragmatically infelicitous...

Citations

... Moreover, EEG is considered a safe method for longterm and repeated use due to its non-invasive nature. Notably, it does not necessitate the presence of a skilled operator or involve any medical procedures [12]. EEG indices have demonstrated a remarkable sensitivity to fluctuations in brain activity. ...
... The study revealed classification success rates ranging from 44% to 72%. Grimes et al. [12] conducted an early study that classified mental workload through n-back tasks, achieving a commendable level of accuracy. By employing the Naive Bayes algorithm, they determined that the two-difficulty level model had an accuracy rate of 99%, while the four-difficulty level model achieved an accuracy rate of 88%. ...
... Considering that cognitive load theory was formulated by taking into account the constraints of working memory capacity, it makes sense to choose task categories that specifically focus on working memory for conducting research on manipulating mental workload. N-back tasks are commonly utilized in research literature due to their reliability in assessing working memory capacity and their validity, which has been demonstrated in numerous studies [12,13,[31][32][33][34][35][36]. These memory tests were also employed in this particular study due to their established validity in previous research. ...
Article
Full-text available
The main contribution of this study was the concurrent application of EEG and eye tracking techniques during n-back tasks as part of the methodology for addressing the problem of mental workload classification through machine learning algorithms. The experiments involved 15 university students, consisting of 7 women and 8 men. Throughout the experiments, the researchers utilized the n-back memory task and the NASA-Task Load Index (TLX) subjective rating scale to assess various levels of mental workload. The results indicating the relationship between EEG and eye tracking measures and mental workload are consistent with previous research. Regarding the four-class classification task, mental workload level could be predicted with 76.59% accuracy using 34 selected features. This study makes a significant contribution to the literature by presenting a four-class mental workload estimation model that utilizes different machine learning algorithms.
... A reliable determination of human systemtic cognitive states, including cognitive 2 workload, sense of urgency, and mind wandering, would be helpful in many task-based 3 settings, and even critical in some (e.g., detecting overloaded humans in safety-critical 4 contexts who will be more prone to errors and oversight due to being overloaded, or 5 detecting mind wandering in situations where human observers might miss important 6 events due to attentional drifts). Being able to assess human systemic cognitive states 7 can also be of great utility for artificial agents operating in human-agent teams in that 8 it would allow artificial agents to adapt their behavior and the teams' task allocation to 9 achieve a better load balance and increase the effectiveness of the whole team (rather 10 than acting in ways that might, for example, increase human workload, see [11]). ...
Preprint
Full-text available
Robust estimation of systemic human cognitive states is critical for a variety of applications, from simply detecting inefficiencies in task assignments, to the adaptation of artificial agents behaviors to improve team performance in mixed-initiative human-machine teams. This study showed that human eye gaze, in particular, the percentage change in pupil size (PCPS), is the most reliable biomarker for assessing three human cognitive states including workload, sense of urgency, and mind wandering compared to electroencephalogram (EEG), functional near-infrared spectroscopy (fNIRS), respiration, and skin conductance. We used comprehensive multi-modal driving dataset to examine the accuracy of signals to assess these cognitive states. We performed comprehensive statistical tests to validate the performance of several physiological signals to determine human cognitive states and demonstrated that PCPS shows noticeably superior performance. We also characterized the link between workload and sense of urgency with eye gaze and observed that consecutive occurrences of higher sense of urgency were prone to increase overall workload. Finally, we trained five machine learning (ML) models and showed that four of them had similar accuracy in cognitive state classification (with one, random forest, showing inferior performance). The results provided evidence that the PCPS is a reliable physiological marker for cognitive state estimation.
... Moreover, EEG is considered a safe method for long-term and repeated use due to its noninvasive nature. Notably, it does not necessitate the presence of a skilled operator or involve any medical procedures (Grimes et al., 2008). EEG indices have demonstrated a remarkable sensitivity to fluctuations in brain activity. ...
... The study revealed classification success rates ranging from 44% to 72%. Grimes et al. (2008) conducted an early study that classified mental workload through n-back tasks, achieving a commendable level of accuracy. By employing the Naive Bayes 5 algorithm, they determined that the 2-difficulty level model had an accuracy rate of 99%, while the 4-difficulty level model achieved an accuracy rate of 88%. ...
... Given that cognitive load theory was developed based on the limitations of working memory capacity, it is logical to select task types that specifically target working memory for experimental studies on mental workload manipulation. N-back tasks are commonly utilized in research literature due to their reliability in assessing working memory capacity and their validity, which has been demonstrated in numerous studies (Grimes et al., 2008;Herff et al., 2014;Ke et al., 2015;Liu et al., 2017;Tjolleng et al., 2017;Harputlu Aksu and Çakıt, 2022)). These memory tests were also employed in this particular study due to their established validity in previous research. ...
Preprint
Full-text available
The objective of this study was to analyze the mental workload using EEG and eye tracking data and classify it using machine learning algorithms. The machine learning model was developed based on the simultaneous recording of eye tracking and EEG measurements during the experimental process. The experiments involved 15 university students, consisting of 7 women and 8 men. Throughout the experiments, the researchers utilized the n-back memory task and the NASA-Task Load Index (TLX) subjective rating scale to assess various levels of mental workload. The findings revealed that as the task difficulty increased, there was an increase in the diameter of both the right and left pupils, the number of fixations, the number and duration of saccades, and the number and duration of blinks. Conversely, variables related to fixation duration decreased. The EEG results indicated that theta power in the prefrontal, frontal, and front central regions increased with task difficulty. Additionally, alpha power increased in the frontal regions but decreased in the temporal, parietal, and occipital regions as the task became more challenging. Furthermore, low beta power significantly decreased in almost all brain regions as the task difficulty increased. In terms of the four-class classification problem, the mental workload level can be predicted with an accuracy rate of 76.59% using 34 selected features. This study has made a significant contribution to the literature by presenting a four-class mental workload estimation model that utilizes different machine learning algorithms.
... In our direct comparison of the high versus lower load conditions, we observed significant differences in the same regions as the LMM as well as differences in frontal lower alpha, centrotemporal and frontal upper alpha, and frontal lower beta bands. Based on prior literature, we hypothesized that we would observe an increase in frontal theta [40,47] and broad beta, as well as a decrease in parietal alpha [16]. However, our results show poor alignment with our hypotheses based on the literature. ...
... Our exploratory analysis of individual channels (5, 4th column) shows an overall decrease in high alpha with stronger statistics in some electrodes, suggesting that our electrode averaging technique may have masked this effect in our data. In all of our analyses, we observed no relationship between cognitive load and theta band power, and prior research has reported this effect to be participant-dependent (Grimes et al. and jensen et al. reported that a few participants show theta peaks [40,47]). ...
... Previous studies pertaining to real-time evaluation of mental workload utilized electroencephalogram (EEG) to monitor changes in brain activity patterns during cognitively demanding task [5][6][7]. In particular, the EEG bands of theta (4)(5)(6)(7)(8) and alpha (8)(9)(10)(11)(12) Hz) reflect mental workload status [8,9]; in motor imagery studies, the difference of left/right hemisphere alpha band powers was found to be associated with the level of attention during the task [10,11]. Most previous EEG studies pertaining to mental workload focused on finding specific EEG features that are associated with cognitive status; however, the changes in brain activity patterns during tasks could be better assessed by investigating the connectivity across various brain regions. ...
... Previous studies pertaining to real-time evaluation of mental workload utilized electroencephalogram (EEG) to monitor changes in brain activity patterns during cognitively demanding task [5][6][7]. In particular, the EEG bands of theta (4)(5)(6)(7)(8) and alpha (8)(9)(10)(11)(12) Hz) reflect mental workload status [8,9]; in motor imagery studies, the difference of left/right hemisphere alpha band powers was found to be associated with the level of attention during the task [10,11]. Most previous EEG studies pertaining to mental workload focused on finding specific EEG features that are associated with cognitive status; however, the changes in brain activity patterns during tasks could be better assessed by investigating the connectivity across various brain regions. ...
... Previous studies pertaining to real-time evaluation of mental workload utilized electroencephalogram (EEG) to monitor changes in brain activity patterns during cognitively demanding task [5][6][7]. In particular, the EEG bands of theta (4)(5)(6)(7)(8) and alpha (8)(9)(10)(11)(12) Hz) reflect mental workload status [8,9]; in motor imagery studies, the difference of left/right hemisphere alpha band powers was found to be associated with the level of attention during the task [10,11]. Most previous EEG studies pertaining to mental workload focused on finding specific EEG features that are associated with cognitive status; however, the changes in brain activity patterns during tasks could be better assessed by investigating the connectivity across various brain regions. ...
Conference Paper
Full-text available
A high cognitive load could significantly impairproblem-solving skills. Electroencephalogram (EEG)-basedreal-time assessment of mental workload is feasible, and graphneural networks (GNN) can classify brain activity patternsduring cognitively demanding tasks with high accuracy.However, previous GNN studies pertaining to mental workloadclassification lack explainability. This study utilized a state-of-the-art GNN variant with GNNexplainer to find relevantconnectivity during mental arithmetic (MA) tasks. In thisendeavor, MA EEG recordings were retrieved from an open-access database. The signals were transformed to graph datathrough the envelope correlation and power spectral density(PSD), and subjected to GNN with hierarchical graph poolingwith a structure learning model to classify MA and baseline(BL). The model accuracy was 85.57 ± 6.27 and 96.26 ± 4.14%for the connectivity dataset and the PSD and the connectivityfeature, respectively. Among the connections between nodesidentified as important by GNNExplainer, two notable edgepatterns were found as 1) from the left centro-parietal region toleft frontal regions, and 2) the frontoparietal connection. Theresults indicate 1) the GNN model performance could beimproved using the connectivity and PSD feature together, and2) characteristic patterns of the connectome and PSD could beimportant for MA classification. The connectivity analysis bythe “explainable” GNN model could be beneficial in future brainactivity pattern studies (PDF) Interpretability of Hybrid Feature Using Graph Neural Networks from Mental Arithmetic Based EEG.
... Since 1998 (Gevins et al., 1998) until today (Asif et al., 2019), several publications have shown that EEG is a viable source of information regarding a person's cognitive load, by achieving classification accuracy of up to 95%. However, these results are highly dependent on the different number of EEG channels used, the amount of train and test data for machine learning (ML), the length and nature of the tasks performed, time-domain or frequencydomain features, and personalized or generalized models (Grimes et al., 2008). Most importantly, very few of the existing experimental paradigms for cognitive load assessment utilize wearable EEG with four channels or less (Katmah et al., 2021), (Ahn et al., 2019), (Fangmeng et al., 2020). ...
... Stimuli: verbal material consisted of auditory and visual stimuli, referring to seven consonants (c, g, k, p, q, t, v) already used in previous experimental protocols [61][62][63][64]. Vowels were excluded to reduce the likeliness of participants developing chunking strategies [65]. To ensure correct perception by UCI and HC groups, we performed a stimuli exposure pretest. ...
Article
Full-text available
This pilot study investigates the neurophysiological patterns of visual and auditory verbal working memory (VWM) in unilateral cochlear implant users (UCIs). We compared the task-related electroencephalogram (EEG) power spectral density of 7- to 13-year-old UCIs (n = 7) with a hearing control group (HC, n = 10) during the execution of a three-level n-back task with auditory and visual verbal (letters) stimuli. Performances improved as memory load decreased regardless of sensory modality (SM) and group factors. Theta EEG activation over the frontal area was proportionally influenced by task level; the left hemisphere (LH) showed greater activation in the gamma band, suggesting lateralization of VWM function regardless of SM. However, HCs showed stronger activation patterns in the LH than UCIs regardless of SM and in the parietal area (PA) during the most challenging audio condition. Linear regressions for gamma activation in the PA suggest the presence of a pattern-supporting auditory VWM only in HCs. Our findings seem to recognize gamma activation in the PA as the signature of effective auditory VWM. These results, although preliminary, highlight this EEG pattern as a possible cause of the variability found in VWM outcomes in deaf children, opening up new possibilities for interdisciplinary research and rehabilitation intervention.
... Cognitive workload is defined as the load imposed on working memory during a cognitive process and is one of the most important factors in designing and implementing robust human-machine systems [10,11]. Assessment of cognitive workload is mainly based on two methods: (a) subjective measurements and (b) objective measurements [2,12]. ...
Article
Full-text available
Robots interacting with humans in assistive contexts have to be sensitive to human cognitive states to be able to provide help when it is needed and not overburden the human when the human is busy. Yet, it is currently still unclear which sensing modality might allow robots to derive the best evidence of human workload. In this work, we analyzed and modeled data from a multi-modal simulated driving study specifically designed to evaluate different levels of cognitive workload induced by various secondary tasks such as dialogue interactions and braking events in addition to the primary driving task. Specifically, we performed statistical analyses of various physiological signals including eye gaze, electroencephalography, and arterial blood pressure from the healthy volunteers and utilized several machine learning methodologies including k-nearest neighbor, naive Bayes, random forest, support-vector machines, and neural network-based models to infer human cognitive workload levels. Our analyses provide evidence for eye gaze being the best physiological indicator of human cognitive workload, even when multiple signals are combined. Specifically, the highest accuracy (in %) of binary workload classification based on eye gaze signals is 80.45 ∓ 3.15 achieved by using support-vector machines, while the highest accuracy combining eye gaze and electroencephalography is only 77.08 ∓ 3.22 achieved by a neural network-based model. Our findings are important for future efforts of real-time workload estimation in the multimodal human-robot interactive systems given that eye gaze is easy to collect and process and less susceptible to noise artifacts compared to other physiological signal modalities.
... Articles HCI evaluation [56], [55], [93], [66], [102], [79], [3], [5], [63], [91], [19], [118], [83], [26], [44], [31], [92], [132], [49], [14], [71], [12], [25], [78], [47], [142], [133], [134], [28], explicit control [35], [72], [119], [145], [151], [101], [99], [114], [51], [52], [33], [150], [98], [106], [76], [90], [70], [69], [103], [39], [84], [65], [95], [36], implicit open loop [121], [131], [108], [58], [140], [122], [2], [107], [7], [109], [27], [73], [112], implicit closed loop [128], [126], [130], [1], [111], [152], [147], [115], [15], neurofeedback [45], [43], [54], [81], [6], mental state assessment [138], [77], [50], [23], [154], [61], [157], [139], [110], [125], [11], [88], [34], [148], [48], [158], [96], other [127], [123], [104], [85], [21], [136], [87], [41], [38], [10], [4], [24], [97], category). Systems which are able to bring all components together show that using brain signals in runtime can yield substantial usability improvements [1,152] or unlock completely novel kinds of applications [103]. ...
Article
In human-computer interaction (HCI), there has been a push towards open science, but to date, this has not happened consistently for HCI research utilizing brain signals due to unclear guidelines to support reuse and reproduction. To understand existing practices in the field, this paper examines 110 publications, exploring domains, applications, modalities, mental states and processes, and more. This analysis reveals variance in how authors report experiments, which creates challenges to understand, reproduce, and build on that research. It then describes an overarching experiment model that provides a formal structure for reporting HCI research with brain signals, including definitions, terminology, categories, and examples for each aspect. Multiple distinct reporting styles were identified through factor analysis and tied to different types of research. The paper concludes with recommendations and discusses future challenges. This creates actionable items from the abstract model and empirical observations to make HCI research with brain signals more reproducible and reusable.
... Within the scope of this study, it was aimed to investigate whether the change in mental workload during n-back memory tasks is related to brain waves, to compare it with the results obtained in the literature before, and to develop a model estimating mental workload based on machine learning algorithms by using EEG data. In the literature, the studies where this topic is addressed as a classification problem show that the problem becomes more difficult as the number of class increases and therefore the performance of the model is reduced (Grimes et al., 2008;Borys et al., 2017). Therefore, it was tried to obtain a model that predicts the mental workload with the highest possible accuracy according to four difficulty levels. ...
... In the same study, based on only EEG features, maximum accuracy of~51% could be reached. Grimes et al. (2008) reported 99% classification accuracy for two classes and 88% for four classes (both results achieved for eight subjects). The study of Grimes et al. (2008) shed light on this study in terms of showing that high classification performance can be achieved with a small number of participants. ...
... Grimes et al. (2008) reported 99% classification accuracy for two classes and 88% for four classes (both results achieved for eight subjects). The study of Grimes et al. (2008) shed light on this study in terms of showing that high classification performance can be achieved with a small number of participants. ...
Conference Paper
Full-text available
Mental workload is related to the difference between the available mental resource capacity of the operator and the mental resource required by the job. To decide the number of tasks assigned to operator and the difficulty levels of those tasks, it is important to know the operator's mental workload. An overload occurs if the amount of resources required by the task exceeds the available capacity of the person. Mental workload analysis helps to recognize the mental fatigue, evaluate the human performance of different level tasks and adjust cognitive sources for safe and efficient human-machine interactions. Excessive levels of mental workload can lead to errors or delays in information processing. Monitoring brain activity has been verified to be sensitive and consistent reflector of mental workload changes. Classification, regression, clustering, anomaly detection, dimensionality reduction, and reward maximization are common machine learning models. Classification of mental workload has critical importance in the domain of human factors and ergonomics. In recent years, with the need to analyze continuous and large-scale data obtained by physiological methods, the use of machine learning algorithms has become widespread in estimating and classifying mental workload. The objectives of the current study were two-fold: (1) to investigate the relationship among EEG features, task difficulty levels and subjective self-assessment (NASA-TLX) scores and (2) to develop machine learning algorithms for classifying mental workload using EEG features. N-back tasks have been commonly used in the literature. In this study, N-back memory tests were performed at four different difficulty levels. As the number of n increases, so does the difficulty of the task. Four participants performed the tests. Seventy EEG features (5 frequency band power for 14 channels) were selected as independent variables. One output variable reflecting the difficulty level of N-Back memory was classified. The machine learning algorithms used in our study were K-Nearest Neighbor (KNN), Support Vector Machine (SVM), Artificial Neural Network (ANN), Random Forest (RF), Gradient Boosting Machine (GBM), Light Gradient Boosting Machine (LightGBM) and Extreme Gradient Boosting (XGBoost) algorithms. As the task difficulty increased, theta activity in prefrontal and frontal regions increased. Especially frontal theta power, parietal and occipital gamma power were significantly correlated to perceived workload scores obtained via NASA-TLX. Prefrontal beta-high activity had a significant negative relationship with self-assessment workload ratings. Prefrontal and frontal theta, prefrontal beta-high, occipital, parietal and temporal gamma and occipital alpha activities were found to be the most effective parameters. The results obtained for the four classes of classification problem reached the accuracy of 68% with EEG features as input and the Random Forest algorithm. In addition, the results obtained for the two classes of classification problem reached the accuracy of 87% with EEG features as input and the GBM algorithm. The results from the analysis indicate that EEG signals play an important role in the classification of mental workload. Another remarkable result was high classification performance of GBM, LightGBM and XGBoost algorithms that have been developed in the recent past and therefore not frequently used in studies on this subject in the literature.