Measuring working memory load with an electroencephalograph. Shot taken from experiment. dynamically adapt and support users’ goals [4]. Since we have traditionally interacted with computers through our physical bodies, most of these techniques have been based on observations of user actions and behavior (e.g. [23]). Less frequently, other techniques have utilized physiological signals as indicators of user state [14,21]. While these measures have been reasonably successful, they are rather indirect, especially when the user state in question is of a cognitive nature. Fortunately, advances in cognitive neuro- science and brain-sensing technologies provide us with the ability to interface more directly with the human brain. This is possible through the use of sensors that monitor the electrical and chemical changes within the brain that correspond with certain forms of thought. While using these technologies in HCI research has been previously articulated [12,28], we believe there is an opportunity to further explore practical issues with their use in HCI applications. In our work, we explore using one of these technologies, an electroencephalograph (EEG), to estimate or classify working memory load, or the cognitive effort dedicated to hold- ing information in the mind for short periods of time while performing a cognitive task [1]. Working memory has been shown to be a key component of cognitive load, and is a

Source publication

Feasibility and Pragmatics of Classifying Working Memory Load with an Electroencephalograph

Conference Paper

Full-text available

Apr 2008

A reliable and unobtrusive measurement of working mem- ory load could be used to evaluate the efficacy of interfaces and to provide real-time user-state information to adaptive systems. In this paper, we describe an experiment we con- ducted to explore some of the issues around using an elec- troencephalograph (EEG) for classifying working memory l...

Context 1

... reliable and unobtrusive measurement of working memory load could be used to evaluate the efficacy of interfaces and to provide real-time user-state information to adaptive systems. In this paper, we describe an experiment we conducted to explore some of the issues around using an electroencephalograph (EEG) for classifying working memory load. Within this experiment, we present our classification methodology, including a novel feature selection scheme that seems to alleviate the need for complex drift modeling and artifact rejection. We demonstrate classification accuracies of up to 99% for 2 memory load levels and up to 88% for 4 levels. We also present results suggesting that we can do this with shorter windows, much less training data, and a smaller number of EEG channels, than reported previously. Finally, we show results suggesting that the models we construct transfer across variants of the task, implying some level of generality. We believe these findings extend prior work and bring us a step closer to the use of such technologies in HCI research. Author Keywords: Brain-Computer Interface (BCI), electroencephalogram (EEG), cognitive load, memory load, machine-learning, feature selection, classification. ACM Classification Keywords : H.1.2 [User/Machine Systems]; H.5.2 [User Interfaces]: Input devices and strate- gies; B.4.2 [Input/Output Devices]: Channels and control- lers; J.3 [Life and Medical Sciences]. Human-computer interaction (HCI) researchers continually work on techniques that allow us to measure user states such as cognitive and memory workload, task engagement, surprise, satisfaction, or frustration. Such measures are useful not only for evaluating the efficacy of interfaces, but also for providing real-time information to systems that reasonable measure of how hard a user is working to solve a problem or use an interface. For example, working memory load has long been recognized in HCI to be an important indicator of potential errors as well as a predictive feature of procedural skill acquisition [3]. Given this evidence, interface designers often try to minimize the working memory load required to perform a task, and reliable real-time measures would benefit them greatly. While various researchers have worked on classifying working memory with EEG (e.g. [6,7,20,21,27]), previous work has typically relied on costly equipment and techniques that make it difficult for non-EEG-experts to replicate and use this work. Additionally, this work has often required experimenters to collect large amounts of classifier training data (sometimes on the order of days), a process that is often prohibitively expensive. While we believe that EEG is complementary to many of the other measures of memory and cognitive load, it is outside the scope of this paper to explore the detailed relationships between these measures. We leave this for future work. The contributions of this paper are three-fold: • First, we present our methodology within an experiment we ran to measure working memory load using only EEG signals. The innovation within this methodology is an automatic feature selection scheme that eliminates the need for procedures used in most previous work, such as complex device and physiological drift modeling as well as manual artifact rejection. • Second, using this methodology, we present classification results using machine learning techniques that replicate and extend prior work in the area. Specifically, we show classification accuracies of up to 99.0% between two load levels, and up to 88.0% between four levels, all with just 8 channels of EEG data. More importantly, we present results showing how classification accuracy varies with different temporal window sizes, amounts of training, and number of EEG channels. Specifically, the results suggest that our techniques allow us to attain accurate classification with less lag, much less training data, and simpler equipment. • Third, we show how our models work across variants of the memory task, providing encouraging evidence that it might be possible to develop canonical training tasks and to perform general classification of memory load. In this paper, we use an Electroencephalograph (EEG), a sensing technology that uses electrodes placed on the scalp to measure electrical potentials related to brain activity (see Figure 1). Each electrode typically consists of a wire leading to a conductive disk that is electrically connected to the scalp using conductive paste or gel. The EEG device re- cords the voltage at each of these electrodes relative to a reference point, which is often another electrode on the scalp. Because EEG is a non-invasive, passive measuring device, it is safe for extended and repeated use, a character- istic crucial for adoption in HCI research. Additionally, it does not require a highly skilled operator or medical procedure to use. For more information about electrical signals generated by the brain as well as EEG, see [5]. The signal provided by an EEG is, at best, a crude representation of brain activity due to the nature of the detector. Scalp electrodes are only sensitive to macroscopic and co- ordinated firing of large groups of neurons near the surface of the brain, and then only when they are directed along a perpendicular vector relative to the scalp. Additionally, because of the fluid, bone, and skin that separate the electrodes from the actual electrical activity, the already small signals are scattered and attenuated before reaching the electrodes. EEG data is typically analyzed by looking at the spectral power of the signal in a set of frequency bands, which have been observed to correspond with certain types of neural activity [5]. These frequency bands are commonly defined as 1-4 Hz (delta), 4-8 Hz (theta), 8-12 Hz (alpha), 12-20 Hz (beta-low), 20-30 Hz (beta-high), and >30 Hz (gamma). Early researchers observed the sensitivity of EEG to changes in mental effort. For example, Hans Berger [2] and others [11] report observing a decrease in the amplitude of the alpha (8-12 Hz) rhythm during mental arithmetic tasks Other researchers have shown that higher memory loads cause increases in theta (4-8 Hz) and low-beta (12-15 Hz) power in the frontal midline regions of the scalp [17], gamma (>30 Hz) oscillations [8], as well as inter-electrode correlation, coherence, cross phase, and cross power [24]. To test if alpha and theta bands were predictive of memory and cognitive loads in real world computing tasks, Smith et al. [27] compared EEG data when task difficulty was manipulated within a multi-attribute task battery (MATB) mul- titasking environment. They report successfully creating a user-specific index of task load, the average values of which increase with increasing task difficulty and differed significantly between the difficulty manipulations. Given this evidence of the existence of reliable indicators of memory load, researchers have attempted to build techniques that utilize these features to measure and classify memory load. Unfortunately, while these indicators may appear to be reliable when data is averaged over large time periods and many users, there is large variability within the signal for any given user at any given point in time. This makes using the features to classify memory loads an extremely difficult task. While it is reasonable to average the data when trying to make statements about the various rhythms, it is less useful when trying to classify user state in real time. For example, Jensen et al. found the increased theta power in only one of their ten subjects, and rather than an alpha decrease, they found that alpha power actually collected data from 8 users over three 6-8 hour sessions and present results showing ~95% classification accuracy between two levels of memory load [7]. They also showed relatively high cross-task and cross-session accuracies. However, subtle decisions made in their procedure leaves room for improvement. First, collecting 24 hours worth of training data from each user can be prohibitively high for some work. Second, they perform a Laplacian spatial en- hancement that requires accurate per-subject head measurements to filter noise from the signal. Third, they manu- ally inspect the data and throw out periods where there are artifacts in the data even after performing an automatic artifact rejection. This is tedious and requires expertise in read- ing the EEG signals. They report throwing away up to 20% of the data, which is not desirable in our targeted settings where data may be scarce. Furthermore, having to perform this manual step between training and classification has implications on real-time usability of the system. Finally, since their design interleaved different tasks, and used random hold out cross validation, they were training on data that was temporally fairly close to test data and we cannot be certain how well the models would generalize when applied to new data. In our work, we aim to replicate their high classification results and extend their work to further explore the space. We also set out to explore how various parameters such as temporal window size, amount of training data, and number of channels affect the classification. These factors are important to understand if EEG classification is to be used in HCI ...

View in full-text

"Maybe" not all scalar implicatures are created equal

Conference Paper

Apr 2015

Stephen Politzer-Ahles

Most previous neurolinguistic experiments on scalar implicature have focused on the <some,all> scale. We examined the processing of the <maybe,definitely> scale using EEG and MEG. Participants read the word "maybe" in correct contexts, semantically incorrect contexts (where only "definitely not" would have been true), and pragmatically infelicitous...

Mental Workload Assessment Using Machine Learning Techniques Based on EEG and Eye Tracking Data

Article

Full-text available

Mar 2024

The main contribution of this study was the concurrent application of EEG and eye tracking techniques during n-back tasks as part of the methodology for addressing the problem of mental workload classification through machine learning algorithms. The experiments involved 15 university students, consisting of 7 women and 8 men. Throughout the experiments, the researchers utilized the n-back memory task and the NASA-Task Load Index (TLX) subjective rating scale to assess various levels of mental workload. The results indicating the relationship between EEG and eye tracking measures and mental workload are consistent with previous research. Regarding the four-class classification task, mental workload level could be predicted with 76.59% accuracy using 34 selected features. This study makes a significant contribution to the literature by presenting a four-class mental workload estimation model that utilizes different machine learning algorithms.

Assessing the Effects of Various Physiological Signal Modalities on Predicting Different Human Cognitive States

Preprint

Full-text available

Mar 2024

Robust estimation of systemic human cognitive states is critical for a variety of applications, from simply detecting inefficiencies in task assignments, to the adaptation of artificial agents behaviors to improve team performance in mixed-initiative human-machine teams. This study showed that human eye gaze, in particular, the percentage change in pupil size (PCPS), is the most reliable biomarker for assessing three human cognitive states including workload, sense of urgency, and mind wandering compared to electroencephalogram (EEG), functional near-infrared spectroscopy (fNIRS), respiration, and skin conductance. We used comprehensive multi-modal driving dataset to examine the accuracy of signals to assess these cognitive states. We performed comprehensive statistical tests to validate the performance of several physiological signals to determine human cognitive states and demonstrated that PCPS shows noticeably superior performance. We also characterized the link between workload and sense of urgency with eye gaze and observed that consecutive occurrences of higher sense of urgency were prone to increase overall workload. Finally, we trained five machine learning (ML) models and showed that four of them had similar accuracy in cognitive state classification (with one, random forest, showing inferior performance). The results provided evidence that the PCPS is a reliable physiological marker for cognitive state estimation.

Mental Workload Modeling by Using Machine Learning Techniques Based on EEG and Eye Tracking Data

Preprint

Full-text available

Feb 2024

The objective of this study was to analyze the mental workload using EEG and eye tracking data and classify it using machine learning algorithms. The machine learning model was developed based on the simultaneous recording of eye tracking and EEG measurements during the experimental process. The experiments involved 15 university students, consisting of 7 women and 8 men. Throughout the experiments, the researchers utilized the n-back memory task and the NASA-Task Load Index (TLX) subjective rating scale to assess various levels of mental workload. The findings revealed that as the task difficulty increased, there was an increase in the diameter of both the right and left pupils, the number of fixations, the number and duration of saccades, and the number and duration of blinks. Conversely, variables related to fixation duration decreased. The EEG results indicated that theta power in the prefrontal, frontal, and front central regions increased with task difficulty. Additionally, alpha power increased in the frontal regions but decreased in the temporal, parietal, and occipital regions as the task became more challenging. Furthermore, low beta power significantly decreased in almost all brain regions as the task difficulty increased. In terms of the four-class classification problem, the mental workload level can be predicted with an accuracy rate of 76.59% using 34 selected features. This study has made a significant contribution to the literature by presenting a four-class mental workload estimation model that utilizes different machine learning algorithms.

Cognitive Load Measurement with Physiological Sensors in Virtual Reality during Physical Activity

Conference Paper

Oct 2023

Interpretability of Hybrid Feature Using Graph Neural Networks from Mental Arithmetic Based EEG

Conference Paper

Full-text available

Feb 2023

A high cognitive load could significantly impairproblem-solving skills. Electroencephalogram (EEG)-basedreal-time assessment of mental workload is feasible, and graphneural networks (GNN) can classify brain activity patternsduring cognitively demanding tasks with high accuracy.However, previous GNN studies pertaining to mental workloadclassification lack explainability. This study utilized a state-of-the-art GNN variant with GNNexplainer to find relevantconnectivity during mental arithmetic (MA) tasks. In thisendeavor, MA EEG recordings were retrieved from an open-access database. The signals were transformed to graph datathrough the envelope correlation and power spectral density(PSD), and subjected to GNN with hierarchical graph poolingwith a structure learning model to classify MA and baseline(BL). The model accuracy was 85.57 ± 6.27 and 96.26 ± 4.14%for the connectivity dataset and the PSD and the connectivityfeature, respectively. Among the connections between nodesidentified as important by GNNExplainer, two notable edgepatterns were found as 1) from the left centro-parietal region toleft frontal regions, and 2) the frontoparietal connection. Theresults indicate 1) the GNN model performance could beimproved using the connectivity and PSD feature together, and2) characteristic patterns of the connectome and PSD could beimportant for MA classification. The connectivity analysis bythe “explainable” GNN model could be beneficial in future brainactivity pattern studies (PDF) Interpretability of Hybrid Feature Using Graph Neural Networks from Mental Arithmetic Based EEG.

Wearable EEG-Based Cognitive Load Classification by Personalized and Generalized Model Using Brain Asymmetry

Conference Paper

Full-text available

Jan 2023

Gamma-Band Modulation in Parietal Area as the Electroencephalographic Signature for Performance in Auditory–Verbal Working Memory: An Exploratory Pilot Study in Hearing and Unilateral Cochlear Implant Children

Article

Full-text available

Sep 2022
BSRCCS

This pilot study investigates the neurophysiological patterns of visual and auditory verbal working memory (VWM) in unilateral cochlear implant users (UCIs). We compared the task-related electroencephalogram (EEG) power spectral density of 7- to 13-year-old UCIs (n = 7) with a hearing control group (HC, n = 10) during the execution of a three-level n-back task with auditory and visual verbal (letters) stimuli. Performances improved as memory load decreased regardless of sensory modality (SM) and group factors. Theta EEG activation over the frontal area was proportionally influenced by task level; the left hemisphere (LH) showed greater activation in the gamma band, suggesting lateralization of VWM function regardless of SM. However, HCs showed stronger activation patterns in the LH than UCIs regardless of SM and in the parietal area (PA) during the most challenging audio condition. Linear regressions for gamma activation in the PA suggest the presence of a pattern-supporting auditory VWM only in HCs. Our findings seem to recognize gamma activation in the PA as the signature of effective auditory VWM. These results, although preliminary, highlight this EEG pattern as a possible cause of the variability found in VWM outcomes in deaf children, opening up new possibilities for interdisciplinary research and rehabilitation intervention.

Investigating Methods for Cognitive Workload Estimation for Assistive Robots

Article

Full-text available

Sep 2022
SENSORS-BASEL

Robots interacting with humans in assistive contexts have to be sensitive to human cognitive states to be able to provide help when it is needed and not overburden the human when the human is busy. Yet, it is currently still unclear which sensing modality might allow robots to derive the best evidence of human workload. In this work, we analyzed and modeled data from a multi-modal simulated driving study specifically designed to evaluate different levels of cognitive workload induced by various secondary tasks such as dialogue interactions and braking events in addition to the primary driving task. Specifically, we performed statistical analyses of various physiological signals including eye gaze, electroencephalography, and arterial blood pressure from the healthy volunteers and utilized several machine learning methodologies including k-nearest neighbor, naive Bayes, random forest, support-vector machines, and neural network-based models to infer human cognitive workload levels. Our analyses provide evidence for eye gaze being the best physiological indicator of human cognitive workload, even when multiple signals are combined. Specifically, the highest accuracy (in %) of binary workload classification based on eye gaze signals is 80.45 ∓ 3.15 achieved by using support-vector machines, while the highest accuracy combining eye gaze and electroencephalography is only 77.08 ∓ 3.22 achieved by a neural network-based model. Our findings are important for future efforts of real-time workload estimation in the multimodal human-robot interactive systems given that eye gaze is easy to collect and process and less susceptible to noise artifacts compared to other physiological signal modalities.

Understanding HCI Practices and Challenges of Experiment Reporting with Brain Signals: Towards Reproducibility and Reuse

Article

Aug 2022
ACM T COMPUT-HUM INT

In human-computer interaction (HCI), there has been a push towards open science, but to date, this has not happened consistently for HCI research utilizing brain signals due to unclear guidelines to support reuse and reproduction. To understand existing practices in the field, this paper examines 110 publications, exploring domains, applications, modalities, mental states and processes, and more. This analysis reveals variance in how authors report experiments, which creates challenges to understand, reproduce, and build on that research. It then describes an overarching experiment model that provides a formal structure for reporting HCI research with brain signals, including definitions, terminology, categories, and examples for each aspect. Multiple distinct reporting styles were identified through factor analysis and tied to different types of research. The paper concludes with recommendations and discusses future challenges. This creates actionable items from the abstract model and empirical observations to make HCI research with brain signals more reproducible and reusable.

Classifying mental workload using EEG data: A machine learning approach

Conference Paper

Full-text available

Jul 2022

Mental workload is related to the difference between the available mental resource capacity of the operator and the mental resource required by the job. To decide the number of tasks assigned to operator and the difficulty levels of those tasks, it is important to know the operator's mental workload. An overload occurs if the amount of resources required by the task exceeds the available capacity of the person. Mental workload analysis helps to recognize the mental fatigue, evaluate the human performance of different level tasks and adjust cognitive sources for safe and efficient human-machine interactions. Excessive levels of mental workload can lead to errors or delays in information processing. Monitoring brain activity has been verified to be sensitive and consistent reflector of mental workload changes. Classification, regression, clustering, anomaly detection, dimensionality reduction, and reward maximization are common machine learning models. Classification of mental workload has critical importance in the domain of human factors and ergonomics. In recent years, with the need to analyze continuous and large-scale data obtained by physiological methods, the use of machine learning algorithms has become widespread in estimating and classifying mental workload. The objectives of the current study were two-fold: (1) to investigate the relationship among EEG features, task difficulty levels and subjective self-assessment (NASA-TLX) scores and (2) to develop machine learning algorithms for classifying mental workload using EEG features. N-back tasks have been commonly used in the literature. In this study, N-back memory tests were performed at four different difficulty levels. As the number of n increases, so does the difficulty of the task. Four participants performed the tests. Seventy EEG features (5 frequency band power for 14 channels) were selected as independent variables. One output variable reflecting the difficulty level of N-Back memory was classified. The machine learning algorithms used in our study were K-Nearest Neighbor (KNN), Support Vector Machine (SVM), Artificial Neural Network (ANN), Random Forest (RF), Gradient Boosting Machine (GBM), Light Gradient Boosting Machine (LightGBM) and Extreme Gradient Boosting (XGBoost) algorithms. As the task difficulty increased, theta activity in prefrontal and frontal regions increased. Especially frontal theta power, parietal and occipital gamma power were significantly correlated to perceived workload scores obtained via NASA-TLX. Prefrontal beta-high activity had a significant negative relationship with self-assessment workload ratings. Prefrontal and frontal theta, prefrontal beta-high, occipital, parietal and temporal gamma and occipital alpha activities were found to be the most effective parameters. The results obtained for the four classes of classification problem reached the accuracy of 68% with EEG features as input and the Random Forest algorithm. In addition, the results obtained for the two classes of classification problem reached the accuracy of 87% with EEG features as input and the GBM algorithm. The results from the analysis indicate that EEG signals play an important role in the classification of mental workload. Another remarkable result was high classification performance of GBM, LightGBM and XGBoost algorithms that have been developed in the recent past and therefore not frequently used in studies on this subject in the literature.

Context in source publication

Similar publications

Citations