Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MambaGaze: Bidirectional Mamba with Explicit Missing Data Modeling for Cognitive Load Assessment from Eye-Gaze Tracking Data

About

Real-time cognitive load assessment from eye-tracking signals could potentially enable adaptive human-centered-AI such as safety-critical applications such as driver vigilance monitoring or automated flight deck assistance, yet two challenges persist: handling frequent data missingness from blinks and tracking failures, and efficiently modeling long-range temporal dependencies. We propose MambaGaze, a framework that addresses these challenges through 1) XMD encoding, which augments raw features with observation masks and time-deltas to explicitly model data uncertainty, and 2) bidirectional Mamba-2, which captures temporal dependencies with linear computational complexity. Experiments on CLARE and CL-Drive datasets under leave-one-subject-out evaluation show that MambaGaze achieves 76.8% and 73.1% accuracy, respectively, outperforming CNN, Transformer, ResNet, and VGG baselines by 4-12 percentage points. Edge deployment benchmarks on NVIDIA Jetson platforms demonstrate real-time inference at 43-68 FPS with power consumption below 7.5W, confirming feasibility for wearable cognitive load monitoring.

Amir Mousavi, Mohammad Sadegh Sirjani, Erfan Nourbakhsh, Mimi Xie, Rocky Slavin, Leslie Neely, John Davis, John Quarles• 2026

Related benchmarks

TaskDatasetResultRank
Cognitive Load ClassificationCLARE (LOSO)
Accuracy76.8
8
Cognitive Load ClassificationCL-Drive (LOSO)
Accuracy (LOSO)73.1
8
Cognitive Load ClassificationCLARE (K-Fold)
Accuracy73.3
8
Cognitive Load ClassificationCL-Drive (K-Fold)
Accuracy68.8
8
Showing 4 of 4 rows

Other info

Follow for update