SEAR: A Multimodal Dataset for Analyzing AR-LLM-Driven Social Engineering Behaviors
About
The SEAR Dataset is a novel multimodal resource designed to study the emerging threat of social engineering (SE) attacks orchestrated through augmented reality (AR) and multimodal large language models (LLMs). This dataset captures 180 annotated conversations across 60 participants in simulated adversarial scenarios, including meetings, classes and networking events. It comprises synchronized AR-captured visual/audio cues (e.g., facial expressions, vocal tones), environmental context, and curated social media profiles, alongside subjective metrics such as trust ratings and susceptibility assessments. Key findings reveal SEAR's alarming efficacy in eliciting compliance (e.g., 93.3% phishing link clicks, 85% call acceptance) and hijacking trust (76.7% post-interaction trust surge). The dataset supports research in detecting AR-driven SE attacks, designing defensive frameworks, and understanding multimodal adversarial manipulation. Rigorous ethical safeguards, including anonymization and IRB compliance, ensure responsible use. The SEAR dataset is available at https://github.com/INSLabCN/SEAR-Dataset.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Latency Measurement | Social Engineering Defense Evaluation | Min Latency1 | 6 | |
| Latency Analysis | 60 Conversations | Minimum Latency (s)1 | 4 | |
| Social Experience Evaluation | User Study Social Experience 1.0 (Evaluation set (60 participants)) | Score 5 Distribution (%)76.7 | 4 | |
| Social-engineering effectiveness evaluation | Social-engineering Photo Link channel | 5pt (%)40 | 2 | |
| Social-engineering effectiveness evaluation | Social-engineering Social App channel | 5 Points (%)43.3 | 2 | |
| Social-engineering effectiveness evaluation | Social-engineering SMS channel | 5pt Score (%)45 | 2 | |
| Social-engineering effectiveness evaluation | Social-engineering Phone Call channel | 5pt (%)35 | 2 |