RAG-HAR: Retrieval Augmented Generation-based Human Activity Recognition
About
Human Activity Recognition (HAR) underpins applications in healthcare, rehabilitation, fitness tracking, and smart environments, yet existing deep learning approaches demand dataset-specific training, large labeled corpora, and significant computational resources.We introduce RAG-HAR, a training-free retrieval-augmented framework that leverages large language models (LLMs) for HAR. RAG-HAR computes lightweight statistical descriptors, retrieves semantically similar samples from a vector database, and uses this contextual evidence to make LLM-based activity identification. We further enhance RAG-HAR by first applying prompt optimization and introducing an LLM-based activity descriptor that generates context-enriched vector databases for delivering accurate and highly relevant contextual information. Along with these mechanisms, RAG-HAR achieves state-of-the-art performance across six diverse HAR benchmarks. Most importantly, RAG-HAR attains these improvements without requiring model training or fine-tuning, emphasizing its robustness and practical applicability. RAG-HAR moves beyond known behaviors, enabling the recognition and meaningful labelling of multiple unseen human activities.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Human Activity Recognition | PAMAP2 | F1 Score91.12 | 26 | |
| Human Activity Recognition | PAMAP2 (test) | -- | 21 | |
| Activity Recognition | mHealth | F1 Score96.74 | 17 | |
| Human Activity Recognition | HHAR | F1 Score59.86 | 11 | |
| Human Activity Recognition | GOTOV | F1 Score79.92 | 7 | |
| Human Activity Recognition | SKODA | F1 Score95.21 | 7 | |
| Human Activity Recognition | USC-HAD | Macro F158.63 | 6 |