Query-Conditioned Graph Retrieval for Contextualized LLM Reasoning in Personalized Wearable Data
About
Large language models (LLMs) are increasingly applied to analyzing wearable sensing data, which are long-term, multimodal, and highly personalized. A key challenge is context selection: providing insufficient context limits reasoning, while including all available data leads to inefficiency and degraded generation quality. We propose Wearable As Graph (WAG), a graph-based context retrieval framework that enables query-adaptive reasoning over wearable data with LLMs. WAG organizes wearable metrics and user-specific signals into a personalized knowledge graph, and retrieves a query-conditioned subgraph to support downstream generation. The retrieval process integrates global relationships, capturing prior knowledge and population- and individual-level patterns via hierarchical Bayesian modeling, with local relationships that reflect short-term signal deviations. A query openness signal further controls retrieval breadth. We evaluate WAG on over 10,000 data-grounded queries from real-world wearable datasets. Across LLM-based and human evaluations, WAG achieves an approximately 70% win rate over baseline and standard RAG methods, demonstrating the effectiveness of structured, query-adaptive context retrieval for LLM-driven analysis of wearable data.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Human Ranking | Main Experiment | Mean Rank1.37 | 15 | |
| Global Modeling Weighting Strategy Analysis | Overall (Globem, IFH Affect, Lifesnap, Pmdata) (test) | Average Rank2.16 | 8 | |
| Local Modeling Effectiveness Analysis | Overall (Globem, IFH Affect, Lifesnap, Pmdata) (test) | Average Rank1.88 | 6 | |
| Personalized Retrieval | Overall (Globem, IFH Affect, Lifesnap, Pmdata) (test) | Average Rank1.41 | 6 | |
| Retrieval-Augmented Generation | Globem | Insight1.43 | 3 | |
| Retrieval-Augmented Generation | IFH Affect | Insight1.4 | 3 | |
| Retrieval-Augmented Generation | Lifesnap | Insight Score1.39 | 3 | |
| Retrieval-Augmented Generation | Pmdata | Insight Score1.41 | 3 |