Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

HARGPT: Are LLMs Zero-Shot Human Activity Recognizers?

About

There is an ongoing debate regarding the potential of Large Language Models (LLMs) as foundational models seamlessly integrated with Cyber-Physical Systems (CPS) for interpreting the physical world. In this paper, we carry out a case study to answer the following question: Are LLMs capable of zero-shot human activity recognition (HAR). Our study, HARGPT, presents an affirmative answer by demonstrating that LLMs can comprehend raw IMU data and perform HAR tasks in a zero-shot manner, with only appropriate prompts. HARGPT inputs raw IMU data into LLMs and utilizes the role-play and think step-by-step strategies for prompting. We benchmark HARGPT on GPT4 using two public datasets of different inter-class similarities and compare various baselines both based on traditional machine learning and state-of-the-art deep classification models. Remarkably, LLMs successfully recognize human activities from raw IMU data and consistently outperform all the baselines on both datasets. Our findings indicate that by effective prompting, LLMs can interpret raw IMU data based on their knowledge base, possessing a promising potential to analyze raw sensor data of the physical world effectively.

Sijie Ji, Xinzhe Zheng, Chenshu Wu• 2024

Related benchmarks

TaskDatasetResultRank
Human Activity RecognitionPAMAP2
Accuracy32.11
54
Activity RecognitionShoaib
Accuracy21
42
Activity RecognitionmHealth
F1 Score7.4
35
Activity RecognitionPAMAP2
Accuracy11.1
22
Activity RecognitionDSADS--
20
Activity RecognitionOpportunity
Accuracy28.8
18
Activity RecognitionUSC-HAD
Accuracy9.5
18
Activity RecognitionTNDA-HAR
Accuracy13.7
18
Activity RecognitionUTD-MHAD
Accuracy3.3
18
Activity Recognitionw-HAR
Accuracy4.9
18
Showing 10 of 34 rows

Other info

Follow for update