Share your thoughts, 1 month free Claude Pro on usSee more

Action Prediction on Human Evaluation User Actions Dataset (test)

79Win Rate

LongNAP

Updated 1mo ago

Evaluation Results

Method	Links
LongNAP 2026.03		79
Few-shot RAG 2026.03		58.5
Zero-shot 2026.03		55
Few-shot RAG 2026.03		45
Zero-shot 2026.03		33
SFT 2026.03		29.5