Long-context retrieval and reasoning on Loong full (evaluation)

68 Average Score

Baseline (Full Context)

Updated 4mo ago

Evaluation Results

Method	Date	Links	Average Score	Second metric (unlabeled)
Baseline (Full Context)	2026.03		68	31.4
SPD-RAG	2026.03		58.1	18.6
Normal RAG	2026.03		33	13.7
Agentic RAG	2026.03		32.8	8.8