Evidence Identification on NarrativeQA, FairyTaleQA, and TriviaQA Average
[Chart: F1 Score by date. Labeled point: XGRAG, 62 (Apr 27, 2026). Selectable metrics: F1 Score, MRR, Precision@10, Precision@30, Precision@50.]
Updated 1mo ago
Evaluation Results
| Method | Configuration | Date | F1 Score | MRR | Precision@10 | Precision@30 | Precision@50 |
|---|---|---|---|---|---|---|---|
| XGRAG | Granularity=node-level | 2026.04 | 62 | 72 | 66 | 44 | 57 |
| RAG-Ex | Granularity=word-level | 2026.04 | 54 | 23 | 8 | 11 | 19 |
| XGRAG | Granularity=edge-level | 2026.04 | 52 | 65 | 22 | 42 | 48 |
| RAG-Ex | Granularity=sentence-l... | 2026.04 | 34 | 61 | 35 | 42 | 54 |