Overall on Loong Set 1: 10K–50K Tokens
[Chart: LLM Score by date. Top point: Disco-RAG, LLM Score 71, Jan 7, 2026. Available metrics: LLM Score, Exact Match.]
Updated 4d ago
Evaluation Results
| Method | Setting | Links | LLM Score | Exact Match |
|---|---|---|---|---|
| Disco-RAG | Base Model=Llama-3.3-70B | 2026.01 | 71 | 38 |
| StructRAG | Condition=SOTA Results | 2026.01 | 69.43 | 35 |
| Disco-RAG | Base Model=Qwen2.5-72B | 2026.01 | 69.39 | 33 |
| Disco-RAG | Base Model=Llama-3.1-8B | 2026.01 | 69.18 | 32 |
| Llama-3.3-70B | Condition=Standard RAG | 2026.01 | 62.78 | 34 |
| Qwen2.5-72B | Condition=Standard RAG | 2026.01 | 61.58 | 33 |
| Llama-3.1-8B | Condition=Standard RAG | 2026.01 | 60.08 | 25 |
| Llama-3.3-70B | Condition=Full Context | 2026.01 | 59.54 | 32 |
| Qwen2.5-72B | Condition=Full Context | 2026.01 | 56.59 | 31 |
| Llama-3.1-8B | Condition=Full Context | 2026.01 | 56.16 | 30 |
| RQ-RAG | Condition=SOTA Results | 2026.01 | 53.51 | 17 |
| GraphRAG | Condition=SOTA Results | 2026.01 | 40.82 | 18 |
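
The results above can be compared per base model. The following is a minimal sketch, not part of the leaderboard itself: it copies the LLM Score values for the three base models from the Evaluation Results and computes how far Disco-RAG sits above each baseline condition. The names `llm_score` and `gain_over` are hypothetical and exist only for this illustration.

```python
# LLM Score values copied from the Evaluation Results above, keyed by base model.
# The dictionary layout and helper name are illustrative assumptions.
llm_score = {
    "Llama-3.3-70B": {"Disco-RAG": 71.00, "Standard RAG": 62.78, "Full Context": 59.54},
    "Qwen2.5-72B": {"Disco-RAG": 69.39, "Standard RAG": 61.58, "Full Context": 56.59},
    "Llama-3.1-8B": {"Disco-RAG": 69.18, "Standard RAG": 60.08, "Full Context": 56.16},
}


def gain_over(baseline: str) -> dict[str, float]:
    """Return the Disco-RAG minus baseline LLM Score gap for each base model."""
    return {
        model: round(scores["Disco-RAG"] - scores[baseline], 2)
        for model, scores in llm_score.items()
    }


gains_vs_rag = gain_over("Standard RAG")
gains_vs_full = gain_over("Full Context")

for model in llm_score:
    print(
        f"{model}: +{gains_vs_rag[model]:.2f} vs Standard RAG, "
        f"+{gains_vs_full[model]:.2f} vs Full Context"
    )
```

These differences are plain subtractions of the listed scores. They say nothing about statistical significance, which the page does not report.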