Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
General Question Answering on TriviaQA (Acc., Final Gap)
Loading...
76.1
Accuracy
Pioneer Agent
31.38
42.99
54.6
66.21
Apr 10, 2026
Accuracy
Final Gap (%)
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Final Gap (%)
Pioneer Agent
Model=Llama 3.2-3B, Sy...
2026.04
76.1
43
Pioneer Agent
Model=Llama 3.2-3B, Sy...
2026.04
73.5
43
Naive Baseline
Model=Llama 3.2-3B, Sy...
2026.04
34
43
Naive Baseline
Model=Llama 3.2-3B, Sy...
2026.04
33.1
43
Feedback
Search any
task
Search any
task