Retrieval-Augmented Generation on TriviaQA
Best result: ConsJudge, 88.26 Accuracy (Feb 26, 2025). Updated 4d ago.
Evaluation Results
Method       Configuration               Date     Accuracy
ConsJudge    Generator=Llama3-8B-In...   2025.02  88.26
Raw Metric   Generator=Llama3-8B-In...   2025.02  86.83
SFT          Generator=Llama3-8B-In...   2025.02  86.1
Vanilla LLM  Generator=Llama3-8B-In...   2025.02  85.13
ConsJudge    Generator=MiniCPM-2.4B...   2025.02  80.8
ConsJudge    Generator=MiniCPM-2.4B...   2025.02  80.69
Vanilla LLM  Generator=MiniCPM-2.4B...   2025.02  80.4
SFT          Generator=MiniCPM-2.4B...   2025.02  80.33
Raw Metric   Generator=MiniCPM-2.4B...   2025.02  80.03
Vanilla LLM  Generator=MiniCPM-2.4B...   2025.02  80.03
SFT          Generator=MiniCPM-2.4B...   2025.02  79.8
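
The listed results can be compared per generator with a few lines of Python. This is only a sketch: the rows are transcribed from the results listed here, the generator labels are kept exactly as truncated on the page, and the helper names (`best_per_generator`, `gain_over_baseline`) are hypothetical. Some methods appear more than once for the MiniCPM generator, and the page does not say how those entries differ, so the sketch assumes it is reasonable to take the highest score per method.

```python
# Rows transcribed from the leaderboard: (method, generator label, accuracy).
# Generator labels are kept as truncated on the page; they are not expanded here.
ROWS = [
    ("ConsJudge", "Llama3-8B-In...", 88.26),
    ("Raw Metric", "Llama3-8B-In...", 86.83),
    ("SFT", "Llama3-8B-In...", 86.1),
    ("Vanilla LLM", "Llama3-8B-In...", 85.13),
    ("ConsJudge", "MiniCPM-2.4B...", 80.8),
    ("ConsJudge", "MiniCPM-2.4B...", 80.69),
    ("Vanilla LLM", "MiniCPM-2.4B...", 80.4),
    ("SFT", "MiniCPM-2.4B...", 80.33),
    ("Raw Metric", "MiniCPM-2.4B...", 80.03),
    ("Vanilla LLM", "MiniCPM-2.4B...", 80.03),
    ("SFT", "MiniCPM-2.4B...", 79.8),
]


def best_per_generator(rows):
    """Return {generator: (method, accuracy)} for the top-scoring row of each generator."""
    best = {}
    for method, generator, accuracy in rows:
        if generator not in best or accuracy > best[generator][1]:
            best[generator] = (method, accuracy)
    return best


def gain_over_baseline(rows, generator, baseline="Vanilla LLM"):
    """Best accuracy for a generator minus the best baseline accuracy, in points.

    Assumption: when a method is listed more than once, its highest score is used.
    """
    scores = [acc for _, gen, acc in rows if gen == generator]
    baseline_scores = [
        acc for method, gen, acc in rows if gen == generator and method == baseline
    ]
    if not scores or not baseline_scores:
        raise ValueError(f"no rows for generator {generator!r} with baseline {baseline!r}")
    return round(max(scores) - max(baseline_scores), 2)


if __name__ == "__main__":
    for generator, (method, accuracy) in best_per_generator(ROWS).items():
        gain = gain_over_baseline(ROWS, generator)
        print(f"{generator}: best={method} ({accuracy}), gain over Vanilla LLM={gain}")
```

With these rows, ConsJudge is the top entry for both generators. The margin over the Vanilla LLM baseline is larger for the Llama3 generator (88.26 vs 85.13) than for the MiniCPM generator (80.8 vs 80.4).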