Retrieval-Augmented Generation on WoW
Top result: ConsJudge, 88.87 LLM Score
[Chart: LLM Score over time; plotted date: Feb 26, 2025]
Updated 4d ago
Evaluation Results
| Method | Configuration | Date | LLM Score |
| --- | --- | --- | --- |
| ConsJudge | Generator=Llama3-8B-In... | 2025.02 | 88.87 |
| Vanilla LLM | Generator=Llama3-8B-In... | 2025.02 | 88.31 |
| SFT | Generator=Llama3-8B-In... | 2025.02 | 87.97 |
| SFT | Generator=MiniCPM-2.4B... | 2025.02 | 87.51 |
| Vanilla LLM | Generator=MiniCPM-2.4B... | 2025.02 | 87.49 |
| ConsJudge | Generator=MiniCPM-2.4B... | 2025.02 | 87.21 |
| ConsJudge | Generator=MiniCPM-2.4B... | 2025.02 | 86.13 |
| Vanilla LLM | Generator=MiniCPM-2.4B... | 2025.02 | 85.98 |
| SFT | Generator=MiniCPM-2.4B... | 2025.02 | 85.98 |
| Raw Metric | Generator=MiniCPM-2.4B... | 2025.02 | 85.48 |
| Raw Metric | Generator=Llama3-8B-In... | 2025.02 | 82.66 |