Our new X account is live! Follow @wizwand_team for updates

Groundedness on RagTruth

0.57Kendall's Tau

Jury-on-Demand

Updated 4d ago

Evaluation Results

Method	Links
Jury-on-Demand 2025.12		0.57	0.03
Gemini 2.5 Flash 2025.12		0.56	0.03
GPT-OSS-120B 2025.12		0.52	0.05
GPT-OSS-20B 2025.12		0.51	0.05
Gemini 2.5 Pro 2025.12		0.49	0.05
Claude 3.7 2025.12		0.4	0.06
Gemini 2.0 Flash 2025.12		0.3	0.04
Gemma 3 2025.12		0.14	0.05
DeepSeek R1 2025.12		0.14	0.07
LLAMA 3.2 2025.12		0.12	0.05
Phi 4 2025.12		0.07	0.09