Hallucination Prediction on NQ-Swap
Chart: Accuracy (entity) over time. Best result: Alarmer, 29.4 (Feb 22, 2025).
Updated 3d ago
Evaluation Results
Method    Setting                    Date      Accuracy (entity)
Alarmer   Model=Mistral 7b, Prot...  2025.02   29.4
Alarmer   Model=Llama-2-7b-chat,...  2025.02   28.7
Prompt    Model=Mistral 7b, Prot...  2025.02   5
Prompt    Model=Llama-2-7b-chat,...  2025.02   3.8