Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Veracity Inference on PRONTOQA 5-hop (test)
Loading...
0.955
Hamming Similarity
AVI
0.6534
0.7317
0.81
0.8883
May 17, 2025
Hamming Similarity
Updated 4d ago
Evaluation Results
Method
Method
Links
Hamming Similarity
AVI
Base LLM=Qwen 8B
2025.05
0.955
AVI
Base LLM=Qwen 4B
2025.05
0.913
Many2Many
Base LLM=Qwen 4B
2025.05
0.684
Many2Many
Base LLM=Qwen 8B
2025.05
0.665
Feedback
Search any
task
Search any
task