Information Coverage and Truthfulness Evaluation on Corpus-based Retrieval
[Chart: S_fact by method. Leading entry: Mixtral-8x22B, S_fact 34.4 (Jan 7, 2025).]
Evaluation Results
Method | Configuration | Date | S_fact | ICAT-M Coverage | ICAT-M Score | ICAT-S Coverage | ICAT-S Score | ICAT-A Coverage | ICAT-A Score
Mixtral-8x22B | Alignment LLM=Llama-3.... | 2025.01 | 34.4 | 37 | 29.7 | 41.4 | 34.2 | 40.9 | 33.9
GPT-4 | Alignment LLM=Llama-3.... | 2025.01 | 34.3 | 41.6 | 32.7 | 45.3 | 34.6 | 46.3 | 35.4
Openchat 3.5 (7B) | Alignment LLM=Llama-3.... | 2025.01 | 34 | 41.3 | 32.9 | 42.9 | 34.8 | 42.4 | 34.7
Llama-3-70B | Alignment LLM=Llama-3.... | 2025.01 | 32.7 | 45.1 | 33.5 | 46.4 | 35.5 | 46.6 | 35.4