Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Factual Precision Evaluation on Geographic
Loading...
86.7
FactScore
MistralINST
-2.1992
20.8804
43.96
67.0396
Jul 4, 2024
FactScore
Updated 4d ago
Evaluation Results
Method
Method
Links
FactScore
MistralINST
Scenario=INFO, CORE=w/o
2024.07
86.7
MistralINST
Scenario=REP, CORE=w/o
2024.07
84.9
GPT-2
Scenario=REP, CORE=w/o
2024.07
80.7
MistralINST
Scenario=NORMAL, CORE=w/o
2024.07
79.9
MistralINST
Scenario=NORMAL, CORE=w/
2024.07
78.3
GPT-2
Scenario=INFO, CORE=w/o
2024.07
75.8
MistralINST
Scenario=INFO, CORE=w/
2024.07
53.7
MistralINST
Scenario=REP, CORE=w/
2024.07
32.7
GPT-2
Scenario=INFO, CORE=w/
2024.07
1.68
GPT-2
Scenario=REP, CORE=w/
2024.07
1.22
Feedback
Search any
task
Search any
task