Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Factual Precision Evaluation on Culture & Entertainment
Loading...
0.887
FACTSCORE
MistralINST
-0.02508
0.21171
0.4485
0.68529
Jul 4, 2024
FACTSCORE
Updated 4d ago
Evaluation Results
Method
Method
Links
FACTSCORE
MistralINST
Scenario=INFO, CORE=w/o
2024.07
0.887
MistralINST
Scenario=REP, CORE=w/o
2024.07
0.879
GPT-2
Scenario=INFO, CORE=w/o
2024.07
0.871
MistralINST
Scenario=NORMAL, CORE=w/o
2024.07
0.815
MistralINST
Scenario=NORMAL, CORE=w/
2024.07
0.797
GPT-2
Scenario=REP, CORE=w/o
2024.07
0.727
MistralINST
Scenario=INFO, CORE=w/
2024.07
0.404
MistralINST
Scenario=REP, CORE=w/
2024.07
0.264
GPT-2
Scenario=INFO, CORE=w/
2024.07
0.0341
GPT-2
Scenario=REP, CORE=w/
2024.07
0.01
Feedback
Search any
task
Search any
task