Story Premise Evaluation on Story Premises 100 samples 1.0 (test)
[Chart: Fascination score by method; top result 73.66 (STM), Jun 9, 2024. Selectable metrics: Fascination, Completeness, Originality, Average Score.]
Updated 1mo ago
Evaluation Results
Method  Details                     Date     Fascination  Completeness  Originality  Average Score
STM     Evaluation Judge=Claud...   2024.06  73.66        67.4          89.65        76.9
MoPS    Evaluation Judge=Claud...   2024.06  73.65        72.35         94.75        80.25
CPX     Evaluation Judge=Claud...   2024.06  71.22        66.4          84.65        74.09
WP      Evaluation Judge=Claud...   2024.06  70.74        51.9          93.7         72.11
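
The page does not state how Average Score is derived. The reported values are consistent with an unweighted mean of Fascination, Completeness, and Originality, and the sketch below checks that assumption against the four rows above. The function name `mean_of_metrics` and the `ROWS` layout are illustrative, not part of the benchmark.

```python
# Scores copied from the results table above.
# Assumption (not stated on the page): Average Score is the unweighted
# mean of the three metric scores, rounded to two decimals.
ROWS = {
    # method: (fascination, completeness, originality, reported_average)
    "STM": (73.66, 67.40, 89.65, 76.90),
    "MoPS": (73.65, 72.35, 94.75, 80.25),
    "CPX": (71.22, 66.40, 84.65, 74.09),
    "WP": (70.74, 51.90, 93.70, 72.11),
}


def mean_of_metrics(fascination: float, completeness: float, originality: float) -> float:
    """Return the unweighted mean of the three metric scores."""
    return (fascination + completeness + originality) / 3


for method, (f, c, o, reported) in ROWS.items():
    computed = mean_of_metrics(f, c, o)
    print(f"{method}: computed {computed:.2f}, reported {reported:.2f}")
```

Under this assumption MoPS has the highest Average Score even though STM leads narrowly on Fascination alone.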