Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Story Premise Diversity Evaluation on Story Premise Human Evaluation Set 600 premises 1.0 (test)
Loading...
3.875
Average Score
MoPS
2.185
2.62375
3.0625
3.50125
Jun 9, 2024
Average Score
E1*
E2*
E3*
E4
E5
E6
E7*
E8
Updated 4d ago
Evaluation Results
Method
Method
Links
Average Score
E1*
E2*
E3*
E4
E5
E6
E7*
E8
MoPS
2024.06
3.875
4
5
3
4
4
3
4
4
WritingPrompts (WP)
source=Reddit
2024.06
3.75
2
5
5
2
5
5
4
2
DOC
backbone=llama2-13b-chat
2024.06
3.5
3
2
4
5
5
4
3
4
Storium (STM)
source=RPG platform
2024.06
3.125
3
3
3
3
4
4
2
2
Vanilla (VIL)
backbone=gpt-3.5-turbo...
2024.06
2.625
3
2
4
3
3
2
2
2
Complex (CPX)
mode=few-shot, shots=3
2024.06
2.25
4
2
2
2
3
1
1
3
Feedback
Search any
task
Search any
task