Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Poem Generation on Creative Tasks Generate a poem
Loading...
0.86
Lexical Score
VOYAGER
0.704
0.7445
0.785
0.8255
Dec 12, 2025
Lexical Score
Cosine Similarity
Vendi Score
Overall Quality
Updated 4d ago
Evaluation Results
Method
Method
Links
Lexical Score
Cosine Similarity
Vendi Score
Overall Quality
VOYAGER
LLM calls=615
2025.12
0.86
0.3
7.31
24.51
HIERARCHICAL
LLM calls=550
2025.12
0.82
0.3
5.68
22.56
TEMP
LLM calls=50
2025.12
0.78
0.16
3.22
22.65
DIVERSE
LLM calls=50
2025.12
0.78
0.14
2.76
22.79
DEFAULT
LLM calls=50
2025.12
0.76
0.15
3
22.52
SUBSETSELECT
LLM calls=500
2025.12
0.76
0.16
3.08
22.52
HISTORY
LLM calls=50
2025.12
0.71
0.11
2.3
22.45
Feedback
Search any
task
Search any
task