Mathematical Question Generation on Grade school math
[Chart: Lexical Similarity by method. Highlighted point: VOYAGER, 0.81 (Dec 12, 2025). Selectable metrics: Lexical Similarity, Cosine Similarity, Vendi Score, General Quality Score, LLM Evaluation Score.]
Updated 4d ago
Evaluation Results
Method        Date     Lexical Similarity  Cosine Similarity  Vendi Score  General Quality Score  LLM Evaluation Score
VOYAGER       2025.12  0.81                0.48               18.78        14.77                  399
HIERARCHICAL  2025.12  0.68                0.4                8.72         15                     550
SUBSETSELECT  2025.12  0.57                0.22               3.48         14.98                  500
TEMP          2025.12  0.56                0.22               3.56         15                     50
DEFAULT       2025.12  0.54                0.2                3.04         14.99                  50
DIVERSE       2025.12  0.47                0.07               1.65         15                     50
HISTORY       2025.12  0.3                 0.24               3.13         14.83                  50