Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Mathematical Question Generation on Grade school math
Loading...
0.81
Lexical Similarity
VOYAGER
0.2796
0.4173
0.555
0.6927
Dec 12, 2025
Lexical Similarity
Cosine Similarity
Vendi Score
General Quality Score
LLM Evaluation Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Lexical Similarity
Cosine Similarity
Vendi Score
General Quality Score
LLM Evaluation Score
VOYAGER
2025.12
0.81
0.48
18.78
14.77
399
HIERARCHICAL
2025.12
0.68
0.4
8.72
15
550
SUBSETSELECT
2025.12
0.57
0.22
3.48
14.98
500
TEMP
2025.12
0.56
0.22
3.56
15
50
DEFAULT
2025.12
0.54
0.2
3.04
14.99
50
DIVERSE
2025.12
0.47
0.07
1.65
15
50
HISTORY
2025.12
0.3
0.24
3.13
14.83
50
Feedback
Search any
task
Search any
task