Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

EduAgent

Benchmarks

Task NameDataset NameSOTA ResultTrend
Distractor EffectivenessEduAgent (test)
Agreement Accuracy0.7933
10
DiscriminationEduAgent (test)
Accuracy66.39
10
DifficultyEduAgent (test)
Accuracy (AA)68.95
10
Topic CoverageEduAgent (test)
AA98.85
10
Question Generation EvaluationEduAgent GenQs (test)
Accuracy74.17
7
Showing 5 of 5 rows