| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Distractor Effectiveness | EduAgent (test) | Agreement Accuracy0.7933 | 10 | |
| Discrimination | EduAgent (test) | Accuracy66.39 | 10 | |
| Difficulty | EduAgent (test) | Accuracy (AA)68.95 | 10 | |
| Topic Coverage | EduAgent (test) | AA98.85 | 10 | |
| Question Generation Evaluation | EduAgent GenQs (test) | Accuracy74.17 | 7 |