Question Generation Evaluation on EduAgent HumanQs (test)
[Chart: AA by date. Plotted point: Human, 78.33 (Mar 7, 2025).]
Evaluation Results
Method     Date      AA
Human      2025.03   78.33
QG-SMS     2025.03   76.67
Swap       2025.03   73.33
Vanilla    2025.03   70.83
Metrics    2025.03   70.83
Reference  2025.03   69.17
ChatEval   2025.03   69.17
CoT        2025.03   67.5