Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Question Generation Evaluation on EduAgent (test)
Loading...
74.17
Accuracy
ChatEval
64.6332
67.1091
69.585
72.0609
Mar 7, 2025
Accuracy
CA
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
CA
ChatEval
2025.03
74.17
56.67
QG-SMS
2025.03
74.17
63.33
Vanilla
2025.03
70.83
58.33
Metrics
2025.03
69.17
53.33
Reference
2025.03
67.5
55
CoT
2025.03
65
38.33
Swap
2025.03
65
48.33
Feedback
Search any
task
Search any
task