Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Framework Usability Evaluation on Participant Ratings User Study (11 participants) 1.0 (test)
Loading...
4.27
Setup & Config Rating
EKA-EVAL
1.8156
2.4528
3.09
3.7272
Jul 2, 2025
Setup & Config Rating
Navigation Rating
UI Rating
Result Export Rating
Extensibility Rating
Multilingual Support Rating
Updated 3mo ago
Evaluation Results
Method
Method
Links
Setup & Config Rating
Navigation Rating
UI Rating
Result Export Rating
Extensibility Rating
Multilingual Support Rating
EKA-EVAL
2025.07
4.27
4.55
4.64
4.55
4.64
4.73
lm-eval-harness
2025.07
3.73
3.91
1
3.64
4.18
1.64
indic-eval
2025.07
2.55
3.18
1
2.55
2.55
4.55
FreeEval
2025.07
2.55
2.45
2.73
2.64
2.36
1.45
OpenCompass
2025.07
2.18
2.36
3.09
3.09
2.8
2.09
HELM
2025.07
1.91
2.64
2.36
2.55
2.09
1.36
Feedback
Search any
task
Search any
task