General Usability Evaluation on Just Eval
[Chart: Helpfulness score by method. Highlighted point: AdaCD, 4.78, Apr 18, 2026. Selectable metrics: Helpfulness, Engagement, Depth, Clarity, Factuality, Average Score.]
Updated 1mo ago
Evaluation Results
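| Method   | Date    | Helpfulness | Engagement | Depth | Clarity | Factuality | Average Score |
|----------|---------|-------------|------------|-------|---------|------------|---------------|
| AdaCD    | 2026.04 | 4.78        | 4.47       | 3.83  | 4.88    | 4.49       | 4.49          |
| Default  | 2026.04 | 4.75        | 4.19       | 3.88  | 4.85    | 4.5        | 4.43          |
| Surgical | 2026.04 | 4.66        | 4.27       | 3.6   | 4.81    | 4.39       | 4.35          |
| Prompt   | 2026.04 | 4.33        | 4.18       | 3.17  | 4.96    | 4.54       | 4.24          |
| SSD      | 2026.04 | 4.3         | 3.9        | 3.72  | 4.72    | 4.49       | 4.23          |
| SelfCD   | 2026.04 | 3.63        | 4.33       | 2.74  | 4.26    | 4.45       | 3.88          |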
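The page does not state how the Average Score column is derived. The listed values are consistent with an unweighted mean of the five per-metric scores, rounded to two decimals. The sketch below checks that assumption against the numbers in the table; the names `scores`, `listed_average`, and `mean_score` are illustrative and not part of the benchmark.

```python
# Per-metric scores copied from the table, in the order:
# Helpfulness, Engagement, Depth, Clarity, Factuality.
scores = {
    "AdaCD":    [4.78, 4.47, 3.83, 4.88, 4.49],
    "Default":  [4.75, 4.19, 3.88, 4.85, 4.5],
    "Surgical": [4.66, 4.27, 3.6, 4.81, 4.39],
    "Prompt":   [4.33, 4.18, 3.17, 4.96, 4.54],
    "SSD":      [4.3, 3.9, 3.72, 4.72, 4.49],
    "SelfCD":   [3.63, 4.33, 2.74, 4.26, 4.45],
}

# Average Score column as listed on the page.
listed_average = {
    "AdaCD": 4.49,
    "Default": 4.43,
    "Surgical": 4.35,
    "Prompt": 4.24,
    "SSD": 4.23,
    "SelfCD": 3.88,
}


def mean_score(values):
    """Unweighted mean rounded to two decimals (assumed aggregation rule)."""
    return round(sum(values) / len(values), 2)


for method, values in scores.items():
    computed = mean_score(values)
    status = "matches" if computed == listed_average[method] else "differs"
    print(f"{method}: computed {computed}, listed {listed_average[method]} ({status})")
```

Under this assumption the ranking is driven by all five metrics equally, which is why Prompt ranks fourth overall despite the highest Clarity (4.96) and Factuality (4.54) scores: its Depth score (3.17) pulls the mean down.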