Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Human Evaluation on NoveltyBench
Loading...
4.04
Quality
BACO
2.7816
3.1083
3.435
3.7617
Nov 7, 2025
Quality
Overall (%)
Diversity Format (%)
Content (%)
Creativity (%)
Updated 23h ago
Evaluation Results
Method
Method
Links
Quality
Overall (%)
Diversity Format (%)
Content (%)
Creativity (%)
BACO
Variant=best variant,...
2025.11
4.04
79
74.6
77.1
79.6
Aligned
Description=aligned mo...
2025.11
2.83
21
25.4
22.9
20.4
Feedback
Search any
task
Search any
task