Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Quality Assessment on Consolidated Evaluation Dimensions
Loading...
91.7
Quality Score
Council Mode
70.588
76.069
81.55
87.031
Apr 3, 2026
Quality Score
Updated 13d ago
Evaluation Results
Method
Method
Links
Quality Score
Council Mode
Latency (s)=8.4
2026.04
91.7
Claude Opus 4.6
Latency (s)=4.1
2026.04
81.5
GPT-5.4
Latency (s)=3.2
2026.04
78.3
Gemini 3.1 Pro
Latency (s)=2.8
2026.04
76.2
DeepSeek V3.2
Latency (s)=5.6
2026.04
73.8
Seed 2.0 Pro
Latency (s)=3.8
2026.04
71.4
Feedback
Search any
task
Search any
task