Chat Model Evaluation on NanoChat-D12 (CORE)
[Chart: CORE Score. Role swarm: 22.44, May 7, 2026]
Updated 26d ago
Evaluation Results
| Method | Date | CORE Score | Relative Improvement vs Start (%) | Trials | Valid Improvements |
|---|---|---|---|---|---|
| Role swarm | 2026.05 | 22.44 | 38.7 | 200 | 5 |
| Calibrated upstream start | 2026.05 | 16.18 | - | - | - |
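
The "Relative Improvement vs Start" value is consistent with a percentage gain of the Role swarm CORE Score over the calibrated upstream start score. The page does not state the formula, so the sketch below is an assumption checked only against the two scores reported here; the function name `relative_improvement` is hypothetical.

```python
def relative_improvement(score: float, start: float) -> float:
    """Percentage gain of `score` over the `start` baseline.

    Assumed formula (not stated on the page): (score - start) / start * 100.
    """
    if start == 0:
        raise ValueError("start score must be non-zero")
    return (score - start) / start * 100.0


# Scores taken from the Evaluation Results table.
role_swarm_core = 22.44
upstream_start_core = 16.18

print(round(relative_improvement(role_swarm_core, upstream_start_core), 1))  # 38.7
```

Under this assumption, the reported 38.7 matches the two CORE Scores to one decimal place.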