Complex Reasoning on 3 Complex Reasoning (test)
[Chart: results by method over time. Selectable metrics: LLM Calls, Tokens, Runtime (s), Cost Estimate ($). Shown: LLM Calls, with CoT at 3 on Mar 25, 2026.]
Updated 23d ago
Evaluation Results

| Method | LLM Calls | Tokens | Runtime (s) | Cost Estimate ($) |
|---|---|---|---|---|
| CoT (Model Engine=Claude So..., 2026.03) | 3 | 3,000 | 97 | 0.02 |
| EMoT (Model Engine=Claude So..., 2026.03) | 99 | 79,052 | 1,214 | 0.36 |
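To make the gap between the two methods easier to read, the reported figures can be turned into ratios against the CoT baseline. The following is a minimal sketch, not part of the benchmark page: the dictionary keys and the `overhead` helper are hypothetical names chosen for illustration, and the numbers are copied from the results above.

```python
# Figures copied from the Evaluation Results above.
# The key names (llm_calls, tokens, runtime_s, cost_usd) are illustrative assumptions.
results = {
    "CoT":  {"llm_calls": 3,  "tokens": 3_000,  "runtime_s": 97,    "cost_usd": 0.02},
    "EMoT": {"llm_calls": 99, "tokens": 79_052, "runtime_s": 1_214, "cost_usd": 0.36},
}


def overhead(method, baseline="CoT"):
    """Return each metric of `method` divided by the same metric of `baseline`."""
    base = results[baseline]
    return {metric: results[method][metric] / base[metric] for metric in base}


ratios = overhead("EMoT")
for metric, ratio in ratios.items():
    print(f"{metric}: {ratio:.1f}x")
# llm_calls: 33.0x
# tokens: 26.4x
# runtime_s: 12.5x
# cost_usd: 18.0x
```

On these figures, EMoT uses 33 times as many LLM calls as CoT and costs roughly 18 times as much per run. The page reports only resource usage, so these ratios say nothing about the relative accuracy of the two methods.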