Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Accuracy on BBH General Reasoning
Loading...
82.9
Accuracy
TDA-RC
71.772
74.661
77.55
80.439
Mar 13, 2026
Accuracy
Updated 11d ago
Evaluation Results
Method
Method
Links
Accuracy
TDA-RC
Base LLM=DeepSeek-V3
2026.03
82.9
TDA-RC
Base LLM=GPT-4o-mini
2026.03
82.5
TDA-RC
Base LLM=Qwen-Turbo
2026.03
82.2
Instruction Induction
Base LLM=DeepSeek-V3
2026.03
80.8
Instruction Induction
Base LLM=GPT-4o-mini
2026.03
80.5
HoT
Base LLM=DeepSeek-V3
2026.03
80.4
HoT
Base LLM=GPT-4o-mini
2026.03
80.1
Instruction Induction
Base LLM=Qwen-Turbo
2026.03
80.1
Role / Persona Prompting
Base LLM=DeepSeek-V3
2026.03
80.1
HoT
Base LLM=Qwen-Turbo
2026.03
80
Role / Persona Prompting
Base LLM=GPT-4o-mini
2026.03
79.9
Prompt Canvas
Base LLM=DeepSeek-V3
2026.03
79.7
Prompt Canvas
Base LLM=GPT-4o-mini
2026.03
79.6
Role / Persona Prompting
Base LLM=Qwen-Turbo
2026.03
79.5
Prompt Canvas
Base LLM=Qwen-Turbo
2026.03
79.2
Analogical Prompting
Base LLM=DeepSeek-V3
2026.03
72.8
Analogical Prompting
Base LLM=GPT-4o-mini
2026.03
72.5
Analogical Prompting
Base LLM=Qwen-Turbo
2026.03
72.2
Feedback
Search any
task
Search any
task