Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Knowledge & Reasoning on ARC-Challenge, ENEM, BLUEX, OAB Exams, BELEBELE, MMLU, GSM8K-PT
Loading...
56.22
K&R Score (NPM)
Tucano2-qwen-3.7B-Instruct
13.3304
24.4652
35.6
46.7348
Mar 3, 2026
K&R Score (NPM)
Updated 3mo ago
Evaluation Results
Method
Method
Links
K&R Score (NPM)
Tucano2-qwen-3.7B-Instruct
Variant=Instruct, Para...
2026.03
56.22
Jurema-7B
Variant=Instruct, Para...
2026.03
50.66
Qwen2.5-3B-Instruct
Variant=Instruct, Para...
2026.03
47.34
Gemma-3-Gaia-PT-BR-4b-it
Variant=Instruct, Para...
2026.03
45
SmolLM3-3B
Variant=Instruct, Para...
2026.03
43.99
Llama-3.2-3B-Instruct
Variant=Instruct, Para...
2026.03
43.08
Qwen3-4B
Variant=Instruct, Para...
2026.03
42.33
Qwen2.5-1.5B-Instruct
Variant=Instruct, Para...
2026.03
40.25
Tucano2-qwen-1.5B-Instruct
Variant=Instruct, Para...
2026.03
39.61
Qwen3-1.7B
Variant=Instruct, Para...
2026.03
28.24
Tucano2-qwen-0.5B-Instruct
Variant=Instruct, Para...
2026.03
27.77
Llama-3.2-1B-Instruct
Variant=Instruct, Para...
2026.03
15.37
Qwen3-0.6B
Variant=Instruct, Para...
2026.03
15.13
Qwen2.5-0.5B-Instruct
Variant=Instruct, Para...
2026.03
14.98
Feedback
Search any
task
Search any
task