Reasoning on Xcope bo (acc)
[Figure: Accuracy chart. Top entry: Ours-MoE-SFT, 65.4 (Jul 12, 2025).]
Updated 20d ago
Evaluation Results

| Method | Tags | Date | Accuracy |
| --- | --- | --- | --- |
| Ours-MoE-SFT | Model=Ours-MoE-SFT | 2025.07 | 65.4 |
| Ours-Base | | 2025.07 | 59.8 |
| Ours-Base | Model=Ours-Base | 2025.07 | 59.8 |
| Ours-Base-32k | context-length=32k | 2025.07 | 58.6 |
| Ours-Base-32k | Model=Ours-Base-32k | 2025.07 | 58.6 |
| Ours-SFT | alignment=SFT | 2025.07 | 57.8 |
| Ours-SFT | Model=Ours-SFT | 2025.07 | 57.8 |
| Ours-MoE-Base | Model=Ours-MoE-Base | 2025.07 | 57.8 |
| Ours-MoE-Base-8k | Model=Ours-MoE-Base-8k | 2025.07 | 57.2 |
| Yak-Llama2-7B | | 2025.07 | 53 |
| Qwen2.5-7B-base | Model=Qwen2.5-7B-base | 2025.07 | 51.8 |
| LLaMA3.1-8B-Instruct | Model=LLaMA3.1-8B-Inst... | 2025.07 | 51.6 |
| Qwen2.5-7B-Instruct | Model=Qwen2.5-7B-Instruct | 2025.07 | 51.6 |
| Tibetan-Alpaca-7B | | 2025.07 | 51.2 |
| Tibetan-Llama2-7B | | 2025.07 | 50.6 |
| Qwen3-8B | Model=Qwen3-8B | 2025.07 | 50.4 |
| DeepSeek-R1-Distill-Llama-8B | Model=DeepSeek-R1-Dist... | 2025.07 | 50.4 |