Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
SFT Generalization on Tulu3 SFT (Expanded)
Loading...
65.77
SFT Score
DeltaMoE
53.3732
56.5916
59.81
63.0284
May 18, 2026
SFT Score
Updated 15d ago
Evaluation Results
Method
Method
Links
SFT Score
DeltaMoE
Base Model=Qwen2.5-7B,...
2026.05
65.77
LLaMA-Pro
Base Model=Qwen2.5-7B,...
2026.05
64.97
MoLA
Base Model=Qwen2.5-7B,...
2026.05
64.25
Dense-FT-Delta
Base Model=Qwen2.5-7B,...
2026.05
62.25
Dense-FT-Avg
Base Model=Qwen2.5-7B,...
2026.05
61.55
Dense-FT-Delta-2FLOPs
Base Model=Qwen2.5-7B,...
2026.05
59.64
Dense-FT-Avg-2FLOPs
Base Model=Qwen2.5-7B,...
2026.05
58.32
Qwen-7B + Tulu-Delta
Base Model=Qwen2.5-7B,...
2026.05
53.85
Feedback
Search any
task
Search any
task