Share your thoughts, 1 month free Claude Pro on usSee more

Coding on TheoremQA

55.38Accuracy

InfiGFusion

Updated 9d ago

Evaluation Results

Method	Links
InfiGFusion 2025.05		55.38
InfiFusion 2025.05		54.62
Pivot-SFT 2025.05		54.5
FuseLLM 2025.05		53.52
FuseChat 2025.05		51.88
Phi-4 2025.05		51.12
Mistral-Small 2025.05		48.5
Qwen2.5-Instruct 2025.05		47.25
MiniLogit 2025.05		46.36
Qwen2.5-Coder 2025.05		38.88