Share your thoughts, 1 month free Claude Pro on usSee more

Medical Reasoning on MedQA and MedMCQA mixture

59.4Pass@1

FedAvg-PubSwap

Updated 1mo ago

Evaluation Results

Method	Links
FedAvg-PubSwap 2026.04		59.4
FedAvg-GRPO 2026.04		58.7
FedAvg-PubSwap 2026.04		58.5
FedAvg-GRPO 2026.04		58.2
FedAvg-PubSwap 2026.04		58.1
FedAvg-GRPO 2026.04		57.9
FedAvg-PubSwap 2026.04		57.5
FedAvg-GRPO 2026.04		56
Base model 2026.04		49.2
Base model 2026.04		49.2
Base model 2026.04		49.2
Base model 2026.04		49.2