Commonsense Reasoning on PIQA (non-IID distribution, alpha=0.1)

Accuracy: 74.1

FedAlign-MoE

Mar 22, 2026
Updated 25d ago

Evaluation Results

Date      Accuracy
2026.03   74.1
2026.03   73.25
2026.03   72.14
2026.03   71.36
2026.03   71.26
2026.03   70.08
2026.03   69.18
2026.03   68.78
2026.03   68.24
2026.03   67.92