Share your thoughts, 1 month free Claude Pro on usSee more

Home/Benchmarks

Multi-Task Reasoning on MMLU

56.7Pass@1

DCRL

Updated 4mo ago

Evaluation Results

Method	Links
DCRL 2026.03		56.7
DCRL 2026.03		52.6
DCRL 2026.03		33.9

SOTA Paper

DCRL

Dual Consensus: Escaping from Spurious Majority in Unsupervised RLVR via Two-Stage Vote Mechanism

Dataset

MMLU

Follow for update

@wizwand_team Discord

© 2026 wizwand

Blog Contact Changelog Swarm

Privacy Policy Terms of Service FAQs Swarm Docs