Zero-shot Language Understanding on ARC-c, ARC-e, PIQA, Winogrande, and Hellaswag

6,257 Mean Accuracy

RIA+SQ+VC+EBFT

Jul 3, 2025
Updated 1mo ago

Evaluation Results

Date       Mean Accuracy
2025.07    6,257
2025.07    6,215
2025.07    6,191
2025.07    6,181
2025.07    6,127
2025.07    6,118
2025.07    6,112
2025.07    6,108
2025.07    6,104
2025.07    6,101
2025.07    6,095
2025.07    6,056
2025.07    6,020
2025.07    5,974
2025.07    5,941
2025.07    5,916
2025.07    5,872
2025.07    5,853
2025.07    5,787
2025.07    5,784
2025.07    5,748
2025.07    5,740
2025.07    5,644
2025.07    5,597
2025.07    64.79