Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Reasoning and Question Answering on BoolQ, RTE, HellaSWAG, ARC, OpenBookQA, and PiQA

67.24Avg Accuracy

Before finetune

34.32442.869551.41559.9605Jun 12, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.06
67.24
2024.06
66.93
2024.06
66.85
2024.06
61.77
2024.06
61.14
2024.06
58.72
2024.06
58.34
2024.06
56.59
2024.06
56.34
2024.06
55.41
2024.06
54.73
2024.06
53.91
2024.06
53.78
2024.06
48.12
2024.06
35.59