Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Commonsense Reasoning on BoolQ (Inter/Intra-Layer Filtering)

67Accuracy (Inter-Layer Filtering)

CRFT

50.04854.44958.8563.251Jul 14, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.07
6767.9-
2025.07
6466.6-
2025.07
6366.4-
2025.07
6366.5-
2025.07
62.562.5-
2025.07
62.565-
2025.07
62.466.2-
2025.07
62.364.8-
2025.07
62.162.1-
2025.07
62.160.8-
2025.07
6254.3-
2025.07
60.561.8-
2025.07
6053.7-
2025.07
6059.7-
2025.07
50.750.7-
2025.05
--81.47
2025.05
--63.39
2025.05
--80.89
2025.05
--80
2025.05
--81.96
2025.05
--82.01
2025.05
--82.11
2025.05
--82.45