Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Boolean Question Answering on BoolQ (acc_norm)

88.7Acc (Normalized)

Full model

-19.0448.92836.964.872Apr 6, 2026Apr 10, 2026Apr 14, 2026Apr 18, 2026Apr 22, 2026Apr 26, 2026May 1, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.05
88.7
2026.05
88.5
2026.05
86.6
2026.04
85.3
2026.04
84.3
2026.04
84.3
2026.05
83.5
2026.04
82.5
2026.05
80.5
2026.04
60.5
2026.04
-0.8
2026.04
-1.2
2026.04
-1.4
2026.04
-2.1
2026.04
-2.3
2026.04
-2.5
2026.04
-7.8
2026.04
-9.2
2026.04
-14.6
2026.04
-14.9