Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Complex Reasoning on SciFact (val)

71.15Macro-F1

EvoPool

24.287636.453848.6260.7862Jun 1, 2026
Updated 1d ago

Evaluation Results

MethodLinks
2026.06
71.15
2026.06
70.4
2026.06
34.38
2026.06
26.09