Share your thoughts, 1 month free Claude Pro on usSee more

Complex Reasoning on SciFact (val)

71.15Macro-F1

EvoPool

Updated 1d ago

Evaluation Results

Method	Links
EvoPool 2026.06		71.15
LLM annotation 2026.06		70.4
Alchemist 2026.06		34.38
DataSculpt 2026.06		26.09