Probabilistic Forecasting on Metaculus and Polymarket (test)

Brier Score: 0.061 (Human Baseline)

[Chart: Brier Score; axis ticks 0.05284, 0.10792, 0.163, 0.21808; date marker May 26, 2026]
Updated 6d ago
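
The page does not state how the Brier Score is computed. A minimal sketch, assuming the standard binary form (the mean squared difference between each forecast probability and the resolved 0/1 outcome), might look like the following. The function name `brier_score` and the sample forecasts are hypothetical and are not taken from the leaderboard.

```python
from typing import Sequence


def brier_score(probs: Sequence[float], outcomes: Sequence[int]) -> float:
    """Mean squared difference between forecast probabilities and 0/1 outcomes.

    Assumes binary questions; lower is better, and 0.0 is a perfect score.
    """
    if len(probs) == 0 or len(probs) != len(outcomes):
        raise ValueError("probs and outcomes must be non-empty and of equal length")
    return sum((p - o) ** 2 for p, o in zip(probs, outcomes)) / len(probs)


# Hypothetical forecasts and resolutions, for illustration only.
forecasts = [0.9, 0.2, 0.5, 0.7]
resolved = [1, 0, 1, 0]
score = brier_score(forecasts, resolved)
print(f"Brier score: {score:.4f}")
```

Under this definition, a forecaster who always answers 0.5 scores 0.25 on any set of binary questions, which gives a rough reference point for reading the scores on this page.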

Evaluation Results

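| Method | Brier Score | (unlabeled) | (unlabeled) | (unlabeled) | (unlabeled) | Date | Links |
|---|---|---|---|---|---|---|---|
| | 0.061 | 92.3 | 95.8 | 0.055 | - | 2026.05 | |
| | 0.125 | 83.7 | 74.2 | 0.027 | 8.775 | 2026.05 | |
| | 0.128 | 83.3 | 73.2 | 0.036 | 9.004 | 2026.05 | |
| | 0.129 | 82.7 | 72.3 | 0.034 | - | 2026.05 | |
| | 0.129 | 83.2 | 72.4 | 0.038 | - | 2026.05 | |
| | 0.133 | 83.3 | 68.6 | 0.046 | 9.402 | 2026.05 | |
| | 0.135 | 82.9 | 67.9 | 0.045 | 9.526 | 2026.05 | |
| | 0.135 | 83.2 | 68.4 | 0.044 | 11.085 | 2026.05 | |
| | 0.137 | 82.3 | 66.2 | 0.04 | - | 2026.05 | |
| | 0.137 | 82.9 | 67.7 | 0.046 | - | 2026.05 | |
| | 0.137 | 82.8 | 67.3 | 0.05 | 9.73 | 2026.05 | |
| | 0.138 | 82.2 | 65.4 | 0.043 | - | 2026.05 | |
| | 0.138 | 81.6 | 67.1 | 0.054 | 11.564 | 2026.05 | |
| | 0.138 | 82.4 | 66.2 | 0.044 | 11.55 | 2026.05 | |
| | 0.139 | 82.9 | 65.5 | 0.051 | - | 2026.05 | |
| | 0.141 | 81.8 | 63.8 | 0.054 | - | 2026.05 | |
| | 0.142 | 82.7 | 66.1 | 0.079 | - | 2026.05 | |
| | 0.143 | 80 | 73.6 | 0.1 | - | 2026.05 | |
| | 0.143 | 81.3 | 70 | 0.097 | - | 2026.05 | |
| | 0.146 | 79.9 | 72.3 | 0.104 | - | 2026.05 | |
| | 0.151 | 78.2 | 66.9 | 0.099 | - | 2026.05 | |
| | 0.151 | 82.3 | 63.3 | 0.094 | - | 2026.05 | |
| | 0.157 | 77.7 | 65.5 | 0.119 | - | 2026.05 | |
| | 0.157 | 79.4 | 66.3 | 0.084 | - | 2026.05 | |
| | 0.158 | 79.6 | 66.1 | 0.111 | - | 2026.05 | |
| | 0.169 | 75.5 | 66.1 | 0.148 | - | 2026.05 | |
| | 0.185 | 75 | 63.3 | 0.164 | - | 2026.05 | |
| | 0.222 | 77.2 | 61.9 | 0.22 | - | 2026.05 | |
| | 0.232 | 76.1 | 66.4 | 0.23 | - | 2026.05 | |
| | 0.265 | 72.6 | 65.6 | 0.265 | - | | |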