
Binary decision task on LAPOP (test)

Accuracy: 70.22

OG-MAR

Jan 29, 2026
Updated 1mo ago

Evaluation Results

Date      Accuracy
2026.01   70.22
2026.01   63.85
2026.01   62.68
2026.01   62.68
2026.01   62.25
2026.01   60.85
2026.01   60.05
2026.01   59.13
2026.01   58.52
2026.01   57.6
2026.01   56.74
2026.01   55.51
2026.01   53.68
2026.01   53.39
2026.01   53.31
2026.01   53.06
2026.01   52.68
2026.01   50.06
2026.01   49.75
2026.01   49.39
2026.01   49.08
2026.01   47.12
2026.01   46.02
2026.01   43.32