Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Classification on Classification task dataset

31.3Tok-F1

LatentQA

11.43616.59321.7526.907May 25, 2026
Updated 8d ago

Evaluation Results

MethodLinks
2026.05
31.328.5
2026.05
31.128.5
2026.05
30.928.4
2026.05
30.728.3
2026.05
29.827.6
2026.05
29.827.3
2026.05
29.227.9
2026.05
29.127.1
2026.05
28.426.6
2026.05
15.619.2
2026.05
12.619
2026.05
12.316.4
2026.05
12.218.7