Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-round Co-reference Resolution on Long-context benchmarks

38.5Score (8k Context)

CLP

23.4227.33531.2535.165Mar 18, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
38.533.931.933.930.1
2026.03
36.334.131.234.626.7
2026.03
35.735.533.429.217.2
2026.03
35.734.932.529.98.9
2026.03
3526.116.613.78.7
2026.03
34.630.428.926.321.3
2026.03
3429.831.931.625.5
2026.03
32.931.525.824.219.3
2026.03
32.531.826.925.919
2026.03
32.430.730.82111.4
2026.03
32.431.529.625.811.6
2026.03
3230.929.922.414.8
2026.03
29.72728.324.517
2026.03
26.822.21917.812.8
2026.03
26.722.817.613.911
2026.03
26.420.71615.69.4
2026.03
26.422.71815.45.6
2026.03
26.322.819.719.114.3
2026.03
26.323.118.618.78.9
2026.03
25.618.78.39.76.1
2026.03
2419.416.51312.1