Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Rhetorical Role Labeling on SCOTUSSteps
Loading...
56.2
Macro-F1
Gold Prototypes
44.8016
47.7608
50.72
53.6792
Mar 4, 2026
Macro-F1
Weighted-F1
Updated 1mo ago
Evaluation Results
Method
Method
Links
Macro-F1
Weighted-F1
Gold Prototypes
Evaluation protocol=op...
2026.03
56.2
69.86
PCM
Sampling strategy=Rand...
2026.03
54.62
67.55
PCM
Sampling strategy=Supe...
2026.03
54.4
67.79
PCM
Sampling strategy=Full...
2026.03
54.03
67.54
PBR
2026.03
50.48
65.73
Baseline (HSLN)
2026.03
46.7
63.21
Mind
2026.03
45.24
62.78
Feedback
Search any
task
Search any
task