Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Rhetorical Role Labeling on SCOTUSCategory
Loading...
85.2
Macro-F1
Gold Prototypes
82.1008
82.9054
83.71
84.5146
Mar 4, 2026
Macro-F1
Weighted-F1
Updated 1mo ago
Evaluation Results
Method
Method
Links
Macro-F1
Weighted-F1
Gold Prototypes
Evaluation protocol=op...
2026.03
85.2
90.02
PCM
Sampling strategy=Supe...
2026.03
84.13
89.75
PCM
Sampling strategy=Full...
2026.03
83.96
89.8
PCM
Sampling strategy=Rand...
2026.03
83.93
89.7
PBR
2026.03
83.69
89.75
Mind
2026.03
83.46
89.2
Baseline (HSLN)
2026.03
82.22
88.35
Feedback
Search any
task
Search any
task