Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Rhetorical Role Labeling on LEGALEVAL
Loading...
91.71
Macro F1
Gold Prototypes
78.3044
81.7847
85.265
88.7453
Mar 4, 2026
Macro F1
Weighted F1
Updated 1mo ago
Evaluation Results
Method
Method
Links
Macro F1
Weighted F1
Gold Prototypes
Evaluation protocol=op...
2026.03
91.71
99.57
PBR
2026.03
82.5
93.17
PCM
Sampling strategy=Rand...
2026.03
81.83
91.57
PCM
Sampling strategy=Full...
2026.03
81.41
91.21
PCM
Sampling strategy=Supe...
2026.03
80.77
91
Mind
2026.03
79.8
91.25
Baseline (HSLN)
2026.03
78.82
90.94
Feedback
Search any
task
Search any
task