Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Medical transcription post-processing on AN
Loading...
-2.2
ΔWDER
1P-S
-2.44
-0.82
0.8
2.42
Feb 16, 2026
ΔWDER
p-value
Updated 1mo ago
Evaluation Results
Method
Method
Links
ΔWDER
p-value
1P-S
N pass=1, Strategy=SR-...
2026.02
-2.2
0.13
9P-S
N pass=9, Strategy=SR-...
2026.02
-0.8
0.84
3P-W
N pass=3, Strategy=WR-...
2026.02
-0.4
1
4P-S
N pass=4, Strategy=SR-...
2026.02
-0.4
0.69
8P-S
N pass=8, Strategy=SR-...
2026.02
-0.3
0.94
2P-S
N pass=2, Strategy=SR-...
2026.02
0.1
0.31
5P-S
N pass=5, Strategy=SR-...
2026.02
0.2
0.2
6P-S
N pass=6, Strategy=SR-...
2026.02
1.1
0.22
Baseline
N pass=0, Backbone=Qwe...
2026.02
1.7
1
1P-S
N pass=1, Strategy=SR-...
2026.02
1.8
1
1P-S
N pass=1, Strategy=SR-...
2026.02
2
0.77
7P-S
N pass=7, Strategy=SR-...
2026.02
2.6
0.22
2P-S-FS
N pass=2, Strategy=Few...
2026.02
3.8
0.16
Feedback
Search any
task
Search any
task