Word-level Fine-grained Quality Estimation on en-ru
[Chart: leading result RIEQE, MCC 0.3583, dated May 29, 2026. Metrics shown: MCC, F1 Score, Precision, Recall.]
Evaluation Results
| Method | Category | Date | MCC | F1 Score | Precision | Recall |
|---|---|---|---|---|---|---|
| RIEQE | Our Method | 2026.05 | 0.3583 | 40.92 | 33.25 | 53.18 |
| DCSQE | Baselines | 2026.05 | 0.351 | 37.33 | 29.84 | 49.85 |
| RIEQE-NonThinking | Our Method, T... | 2026.05 | 0.3434 | 38.71 | 31.25 | 50.86 |
| xComet-XXL | Baselines | 2026.05 | 0.263 | 32.86 | 27.58 | 40.67 |
| Qwen3-4B-Thinking-2507 | Backbone Mode... | 2026.05 | 0.2508 | 31.2 | 19.59 | 50.45 |
| GPT-5.5 | Baselines | 2026.05 | 0.2038 | 25.41 | 15.84 | 64.18 |
| Qwen3-4B-Thinking-2507 | Backbone Model | 2026.05 | 0.1829 | 25.33 | 16.83 | 51.26 |