Share your thoughts, 1 month free Claude Pro on usSee more

Error Span Detection on WMT24 (test)

84.8SPA

Llama-MBR-SOFTF1

Updated 4mo ago

Evaluation Results

Method	Links
Llama-MBR-SOFTF1 2025.12		84.8	57.1	93.2	51.3
xCOMET-Reg 2025.12		84.4	58.1	-	-
xCOMET-QE-Reg 2025.12		82.5	54.9	-	-
Llama-MAP 2025.12		82.3	56.8	91.9	53.1
xCOMET-ESD 2025.12		75.7	55.3	88.9	30.2
xCOMET-QE-ESD 2025.12		68.8	54.1	87.9	28.9