Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Story Completion on MTG Story Completion (test)
Loading...
31.2
ROUGE-L (ES)
tgt-dev
28.808
29.429
30.05
30.671
May 27, 2023
ROUGE-L (ES)
ROUGE-L (DE)
ROUGE-L (FR)
ROUGE-L (ZH)
Score Delta
Updated 4d ago
Evaluation Results
Method
Method
Links
ROUGE-L (ES)
ROUGE-L (DE)
ROUGE-L (FR)
ROUGE-L (ZH)
Score Delta
tgt-dev
Model Selection Criter...
2023.05
31.2
30.5
35.6
28.1
0
cos-sim
Model Selection Criter...
2023.05
30.8
30.3
35.6
26.4
-0.58
en-dev
Model Selection Criter...
2023.05
28.9
27.8
28.9
20.3
-4.88
Feedback
Search any
task
Search any
task