RDF-to-text generation on GEM2 Fictional (test)
[Chart: Grammaticality scores by date. Highlighted point: Llama 3.3, 93.7 (Dec 20, 2025). Selectable metrics: Grammaticality, Additions, Omissions.]
Updated 4d ago
Evaluation Results
| Method | Configuration | Date | Grammaticality | Additions | Omissions |
|---|---|---|---|---|---|
| Llama 3.3 | Architecture=Neural model | 2025.12 | 93.7 | 1.8 | 9.6 |
| GPT-4.1 | Architecture=Rule-base... | 2025.12 | 73.8 | 3.6 | 9.8 |
| Qwen 3 235B | Architecture=Rule-base... | 2025.12 | 63.2 | 4.3 | 6.6 |
| BART | Architecture=Neural model | 2025.12 | 61.9 | 58 | 59.9 |
| Qwen 2.5 72B | Architecture=Rule-base... | 2025.12 | 60.3 | 3 | 9.8 |
| Llama 3.3 70B | Architecture=Rule-base... | 2025.12 | 55.1 | 6.5 | 20.8 |