Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
RDF-to-text generation on GEM2 Counterfactual (test)
Loading...
81.8
Grammaticality
Llama 3.3
37.496
48.998
60.5
72.002
Dec 20, 2025
Grammaticality
Additions
Omissions
Updated 4d ago
Evaluation Results
Method
Method
Links
Grammaticality
Additions
Omissions
Llama 3.3
Architecture=Neural model
2025.12
81.8
20.9
8
GPT-4.1
Architecture=Rule-base...
2025.12
51.7
6.9
12.8
Qwen 2.5 72B
Architecture=Rule-base...
2025.12
44
5.4
9.1
BART
Architecture=Neural model
2025.12
42.6
61.3
62.2
Llama 3.3 70B
Architecture=Rule-base...
2025.12
41.9
7
13.8
Qwen 3 235B
Architecture=Rule-base...
2025.12
39.2
7.1
10.6
Feedback
Search any
task
Search any
task