Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
RDF-to-text generation on GEM2 Counterfactual (test)
Loading...
81.8
Grammaticality
Llama 3.3
37.496
48.998
60.5
72.002
Dec 20, 2025
Grammaticality
Additions
Omissions
Updated 1mo ago
Evaluation Results
Method
Method
Links
Grammaticality
Additions
Omissions
Llama 3.3
Architecture=Neural model
2025.12
81.8
20.9
8
GPT-4.1
Architecture=Rule-base...
2025.12
51.7
6.9
12.8
Qwen 2.5 72B
Architecture=Rule-base...
2025.12
44
5.4
9.1
BART
Architecture=Neural model
2025.12
42.6
61.3
62.2
Llama 3.3 70B
Architecture=Rule-base...
2025.12
41.9
7
13.8
Qwen 3 235B
Architecture=Rule-base...
2025.12
39.2
7.1
10.6
Feedback
Search any
task
Search any
task