Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
RDF-to-text generation on WebNLG All standard (test)
Loading...
0.4352
BLEU
Fine-tuned BART
0.279824
0.320162
0.3605
0.400838
Dec 20, 2025
BLEU
METEOR
BERTScore
BLEURT
Updated 4d ago
Evaluation Results
Method
Method
Links
BLEU
METEOR
BERTScore
BLEURT
Fine-tuned BART
Inter-pretability=fals...
2025.12
0.4352
0.6791
0.9308
0.1275
Rule-based NLG (trained by Qwen 3 235B)
Inter-pretability=true...
2025.12
0.3939
0.6759
0.929
0.1767
Rule-based NLG (trained by GPT-4.1)
Inter-pretability=true...
2025.12
0.3934
0.7069
0.9291
0.1841
Prompted Llama 3.3 70B
Inter-pretability=fals...
2025.12
0.3616
0.6887
0.9255
0.1058
Rule-based NLG (trained by Qwen 2.5 72B)
Inter-pretability=true...
2025.12
0.3309
0.6531
0.9224
0.1193
Rule-based NLG (trained by Llama 3.3 70B)
Inter-pretability=true...
2025.12
0.2858
0.6578
0.9179
0.0762
Feedback
Search any
task
Search any
task