ToTTo

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Data-to-Text Generation | ToTTo | BLEU 52.28 | 18 |
| Loosely controlled table-to-text generation | ToTTO Logic2Text-style (test) | BLEU 52.7 | 15 |
| Table-to-Text Generation | ToTTo Over (test) | BLEURT 0.364 | 15 |
| Table-to-Text Generation | ToTTo Non (test) | BLEURT Score 0.116 | 15 |
| Table-to-Text Generation | ToTTo All (dev) | BLEURT 0.24 | 15 |
| Table-to-Text Generation | ToTTo (test) | BLEURT Score 0.24 | 15 |
| Data-to-Text Generation | ToTTo full (test) | BLEU 50.8 | 12 |
| Loosely controlled table-to-text generation | ToTTO Logic2Text-style (dev) | BLEU 46.2 | 10 |
| Tightly controlled table-to-text generation | ToTTO official (TestN) | BLEU 48.7 | 10 |
| Cell-Level Attribution | ToTTo | Precision 74.2 | 6 |
| Column-Level Attribution | ToTTo | Precision 92.7 | 6 |
| Row-Level Attribution | ToTTo | Precision 77 | 6 |
| Cell-level attribution | ToTTo (gold set) | Precision 56.89 | 6 |
| Open-ended table-to-text generation | ToTTO Logic2Text-style (test) | BLEU 0.247 | 5 |
| Open-ended table-to-text generation | ToTTO Logic2Text-style (TestO) | BLEU 28.9 | 5 |
| Open-ended table-to-text generation | ToTTO Logic2Text-style (TestN) | BLEU 20.5 | 5 |
| Tightly controlled table-to-text generation | ToTTO official (test) | BLEU 56.7 | 5 |
| Tightly controlled table-to-text generation | ToTTO official (dev) | BLEU 49 | 5 |
| Data-to-text generation | Totto Non-overlap (dev) | BLEU Score 41.5 | 5 |
| Data-to-text generation | Totto All (dev) | BLEU Score 49.2 | 5 |
| Table-to-text generation | ToTTo (human evaluation) | TControl 89 | 4 |
| Table-to-Text Generation | ToTTo (dev) | BLEU 48.95 | 4 |
| Data-to-text generation | Totto All (test) | BLEU 49.5 | 3 |
| Table-to-text generation | ToTTo Non-Overlap (test) | BLEU 41.4 | 3 |