LDC

Benchmarks

Task Name	Dataset Name	SOTA Result
AMR-to-text generation	LDC2017T10 (test)	BLEU49.72	55
Semantic Segmentation	LDC GL IQ LV	F1 Score52	12
Long Document Classification	LDC benchmark	Overall Performance (HYP)93.8	7
Authorship Verification	LDC Harder	AUC0.935	6
Authorship Verification	LDC Hard	AUC87.2	6
Authorship Verification	LDC Base	AUC86.1	6
AMR Parsing	LDC2017T10 (test)	Smatch (ordinary)74.4	6
Data-to-Text Generation	LDC2017T10	Fluency Score5.05	5
Machine Translation	LDC Chinese-English (test)	BLEU40.02	3
Construct Validity Assessment	LDC sample	Acceptance64	1

Showing 10 of 10 rows