Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

tldr_news

Benchmarks

Task NameDataset NameSOTA ResultTrend
Faithfulness Measurementtldr_news
BLEU79.4
12
Faithfulness Evaluationtldr_news 800 samples
BLEU79.5
5
Explanation Generationtldr_news avg prompt instance
Latency (s)15.397
2
Showing 3 of 3 rows