Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CUAD

Benchmarks

Task NameDataset NameSOTA ResultTrend
Legal Contract RevisionCUAD
CQ85.82
25
RetrievalCUAD
Recall91.25
13
Question AnsweringCUAD
ANLS0.2498
13
Legal text generationCUAD
ROUGE-L Score55.77
10
Retrieval-Augmented GenerationCuad
Faithfulness76
5
Binary ClassificationCUAD 1.0 (test)
Precision25.6
4
Contract ReviewCUAD (test)
F1 Score88.3
3
Multi-hop QACUAD GE (test)
Token F138.58
2
Multi-hop QACUAD DS (test)
Token F137.19
2
Showing 9 of 9 rows