Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Dataset C

Benchmarks

Task NameDataset NameSOTA ResultTrend
PDE SolvingDataset C Out-of-distribution 1.0 (test)
Relative L2 Error2.63
13
PDE SolvingDataset C In-distribution 1.0 (test)
Relative L2 Error0.03
13
Clause-level privacy policy analysisDataset C
F1 Score73
4
ReconstructionDataset C (held-out)
Mean Reconstruction Score0.193
2
Autonomous rollout classificationDataset C Lissajous
Mean Train Acc99.98
1
Showing 5 of 5 rows