Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Consistency Evaluation Dataset

Benchmarks

Task NameDataset NameSOTA ResultTrend
Structural ConsistencyConsistency Evaluation Dataset (N=720) (test)
Structural Consistency Score1
54
Consistency EvaluationConsistency Evaluation Dataset (N=720) 1.0 (test)
Metric-
0
Showing 2 of 2 rows