Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PI

Benchmarks

Task NameDataset NameSOTA ResultTrend
Indirect permutation problemPI10
TA100
5
Indirect permutation problemPI1-10
TA99.99
5
Natural Language InferencePI (OOD)
Accuracy84.38
4
Showing 3 of 3 rows