Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

FaultyScience

Benchmarks

Task NameDataset NameSOTA ResultTrend
Issue RecognitionFaultyScience (test)
Performance95
12
Fault-recognitionFaultyScience
Accuracy67.8
6
Showing 2 of 2 rows