Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Iris

Benchmarks

Task NameDataset NameSOTA ResultTrend
Robustness VerificationIris dataset (test)
Vulnerable Samples0
90
Robustness verificationIris
Vulnerable Samples Count0
50
Robustness VerificationIris Sigmoid Network
Vulnerable Samples0
50
ClusteringIris
ARI0.9
29
ClusteringIris
SSE95.87
24
Counterfactual Explanation GenerationIris
R Score1
23
Hierarchical Agglomerative Clusteringiris
Dendrogram Purity94.3
20
ClassificationIris (test)
Accuracy100
18
Feature AttributionIRIS
INFD0.007
17
Pairwise-constrained clusteringIris UCI (full)
SSE83.72
15
Explanation RegularityIris
Regularity0.838
11
Explanation Fidelity EstimationIris
Fidelity (R2 Score)0.86
11
Explanation StabilityIris
Stability Score100
11
Machine UnlearningIris 2% (forgotten)
UQI0.312
11
Machine UnlearningIris subset 2% (test)
Accuracy100
11
Machine UnlearningIris subset 2% (retained)
Accuracy94
11
Full-class forgettingIris (test)
Accuracy93.3
11
Full-class forgettingIris (retain)
Acc90
11
ClusteringIris 4D
ARI100
10
Cluster ValidationIris
PWRS Score88.6
10
Multiclass Classificationiris
Accuracy97.33
10
Visual Question AnsweringIRIS 1.0 (ambiguous trials)
Accuracy (Image Only)59.3
10
Multiclass classificationIRIS subsampled to 100 3 classes (train)
R50070
9
Continual ClusteringIris
AI NMI0.871
9
Tabular Classificationiris (61) (test)
Test Error Rate2.7
9
Showing 25 of 63 rows