| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Graph property detection | Ham Huge House of Graphs (test) | F1 Score: 93 | 17 |
| Graph property detection | Ham Large House of Graphs (test) | F1 Score: 95 | 17 |
| Graph property detection | Ham Medium House of Graphs (test) | F1 Score: 98 | 17 |
| Graph property detection | Ham Small House of Graphs (test) | F1 Score: 94 | 17 |
| Melanoma Classification | HAM Dermoscopic (test) | Recall: 92.81 | 16 |
| Gradient Inversion Attack | HAM10000 | PSNR: 120.93 | 14 |
| Medical Logical Reasoning (Rule 1: is_malignant) | HAM 5000 samples (test) | F1-score (NS-CL): 100 | 10 |
| Image Classification | HAM (test) | Accuracy: 83.18 | 9 |
| Global Counterfactual Summary for Time-Series Clustering | Ham UCR Archive | Effectiveness: 51.8 | 6 |
| Medical Image Classification | HAM-7 Dermoscopy | W_F1: 0.805 | 6 |
| Local Counterfactual Generation | Ham (UCR) | Effectiveness: 100 | 5 |
| Time Series Classification | Ham (test) | Accuracy: 83.2 | 5 |
| Medical Logical Reasoning (Rule 3: is_mel_on_back) | HAM 5000 samples (test) | F1 (NS-CL): 99.8 | 4 |
| Time Series Clustering | HAM | Rand Index: 0.527 | 4 |
| Medical Logical Reasoning (Rule 2: is_mel_or_bcc) | HAM 5000 samples (test) | F1 Score (NS-CL): 100 | 3 |
| Correlation Analysis of Leakage Metrics against Intervention Performance (Sint) | HAM10K | CTL Pearson r: 0.49 | 1 |