| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Realistic color video completion | News 144×176×3×30 | PSNR 38.6 | 70 |
| Tensor Completion | News 144×176×100 | PSNR 34.9 | 35 |
| Treatment Effect Estimation | NEWS semi-synthetic | Mean Error 0 | 22 |
| Treatment Effect Estimation | NEWS semi-synthetic (test) | MSE 0 | 22 |
| Summarization | news multi | ROUGE-L 23.66 | 21 |
| Named Entity Recognition | NEWS | F1 Score 86.15 | 21 |
| English-German document-level translation | News English-German (test) | s-BLEU 30.34 | 20 |
| Passage Reranking | News BEIR | NDCG@10 49.32 | 19 |
| Information Retrieval | news | Recall@100 52.7 | 19 |
| Marginal Distribution Alignment | News | Error Rate 1.72 | 18 |
| Tabular Data Synthesis | News | C2ST 97.93 | 18 |
| Tabular Data Synthesis | News | Pairwise Correlation Alignment Error 1.34 | 18 |
| Tabular Data Generation | News | DCR-0021.0325 | 18 |
| News Recommendation | NEWS (test) | AUC 64.68 | 18 |
| Out-of-Distribution Detection | News (test) | AUROC 80.7 | 17 |
| Out-of-Distribution Detection | News | FPR 69.31 | 17 |
| Regression | News (test) | MSE 0.69 | 17 |
| Privacy Preservation | News (test) | DCR Score 99 | 16 |
| LLM Unlearning | NEWS | Verification Memory (VerMem) 22.09 | 16 |
| Individual Treatment Effect (ITE) Estimation | NEWS (out) | PEHE 0.44 | 16 |
| Individual Treatment Effect (ITE) Estimation | NEWS (in) | PEHE 0.25 | 16 |
| ATE estimation | News | Joint Bias (JB) 0.07 | 14 |
| Dosage Policy Estimation (DPE) | News (test) | Mean DPE 2.69 | 12 |
| Single change-point detection | News | WD 0.12 | 12 |
| Machine Text Detection | News | Claude 3.5 Rewrite AUC 1 | 11 |