| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Realistic color video completion | News 144×176×3×30 | PSNR 38.6 | 70 |
| Tensor Completion | News 144 x 176 x 100 | PSNR 34.9 | 35 |
| Treatment Effect Estimation | NEWS semi-synthetic | Mean Error 0 | 22 |
| Treatment Effect Estimation | NEWS semi-synthetic (test) | MSE 0 | 22 |
| Summarization | news multi | Rouge-L 23.66 | 21 |
| Named Entity Recognition | NEWS | F1 Score 86.15 | 21 |
| English-German document-level translation | News English-German (test) | s-BLEU 30.34 | 20 |
| Information Retrieval | news | Recall@100 52.7 | 19 |
| Tabular Data Generation | News | DCR-0021.0325 | 18 |
| News Recommendation | NEWS (test) | AUC 64.68 | 18 |
| Out-of-Distribution Detection | News (test) | AUROC 80.7 | 17 |
| Out-of-Distribution Detection | News | FPR 69.31 | 17 |
| Regression | News (test) | MSE 0.69 | 17 |
| LLM Unlearning | NEWS | Verification Memory (VerMem) 22.09 | 16 |
| Individual Treatment Effect (ITE) Estimation | NEWS (out) | PEHE 0.44 | 16 |
| Individual Treatment Effect (ITE) Estimation | NEWS (in) | PEHE 0.25 | 16 |
| ATE estimation | News | Joint Bias (JB) 0.07 | 14 |
| Machine Text Detection | News | Claude 3.5 Rewrite AUC 1 | 11 |
| Misclassification Detection | News | ROC-AUC (Misclassification Detection) 88.8 | 10 |
| Tabular Data Synthesis | News | Rank 1 | 10 |
| Named Entity Recognition | News (test) | F1 Score 80.86 | 10 |
| Retrieval Question Answering | News in-domain | MRR 46.6 | 10 |
| Tabular Data Generation | News | Beta Recall 43.1 | 9 |
| Tabular Data Generation | News | alpha-PRECISION 98.24 | 9 |
| Tabular Data Privacy Evaluation | News | DCR-0050.0001 | 9 |