| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Abstract Screening | Review 1 821 abstracts (Final Includes) | False Positives45 | 8 | |
| Full-Text Screening | Review 1 | False Positives18 | 8 | |
| Document-Level Anomaly Detection | Review (test) | AUROC0.9594 | 7 | |
| Token-Level Anomaly Detection | Review (test) | AUROC0.8271 | 7 | |
| scoring | Review-5K | MAE1.957 | 5 | |
| Full-text inclusion screening | Review 2 (7741 abstracts) | False Positives (FP)87 | 5 | |
| Abstract Screening | Review 2 (Final Includes) | Metric- | 0 |