| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| AIGT detection | HC3 PWWS attack, AI to Human (in-domain) | Overall Accuracy100 | 28 | |
| AI-generated text detection | HC3 (test) | F1 (Overall)99.93 | 18 | |
| AIGT detection | HC3 Pruthi attack Overall (in-domain) | Overall Accuracy100 | 14 | |
| AIGT detection | HC3 Pruthi attack Human to AI (in-domain) | OA1 | 14 | |
| AIGT detection | HC3 Pruthi attack AI to Human (in-domain) | Overall Accuracy100 | 14 | |
| AIGT detection | HC3 Deep-Word-Bug attack Overall (in-domain) | OA100 | 14 | |
| AIGT detection | HC3 Deep-Word-Bug attack Human to AI (in-domain) | Overall Accuracy100 | 14 | |
| AIGT detection | HC3 Deep-Word-Bug attack AI to Human (in-domain) | Overall Accuracy100 | 14 | |
| AIGT detection | HC3 PWWS attack, Human to AI (in-domain) | OA100 | 14 | |
| Machine-Generated Text Detection | HC3 COLING2025 (val) | AUC99.99 | 13 | |
| Machine-Generated Text Detection | HC3 (full) | Accuracy99.92 | 9 | |
| Machine-generated text detection | HC3 Cross-domain | Score (Medicine)80.48 | 3 |