| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Grammatical Error Correction | JFLEG | GLEU63.3 | 47 | |
| Grammatical Error Correction | JFLEG (test) | GLEU64.9 | 45 | |
| IPI sanitization | Jfleg RTE (unseen) | ASR0 | 20 | |
| Sentence Level Quality Estimation | JFLEG (test) | GLEU61.61 | 12 | |
| Indirect Prompt Injection Detection | Jfleg RTE | Accuracy99.4 | 10 | |
| Grammatical Error Correction | JFLEG (dev) | F0.5 Score63.61 | 7 | |
| Utility Preservation | JFLEG-RTE | Win Rate16.57 | 5 | |
| Indirect Prompt Injection Sanitization | Jfleg | GCG ASR7 | 2 | |
| Indirect Prompt Injection Attack | Jfleg | ASR99.5 | 2 | |
| Error Detection | JFLEG (test) | Precision72.53 | 2 | |
| Indirect Prompt Injection Detection | Jfleg | GCG Accuracy95.5 | 1 |