| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Binary Inconsistency Detection | LLM | Accuracy: 70.27 | 10 |
| Robust Steganography | LLM Generative Text | Embedding Capacity (bits / 1k tokens): 84.08 | 5 |
| Span Detection | LLM | F1 Score: 0.3322 | 5 |
| Language | LLM-329M | Peak Performance (FP4/FP8): 205 | 1 |