| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Next Token Prediction | WildChat | Next Token Accuracy | 51 | 32 |
| Safety | WildChat | Refusal Rate | 42.92 | 20 |
| Safety Evaluation | WildChat | Safe@1 | 97.5 | 18 |
| Quantization Detection | WildChat | Statistical Power AUC | 64.2 | 18 |
| Jail-breaking detection | WildChat | AUC (Statistical Power) | 0.895 | 18 |
| Fingerprint Detection | WildChat Fr | FSR | 1 | 18 |
| Proactive next utterance prediction | WildChat (test) | LLM-Judge | 52.16 | 17 |
| Safety Evaluation | WildChat (test) | WildChat Score | 69.85 | 13 |
| Model Routing | NB-WildChat | Uniqueness Score | 42.6 | 11 |
| Synthetic Text Generation | WildChat | Mean Embedding Similarity | 0.31 | 10 |
| Safety Evaluation | WildChat unsafe prompts | Not-Unsafe Rate | 99.82 | 9 |
| Next Token Prediction | WildChat | BERT-Small Next Token Accuracy (eps=inf) | 28.78 | 5 |
| Over-safety measurement | WildChat | User Score | 15.1 | 2 |