| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Deception Detection | Liars' Bench Harm-Pressure Knowledge (test) | AUROC0.91 | 3 | |
| Deception Detection | Liars' Bench Insider Trading (test) | AUROC0.953 | 3 | |
| Deception Detection | Liars' Bench Convincing Game (test) | AUROC1 | 3 | |
| Deception Detection | Liars' Bench Harm-Pressure Choice (test) | AUROC0.949 | 3 | |
| Deception Detection | Liars' Bench Instructed Deception (test) | AUROC0.939 | 3 |