| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| AI-generated text detection | Finance | AUC0.998 | 24 | |
| Named Entity Recognition | Finance (test) | F1 Score87.25 | 14 | |
| Machine-generated text detection | Finance Llama-3-70B-Instruct (test) | AUC0.995 | 12 | |
| AI-generated text detection | Finance GPT-3.5 Turbo | AUC98.7 | 12 | |
| Sentiment Classification | Finance (test) | Accuracy90.8 | 11 | |
| Sentiment Classification | Finance | F1 Score89.41 | 11 | |
| Theme Label Quality | Finance Out-of-domain | ROUGE-142.4 | 4 | |
| Theme Distribution | Finance Out-of-domain | Accuracy55.8 | 4 | |
| Sentiment Analysis | Finance | Accuracy66 | 2 |