| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Language Modeling | Finance (Fin) | PPL Change (%): 0 | 28 |
| Machine Translation (Zh to En) | Finance | SacreBLEU: 49.1 | 26 |
| AI-generated text detection | Finance | AUC: 0.998 | 24 |
| Machine Translation (En to Zh) | Finance | SacreBLEU: 39.6 | 20 |
| Time Series Classification | Finance (test) | F1 Score: 63.1 | 19 |
| Named Entity Recognition | Finance (test) | F1 Score: 87.25 | 14 |
| Machine-generated text detection | Finance Llama-3-70B-Instruct (test) | AUC: 0.995 | 12 |
| AI-generated text detection | Finance GPT-3.5 Turbo | AUC: 98.7 | 12 |
| Data Extraction | Finance D2 | Match Ratio (Mean): 47.6 | 11 |
| Financial Reasoning | Finance | Accuracy: 52.45 | 11 |
| Sentiment Classification | Finance (test) | Accuracy: 90.8 | 11 |
| Sentiment Classification | Finance | F1 Score: 89.41 | 11 |
| Open-ended Generation | Finance | ROUGE-Lsum: 29.19 | 8 |
| Text Classification | Finance | Label Quality: 62.3 | 5 |
| Partition Selection | Finance | Output Size: 17,695 | 4 |
| Theme Label Quality | Finance Out-of-domain | ROUGE-1: 42.4 | 4 |
| Theme Distribution | Finance Out-of-domain | Accuracy: 55.8 | 4 |
| Cross-domain generalization | Finance (test) | Accuracy: 99.3 | 3 |
| Parametric Dynamical System Modeling | Finance Extrapolation | TtT0.2: 25.9 | 2 |
| Parametric Dynamical System Modeling | Finance Interpolation | TtT0.2: 45 | 2 |
| Sentiment Analysis | Finance | Accuracy: 66 | 2 |