| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Brain Tumor Classification | Target dataset | Macro F1 Score: 72.95 | 10 |
| Task-free exploration | Target | SR (%): 77.3 | 10 |
| Clustering | target | ARI: 1 | 7 |
| Autonomous Replication | target-1 citrusdrop | Non-refusal Rate: 48 | 4 |
| Qualitative evaluation of conversational responses | Target N=262 | Constructive Guidance Score: 4.09 | 2 |