| Dataset Name | SOTA Method | Metric | Score | Trend | Updated |
|---|---|---|---|---|---|
| Just-Eval | | Just-Eval Average Score | 4.83 | 18 | 4d ago |
| GM | TVAE | Balanced Acc | 66.6 | 13 | 4d ago |
| CR | | Balanced Acc | 68.6 | 13 | 4d ago |
| CC | DP-CTGAN | Balanced Acc | 67.3 | 13 | 4d ago |
| BM | TVAE | Balanced Acc | 60.3 | 13 | 4d ago |
| AD | | Balanced Accuracy | 81.8 | 13 | 4d ago |
| ScienceQA (S-QA) | CMRM_dataset | Accuracy | 73.2 | 13 | 4d ago |
| LLaVA-Bench Coco | ShareGPT4V | Score | 92.3 | 13 | 4d ago |
| Downstream Tasks | DAPT (nontoxic) | Average Accuracy | 63.4 | 12 | 4d ago |
| BC | | Balanced Acc | 72.1 | 11 | 4d ago |
| MMbench and DocVQA (test) | | MMbench Score | 87.02 | 7 | 4d ago |