| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|---|
| Expressive Evaluation | HRS benchmark | Creativity 66.97 | 21 |
| 10-year Mortality | HRS (held-out set) | AUC 0.778 | 16 |
| Layout prediction | HRS spatial | Accuracy 86.07 | 11 |
| Layout prediction | HRS numerical | Precision 93.28 | 11 |
| Text-to-Image Generation | HRS | Count F1 66 | 10 |
| Spatial Reasoning | HRS | Accuracy 53.96 | 8 |
| Numerical Reasoning | HRS | Precision 78.65 | 8 |
| Grounding Accuracy | HRS | Spatial Accuracy 45.01 | 8 |
| Grounding | HRS-Spatial | mIoU 0.372 | 8 |
| Prompt Fidelity | HRS dataset | CLIP Score 33.63 | 6 |
| Text-to-Image Generation | HRS benchmark | CLIP Score 33.63 | 2 |