| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|---|
| Question Answering | ChronoTQA Tier 1 (147 Phenopackets-grounded onset questions) | Accuracy: 86.4 | 16 |
| Biomedical Temporal Reasoning | ChronoTQA 1.0 (120-question stratified subsample) | Cross-disease Comparison: 100 | 4 |
| Biomedical Question Answering | ChronoTQA | Metric: - | 0 |