| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| NLI | MultiMed-X ZU | Accuracy0.68 | 5 | |
| Long-form QA | MultiMed-X ZU | Overall Score4.42 | 5 | |
| NLI | MultiMed-X YO | Accuracy68.67 | 5 | |
| Long-form QA | MultiMed-X YO | Overall Score4.45 | 5 | |
| NLI | MultiMed-X TH | Accuracy71.33 | 5 | |
| Long-form QA | MultiMed-X TH | Overall Score4.66 | 5 | |
| NLI | MultiMed-X SW | Accuracy73.33 | 5 | |
| Long-form QA | MultiMed-X SW | Overall Score4.55 | 5 | |
| NLI | MultiMed-X KO | Accuracy0.6733 | 5 | |
| Long-form QA | MultiMed-X KO | Overall Score4.54 | 5 | |
| NLI | MultiMed-X JP | Accuracy78 | 5 | |
| Long-form QA | MultiMed-X JP | Overall Score4.43 | 5 | |
| NLI | MultiMed-X ZH | Accuracy0.7667 | 5 | |
| Long-form QA | MultiMed-X ZH | Overall Score4.53 | 5 | |
| NLI | MultiMed-X EN | Accuracy78.67 | 5 | |
| Long-form QA | MultiMed-X EN | Overall Score4.6 | 5 |