| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Fact-Checking | PubHealth | Balanced Accuracy78.66 | 26 | |
| Closed-set Question Answering | PubHealth | Accuracy74.5 | 15 | |
| Multi-Label Verdict Prediction | PUBHEALTH supplementary experiments | OI2.786 | 8 | |
| Active RAG | Pubhealth | Accuracy73.4 | 6 | |
| Question Answering | PubHealth (test) | Accuracy40.86 | 3 | |
| Multi-label Verdict Prediction | PUBHEALTH | Margin-0.0008 | 2 |