| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Question Answering | CovidQA | F1 47.64 | 17 |
| Question Answering | CovidQA | Accuracy 67.59 | 15 |
| Machine-generated text detection | CovidQA Community ChatGPT-generated (test) | AUROC 0.9923 | 11 |
| Hallucination Detection | CovidQA | F1 Score 91.7 | 6 |
| Retrieval-Augmented Generation | CovidQA | Faithfulness 79.7 | 5 |