| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| ARC-Challenge (first 300 questions) | EcoLang | Average Accuracy | 34.89 | 10 | 8d ago |
| ARC Easy (first 300 questions) | TextFullT | Average Accuracy | 42.33 | 10 | 8d ago |
| MedQA (first 300 questions) | TextFullT | Average Accuracy | 44.55 | 10 | 8d ago |
| PubMedQA (first 300 questions) | TextFullT | Average Accuracy | 69.67 | 10 | 8d ago |
| WorldTree (first 300 questions) | TextFullT | Average Accuracy | 70.56 | 10 | 8d ago |
| SocialIQA (first 300 questions) | HyLaT | Average Accuracy | 82.56 | 10 | 8d ago |
| StrategyQA (first 300 questions) | HyLaT | Average Accuracy | 64.67 | 10 | 8d ago |
| CommonsenseQA (first 300 questions) | TextFullT | Average Accuracy | 64.44 | 10 | 8d ago |