| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Generative Question Answering | Bolmo Evaluation Suite GenQA 7B | GenQA Average81.6 | 29 | |
| Language Modeling Evaluation | Bolmo 1B evaluation suite | Overall Average Score58.5 | 5 | |
| Multiple Choice Question Answering | Bolmo 7B Evaluation Suite MC Non-STEM | Average Score (Non-STEM)77.7 | 5 | |
| Character Understanding | Bolmo Character Understanding 7B | Char (Avg)75.1 | 5 |