Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Bolmo Evaluation Suite

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multiple Choice Question AnsweringBolmo Evaluation Suite MC STEM 7B
MC STEM Average Accuracy78.8
17
Mathematical ReasoningBolmo Evaluation Suite Math 7B
Avg Math Score55.3
5
Code GenerationBolmo Evaluation Suite Code 7B
Average Code Score0.407
5
Showing 3 of 3 rows