Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Bolmo

Benchmarks

Task NameDataset NameSOTA ResultTrend
Generative Question AnsweringBolmo Evaluation Suite GenQA 7B
GenQA Average81.6
29
Language Modeling EvaluationBolmo 1B evaluation suite
Overall Average Score58.5
5
Multiple Choice Question AnsweringBolmo 7B Evaluation Suite MC Non-STEM
Average Score (Non-STEM)77.7
5
Character UnderstandingBolmo Character Understanding 7B
Char (Avg)75.1
5
Showing 4 of 4 rows