Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Bolmo: Byteifying the Next Generation of Language Models

About

Recent advances in generative AI have been largely driven by large language models (LLMs), deep neural networks that operate over discrete units called tokens. To represent text, the vast majority of LLMs use words or word fragments as the tokens, known as subword tokenization. Subword tokenization obscures fine-grained information, which is problematic, especially for scientific data - such as computer code or biological sequences - where meaning depends on the individual characters. Models that instead operate directly on the byte encoding of text avoid these limitations, but until now they have lagged behind subword-based models in performance. Here we introduce Bolmo, a family of fully open byte-level LLMs that approach the capabilities of subword-based systems. Using a two-stage conversion procedure, we transform existing subword-based models into byte-level models with minimal additional training. The resulting models outperform prior byte-level approaches and excel on character-level reasoning tasks, while remaining competitive across standard benchmarks. By efficiently processing byte-level information, these models achieve practical inference speeds and can be adapted at low cost using the existing ecosystem around the source LLM. Our results remove a long-standing performance barrier to end-to-end byte-level language modeling, demonstrating that models operating on raw text encodings can scale competitively while offering advantages in domains requiring fine-grained textual understanding.

Benjamin Minixhofer, Tyler Murray, Tomasz Limisiewicz, Anna Korhonen, Luke Zettlemoyer, Noah A. Smith, Edoardo M. Ponti, Luca Soldaini, Valentin Hofmann• 2025

Related benchmarks

TaskDatasetResultRank
Generative Question AnsweringBolmo Evaluation Suite GenQA 7B
GenQA Average0.709
29
Multiple-choice Question AnsweringBolmo Evaluation Suite MC STEM 7B
MC STEM Average Accuracy65.5
17
Character UnderstandingBolmo Character Understanding 7B
Char (Avg)75.1
5
Code GenerationBolmo Evaluation Suite Code 7B
Average Code Score0.407
5
Mathematical ReasoningBolmo Evaluation Suite Math 7B
Avg Math Score48.9
5
Multiple-choice Question AnsweringBolmo 7B Evaluation Suite MC Non-STEM
Average Score (Non-STEM)75.8
5
Language Modeling EvaluationBolmo 1B evaluation suite
Overall Average Score58.2
5
Showing 7 of 7 rows

Other info

Follow for update