Symbol-Aware Reasoning with Masked Discrete Diffusion for Handwritten Mathematical Expression Recognition
About
Handwritten Mathematical Expression Recognition (HMER) requires reasoning over diverse symbols and 2D structural layouts, yet autoregressive models struggle with exposure bias and syntactic inconsistency. We present a discrete diffusion framework that reformulates HMER as iterative symbolic refinement instead of sequential generation. Through multi-step remasking, the proposal progressively refines both symbols and structural relations, removing causal dependencies and improving structural consistency. A symbol-aware tokenization and Random-Masking Mutual Learning further enhance syntactic alignment and robustness to handwriting diversity. On the MathWriting benchmark, the proposal achieves 5.56\% CER and 60.42\% EM, outperforming strong Transformer and commercial baselines. Consistent gains on CROHME 2014--2023 demonstrate that discrete diffusion provides a new paradigm for structure-aware visual recognition beyond generative modeling.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Handwritten Mathematical Expression Recognition | CROHME 2014 | -- | 47 | |
| Handwritten Mathematical Expression Recognition | CROHME 2016 | Expression Rate63.23 | 40 | |
| Handwritten Mathematical Expression Recognition | CROHME 2019 | ExpRate60.66 | 39 | |
| Handwritten Mathematical Expression Recognition | CROHME 2023 (test) | Expression Rate60.78 | 11 | |
| Mathematical Expression Recognition | MathWriting 1.0 (test) | CER5.55 | 9 | |
| Mathematical Expression Recognition | MathWriting 1.0 (val) | CER4.7 | 9 |