Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Symbol-Aware Reasoning with Masked Discrete Diffusion for Handwritten Mathematical Expression Recognition

About

Handwritten Mathematical Expression Recognition (HMER) requires reasoning over diverse symbols and 2D structural layouts, yet autoregressive models struggle with exposure bias and syntactic inconsistency. We present a discrete diffusion framework that reformulates HMER as iterative symbolic refinement instead of sequential generation. Through multi-step remasking, the proposal progressively refines both symbols and structural relations, removing causal dependencies and improving structural consistency. A symbol-aware tokenization and Random-Masking Mutual Learning further enhance syntactic alignment and robustness to handwriting diversity. On the MathWriting benchmark, the proposal achieves 5.56\% CER and 60.42\% EM, outperforming strong Transformer and commercial baselines. Consistent gains on CROHME 2014--2023 demonstrate that discrete diffusion provides a new paradigm for structure-aware visual recognition beyond generative modeling.

Takaya Kawakatsu, Ryo Ishiyama• 2026

Related benchmarks

TaskDatasetResultRank
Handwritten Mathematical Expression RecognitionCROHME 2014--
47
Handwritten Mathematical Expression RecognitionCROHME 2016
Expression Rate63.23
40
Handwritten Mathematical Expression RecognitionCROHME 2019
ExpRate60.66
39
Handwritten Mathematical Expression RecognitionCROHME 2023 (test)
Expression Rate60.78
11
Mathematical Expression RecognitionMathWriting 1.0 (test)
CER5.55
9
Mathematical Expression RecognitionMathWriting 1.0 (val)
CER4.7
9
Showing 6 of 6 rows

Other info

Follow for update