Multi-Scale Attention with Dense Encoder for Handwritten Mathematical Expression Recognition
About
Handwritten mathematical expression recognition is a challenging problem due to the complicated two-dimensional structures, ambiguous handwriting input and variant scales of handwritten math symbols. To settle this problem, we utilize the attention based encoder-decoder model that recognizes mathematical expression images from two-dimensional layouts to one-dimensional LaTeX strings. We improve the encoder by employing densely connected convolutional networks as they can strengthen feature extraction and facilitate gradient propagation especially on a small training set. We also present a novel multi-scale attention model which is employed to deal with the recognition of math symbols in different scales and save the fine-grained details that will be dropped by pooling operations. Validated on the CROHME competition task, the proposed method significantly outperforms the state-of-the-art methods with an expression recognition accuracy of 52.8% on CROHME 2014 and 50.1% on CROHME 2016, by only using the official training dataset.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Handwritten Mathematical Expression Recognition | CROHME 2016 (test) | Expression Rate (Exp)55.97 | 164 | |
| Handwritten Mathematical Expression Recognition | CROHME 2014 (test) | Expression Rate (Exp)57.97 | 156 | |
| Handwritten Mathematical Expression Recognition | CROHME 2019 (test) | Expression Rate (Exp)56.05 | 107 | |
| Handwritten Mathematical Expression Recognition | CROHME 2014 | Error Rate50.1 | 47 | |
| Handwritten Mathematical Expression Recognition | CROHME 2016 | Expression Rate50.1 | 40 | |
| Handwritten Mathematical Expression Recognition | CROHME 2019 | ExpRate47.7 | 39 | |
| Handwritten Mathematical Expression Recognition | HME100K | ExpRate61.85 | 17 | |
| Handwritten Mathematical Expression Recognition | HME100K 1.0 (test) | Expression Rate0.6185 | 9 | |
| Handwritten Mathematical Expression Recognition | M2E multi-line (test) | ExpRate53.14 | 8 | |
| Handwritten Mathematical Expression Recognition | CROHME Average 2014/2016/2019 | ExpRate0.4917 | 5 |