LANS: A Layout-Aware Neural Solver for Plane Geometry Problem

About

Geometry problem solving (GPS) is a challenging mathematical reasoning task requiring multi-modal understanding, fusion, and reasoning. Existing neural solvers treat GPS as a vision-language task but fall short in representing geometry diagrams, which carry rich and complex layout information. In this paper, we propose a layout-aware neural solver named LANS, integrated with two new modules: a multimodal layout-aware pre-trained language module (MLA-PLM) and layout-aware fusion attention (LA-FA). MLA-PLM adopts structural-semantic pre-training (SSP) to implement global relationship modeling, and point-match pre-training (PMP) to achieve alignment between visual points and textual points. LA-FA employs a layout-aware attention mask to realize point-guided cross-modal fusion, further boosting the layout awareness of LANS. Extensive experiments on the Geometry3K and PGPS9K datasets validate the effectiveness of the layout-aware modules and the superior problem-solving performance of LANS over existing symbolic and neural solvers. The code will be made publicly available soon.
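The abstract does not spell out how the layout-aware attention mask in LA-FA is built, so the following is only a toy sketch of the general idea of point-guided masking, not the authors' implementation. It assumes, hypothetically, that each diagram token and each text token is tagged with the set of geometric points it mentions, and that a diagram token may attend only to text tokens sharing at least one point. The function names `build_point_mask` and `masked_attention` and the toy data are illustrative assumptions.

```python
import math


def build_point_mask(diagram_points, text_points):
    """mask[i][j] is True when diagram token i and text token j share a point.

    Assumption: sharing a point name is the masking rule; the paper may differ.
    """
    return [[bool(set(d) & set(t)) for t in text_points] for d in diagram_points]


def masked_attention(scores, mask):
    """Row-wise softmax over scores, ignoring positions where mask is False."""
    weights = []
    for row, mask_row in zip(scores, mask):
        allowed = [s for s, m in zip(row, mask_row) if m]
        if not allowed:
            # No permitted position: this token attends to nothing.
            weights.append([0.0] * len(row))
            continue
        top = max(allowed)  # subtract the max for numerical stability
        exps = [math.exp(s - top) if m else 0.0 for s, m in zip(row, mask_row)]
        total = sum(exps)
        weights.append([e / total for e in exps])
    return weights


# Hypothetical toy input: three diagram tokens and three text tokens.
diagram_points = [{"A", "B"}, {"B", "C"}, {"O"}]
text_points = [{"A"}, {"C"}, {"A", "B", "C"}]

mask = build_point_mask(diagram_points, text_points)
scores = [[0.0, 0.0, 0.0] for _ in diagram_points]  # uniform raw scores
weights = masked_attention(scores, mask)
```

With uniform scores, each diagram token spreads its attention evenly over the text tokens that share a point with it, and a token with no shared point (here the one tagged `O`) receives an all-zero row.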

Zhong-Zhi Li, Ming-Liang Zhang, Fei Yin, Cheng-Lin Liu • 2023

Related benchmarks

Task                       Dataset             Result                  Rank
Geometry Problem Solving   Geometry3K (test)   Choice Accuracy: 82.3   32
Geometry Problem Solving   PGPS9K (test)       Completion: 66.1        18
