Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

OCSU: Optical Chemical Structure Understanding for Molecule-centric Scientific Discovery

About

Understanding the chemical structure from a graphical representation of a molecule is a challenging image caption task that would greatly benefit molecule-centric scientific discovery. Variations in molecular images and caption subtasks pose a significant challenge in both image representation learning and task modeling. Yet, existing methods only focus on a specific caption task that translates a molecular image into its graph structure, i.e., OCSR. In this paper, we propose the Optical Chemical Structure Understanding (OCSU) task, which extends low-level recognition to multilevel understanding and aims to translate chemical structure diagrams into readable strings for both machine and chemist. To facilitate the development of OCSU technology, we explore both OCSR-based and OCSR-free paradigms. We propose DoubleCheck to enhance OCSR performance via attentive feature enhancement for local ambiguous atoms. It can be cascaded with existing SMILES-based molecule understanding methods to achieve OCSU. Meanwhile, Mol-VL is a vision-language model end-to-end optimized for OCSU. We also construct Vis-CheBI20, the first large-scale OCSU dataset. Through comprehensive experiments, we demonstrate the proposed approaches excel at providing chemist-readable caption for chemical structure diagrams, which provide solid baselines for further research. Our code, model, and data are open-sourced at https://github.com/PharMolix/OCSU.

Siqi Fan, Yuguang Xie, Bowen Cai, Ailin Xie, Gaochao Liu, Mu Qiao, Jie Xing, Zaiqing Nie• 2025

Related benchmarks

TaskDatasetResultRank
Molecular structure recognitionJPO (450)
Accuracy70.4
19
Optical Chemical Structure RecognitionUSPTO (5719)
Exact Match Accuracy93.2
12
Optical Chemical Structure RecognitionUOB (5740)
Exact Match Accuracy95.6
12
Optical Chemical Structure RecognitionCLEF 992
Exact Match Accuracy88.3
12
Optical Chemical Structure RecognitionUSPTO-10K
Accuracy (Exact Match)89.4
12
Optical Chemical Structure RecognitionWildMol-10K
Exact Match Accuracy9
12
Optical Chemical Structure RecognitionACS 331
Exact Match Accuracy72.9
9
Optical Chemical Structure RecognitionIndigo 5719
Exact Match Accuracy96.3
9
Optical Chemical Structure RecognitionChemDraw 5719
Exact Match Accuracy93.7
9
Optical Chemical Structure RecognitionStaker 50000
Exact Match Accuracy83.4
9
Showing 10 of 10 rows

Other info

Follow for update