Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images

About

Localizing page elements/objects such as tables, figures, equations, etc. is the primary step in extracting information from document images. We propose a novel end-to-end trainable deep network, (CDeC-Net) for detecting tables present in the documents. The proposed network consists of a multistage extension of Mask R-CNN with a dual backbone having deformable convolution for detecting tables varying in scale with high detection accuracy at higher IoU threshold. We empirically evaluate CDeC-Net on all the publicly available benchmark datasets - ICDAR-2013, ICDAR-2017, ICDAR-2019,UNLV, Marmot, PubLayNet, and TableBank - with extensive experiments. Our solution has three important properties: (i) a single trained model CDeC-Net{\ddag} performs well across all the popular benchmark datasets; (ii) we report excellent performances across multiple, including higher, thresholds of IoU; (iii) by following the same protocol of the recent papers for each of the benchmarks, we consistently demonstrate the superior quantitative performance. Our code and models will be publicly released for enabling the reproducibility of the results.

Madhav Agarwal, Ajoy Mondal, C. V. Jawahar• 2020

Related benchmarks

TaskDatasetResultRank
Document Layout AnalysisPubLayNet (val)
mAP0.978
21
Table DetectionICDAR 2013 (test)
Recall100
18
Table DetectionMarmot (Mixed Full)
Recall93
12
Table DetectionMarmot English
Recall95.2
12
Table DetectionMarmot Chinese
Recall0.966
12
Table DetectionICDAR 2019
Recall93.4
12
Table DetectionUNLV (test)
Recall92.5
12
Table DetectionICDAR archive 2019
Recall96.2
10
Table DetectionICDAR 60 images 2013 (test)
Recall93.7
10
Table DetectionTableBank LaTeX 1K (test)
Recall97.9
9
Showing 10 of 30 rows

Other info

Code

Follow for update