Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

TableFormer: Table Structure Understanding with Transformers

About

Tables organize valuable content in a concise and compact representation. This content is extremely valuable for systems such as search engines, Knowledge Graph's, etc, since they enhance their predictive capabilities. Unfortunately, tables come in a large variety of shapes and sizes. Furthermore, they can have complex column/row-header configurations, multiline rows, different variety of separation lines, missing entries, etc. As such, the correct identification of the table-structure from an image is a non-trivial task. In this paper, we present a new table-structure identification model. The latter improves the latest end-to-end deep learning model (i.e. encoder-dual-decoder from PubTabNet) in two significant ways. First, we introduce a new object detection decoder for table-cells. In this way, we can obtain the content of the table-cells from programmatic PDF's directly from the PDF source and avoid the training of the custom OCR decoders. This architectural change leads to more accurate table-content extraction and allows us to tackle non-english tables. Second, we replace the LSTM decoders with transformer based decoders. This upgrade improves significantly the previous state-of-the-art tree-editing-distance-score (TEDS) from 91% to 98.5% on simple tables and from 88.7% to 95% on complex tables.

Ahmed Nassar, Nikolaos Livathinos, Maksym Lysak, Peter Staar• 2022

Related benchmarks

TaskDatasetResultRank
Table RecognitionPubTabNet (test)
TEDS (All)93.6
49
Table Structure RecognitionPubTabNet (val)
TEDS93.6
21
Table RecognitionFinTabNet (evaluation)--
10
Table Structure RecognitionFinTabNet (evaluation)
TEDS96.8
9
Table Structure RecognitionPubTabNet All 1.0 (test)
TEDS96.75
7
Table Structure RecognitionPubTabNet Simple 1.0 (test)
TEDS98.5
6
Table Structure RecognitionPubTabNet Complex 1.0 (test)
TEDS95
6
Table Structure RecognitionPubTables-1M
GriTS Top Score98.45
6
Logical structure recognitionFinTabNet (test)
S-TEDS96.8
5
Content Bounding Box DetectionPubTabNet (test)
AP5082.1
3
Showing 10 of 10 rows

Other info

Follow for update