Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Bilingual Rhetorical Structure Parsing with Large Parallel Annotations

About

Discourse parsing is a crucial task in natural language processing that aims to reveal the higher-level relations in a text. Despite growing interest in cross-lingual discourse parsing, challenges persist due to limited parallel data and inconsistencies in the Rhetorical Structure Theory (RST) application across languages and corpora. To address this, we introduce a parallel Russian annotation for the large and diverse English GUM RST corpus. Leveraging recent advances, our end-to-end RST parser achieves state-of-the-art results on both English and Russian corpora. It demonstrates effectiveness in both monolingual and bilingual settings, successfully transferring even with limited second-language annotation. To the best of our knowledge, this work is the first to evaluate the potential of cross-lingual end-to-end RST parsing on a manually annotated parallel corpus.

Elena Chistova• 2024

Related benchmarks

TaskDatasetResultRank
End-to-end RST parsingRST-DT En (test)
Segmentation Score97.9
7
End-to-end RST parsingGUM v9.1 En (test)
Segm Score95.5
3
End-to-end RST parsingRRT Ru (test)
Segm. Score92.4
3
End-to-end RST parsingRRG Ru (test)
Segmentation Score96.9
3
Showing 4 of 4 rows

Other info

Code

Follow for update