Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Cross-lingual RST Discourse Parsing

About

Discourse parsing is an integral part of understanding information flow and argumentative structure in documents. Most previous research has focused on inducing and evaluating models from the English RST Discourse Treebank. However, discourse treebanks for other languages exist, including Spanish, German, Basque, Dutch and Brazilian Portuguese. The treebanks share the same underlying linguistic theory, but differ slightly in the way documents are annotated. In this paper, we present (a) a new discourse parser which is simpler, yet competitive (significantly better on 2/3 metrics) to state of the art for English, (b) a harmonization of discourse treebanks across languages, enabling us to present (c) what to the best of our knowledge are the first experiments on cross-lingual discourse parsing.

Chlo\'e Braud, Maximin Coavoux, Anders S{\o}gaard• 2017

Related benchmarks

TaskDatasetResultRank
RST Discourse ParsingRST-DT Parseval (test)
Span (S) Score81.3
32
RST ParsingRST-DT original Parseval (test)
Span F162.7
28
Structure PredictionRST-DT
Micro F181.3
24
Discourse ParsingEn-DT English (test)
Span85.1
8
Discourse ParsingPt-DT Brazilian Portuguese (test)
Span Score82
6
Discourse ParsingEs-DT Spanish (test)
Span89.7
6
Discourse ParsingDe-DT German (test)
Span Score80.2
5
Discourse ParsingNI-DT Dutch (test)
Span69.5
4
Discourse ParsingEu-DT Basque (test)
Span78.7
4
Showing 9 of 9 rows

Other info

Code

Follow for update