Hierarchical Bracketing Encodings for Dependency Parsing as Tagging
About
We present a family of encodings for sequence labeling dependency parsing, based on the concept of hierarchical bracketing. We prove that the existing 4-bit projective encoding belongs to this family, but it is suboptimal in the number of labels used to encode a tree. We derive an optimal hierarchical bracketing, which minimizes the number of symbols used and encodes projective trees using only 12 distinct labels (vs. 16 for the 4-bit encoding). We also extend optimal hierarchical bracketing to support arbitrary non-projectivity in a more compact way than previous encodings. Our new encodings yield competitive accuracy on a diverse set of treebanks.
Ana Ezquerro, David Vilares, Anssi Yli-Jyr\"a, Carlos G\'omez-Rodr\'iguez• 2025
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Dependency Parsing | English (en) (test) | LAS94.55 | 16 | |
| Dependency Parsing | Hebrew (he) (test) | LAS89.61 | 10 | |
| Dependency Parsing | Ancient Greek grc (test) | LAS63.82 | 8 | |
| Dependency Parsing | Uyghur ug (test) | LAS66.88 | 8 | |
| Dependency Parsing | Finnish (fi) (test) | LAS88.65 | 8 | |
| Dependency Parsing | French (fr) (test) | LAS92.06 | 8 | |
| Dependency Parsing | Russian (ru) (test) | LAS88.18 | 8 | |
| Dependency Parsing | Tamil (ta) (test) | LAS66.06 | 8 | |
| Dependency Parsing | Wolof wo (test) | LAS72.37 | 8 |
Showing 9 of 9 rows