Unbalanced Optimal Transport for Unbalanced Word Alignment

About

Monolingual word alignment is crucial for modeling semantic interactions between sentences. In particular, null alignment, a phenomenon in which words have no corresponding counterparts, is pervasive and critical in handling semantically divergent sentences. Identifying null alignment is useful on its own for reasoning about the semantic similarity of sentences, because it indicates that the sentences carry unequal information. To achieve unbalanced word alignment that values both alignment and null alignment, this study shows that the family of optimal transport (OT) methods, i.e., balanced, partial, and unbalanced OT, are natural and powerful approaches even without tailor-made techniques. Our extensive experiments covering unsupervised and supervised settings indicate that our generic OT-based alignment methods are competitive with state-of-the-art methods specially designed for word alignment, notably on challenging datasets with high null alignment frequencies.
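
As a rough illustration of how an OT plan can yield both alignments and null alignments, the sketch below runs a Sinkhorn-style scaling iteration for entropy-regularized unbalanced OT with KL marginal penalties, using only NumPy. It is not the authors' implementation: the toy embeddings, the uniform word masses, the values of `eps`, `tau`, and `threshold`, and all function names are assumptions made for this example.

```python
import numpy as np


def cosine_cost(X, Y):
    """Pairwise cosine distance between rows of X and rows of Y."""
    Xn = X / np.linalg.norm(X, axis=1, keepdims=True)
    Yn = Y / np.linalg.norm(Y, axis=1, keepdims=True)
    return 1.0 - Xn @ Yn.T


def unbalanced_sinkhorn(a, b, C, eps=0.1, tau=0.5, n_iter=500):
    """Scaling iterations for entropic unbalanced OT with KL marginal penalties.

    eps is the entropic regularization strength and tau the marginal penalty
    weight (both hypothetical settings chosen for this toy example).
    """
    K = np.exp(-C / eps)          # Gibbs kernel
    u = np.ones_like(a)
    v = np.ones_like(b)
    power = tau / (tau + eps)     # exponent < 1 softens the marginal constraints
    for _ in range(n_iter):
        u = (a / (K @ v)) ** power
        v = (b / (K.T @ u)) ** power
    return u[:, None] * K * v[None, :]   # transport plan


def extract_alignment(P, threshold=0.1):
    """Keep pairs whose transported mass exceeds a (hypothetical) threshold."""
    pairs = {(int(i), int(j)) for i, j in zip(*np.nonzero(P > threshold))}
    null_src = [i for i in range(P.shape[0]) if all(p[0] != i for p in pairs)]
    null_tgt = [j for j in range(P.shape[1]) if all(p[1] != j for p in pairs)]
    return pairs, null_src, null_tgt


# Toy "embeddings": source word 2 is orthogonal to every target word.
src_emb = np.array([[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]])
tgt_emb = np.array([[1.0, 0.1, 0.0], [0.1, 1.0, 0.0]])
a = np.full(3, 1 / 3)   # uniform mass over source words (assumption)
b = np.full(2, 1 / 2)   # uniform mass over target words (assumption)

P = unbalanced_sinkhorn(a, b, cosine_cost(src_emb, tgt_emb))
pairs, null_src, null_tgt = extract_alignment(P)
print(pairs, null_src, null_tgt)
```

In this sketch, a large `tau` pushes the exponent toward 1 and the iteration toward balanced Sinkhorn, while a smaller `tau` lets words without a good counterpart keep little mass, so they fall below the threshold and are reported as null-aligned.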

Yuki Arase, Han Bao, Sho Yokoi • 2023

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Word Alignment | English-French (test) | AER 6 | 37 |
| Word Alignment | Romanian-English (Ro-En) (test) | AER 34 | 34 |
| Word Alignment | English-Hindi en-hi (test) | AER 44 | 17 |
| Supervised Word Alignment | MSR-RTE (test) | F1 0.864 | 7 |
| Supervised Word Alignment | Newsela (test) | F1 Score 84.6 | 7 |
| Supervised Word Alignment | MTRef (test) | F1 Score 77.2 | 7 |
| Word Alignment | MSR-RTE Sure Only (S) (test) | F1 Score 92.2 | 7 |
| Word Alignment | Newsela Sure and Possible (S + P) (test) | F1 Score 79.8 | 7 |
| Word Alignment | EDB++ Sure Only (S) (test) | F1 Score 84.7 | 7 |
| Word Alignment | EDB++ Sure and Possible (S + P) (test) | F1 Score 82.8 | 7 |
Showing 10 of 20 rows
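
The benchmark rows report alignment error rate (AER) and F1. For readers unfamiliar with these metrics, the sketch below computes them from sets of index pairs using the commonly used definitions with sure (S) and possible (P) gold links, where S is a subset of P. The exact evaluation protocol behind each row (for example, how null alignments are scored) is not stated on this page, so the function name and the toy gold sets are assumptions for illustration only.

```python
def alignment_scores(predicted, sure, possible=None):
    """Precision, recall, F1, and AER for a predicted alignment.

    predicted, sure, possible: iterables of (source_index, target_index) pairs.
    Sure links are always counted as possible links.
    """
    predicted = set(predicted)
    sure = set(sure)
    possible = sure | set(possible or ())

    hit_sure = len(predicted & sure)
    hit_possible = len(predicted & possible)

    precision = hit_possible / len(predicted) if predicted else 0.0
    recall = hit_sure / len(sure) if sure else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall > 0 else 0.0)

    denom = len(predicted) + len(sure)
    # AER = 1 - (|A ∩ S| + |A ∩ P|) / (|A| + |S|); lower is better.
    aer = 1.0 - (hit_sure + hit_possible) / denom if denom else 0.0
    return precision, recall, f1, aer


# Toy example with made-up links (not taken from any benchmark).
predicted = {(0, 0), (1, 1), (2, 3)}
sure = {(0, 0), (1, 1)}
possible = {(2, 2)}
precision, recall, f1, aer = alignment_scores(predicted, sure, possible)
print(precision, recall, f1, aer)
```

Passing only `sure` corresponds to a sure-only evaluation; passing both sets corresponds to a sure-and-possible evaluation under these definitions.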
