Semi-Supervised Bilingual Lexicon Induction with Two-way Interaction

Zhao, Xu; Wang, Zihao; Wu, Hao; Zhang, Yong

arXiv:2010.07101 (cs)

[Submitted on 14 Oct 2020]

Abstract: Semi-supervision is a promising paradigm for Bilingual Lexicon Induction (BLI) with limited annotations. However, previous semi-supervised methods do not fully utilize the knowledge hidden in annotated and non-annotated data, which hinders further improvement of their performance. In this paper, we propose a new semi-supervised BLI framework that encourages interaction between the supervised signal and unsupervised alignment. We design two message-passing mechanisms to transfer knowledge between annotated and non-annotated data: prior optimal transport and bi-directional lexicon update. We then perform semi-supervised learning with either a cyclic or a parallel parameter feeding routine to update our models. The framework is general and can incorporate any supervised and unsupervised BLI methods based on optimal transport. Experimental results on the MUSE and VecMap datasets show significant improvements for our models. An ablation study also shows that the two-way interaction between the supervised signal and unsupervised alignment accounts for the overall performance gain. Results on distant language pairs further illustrate the advantage and robustness of the proposed method.
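The abstract gives no algorithmic detail, so the following is only a rough sketch of the general idea of alternating between a supervised signal and an optimal-transport alignment. It is not the paper's prior optimal transport or bi-directional lexicon update. Everything in it is an assumption made for illustration: the function names (`procrustes`, `sinkhorn`, `alternate`), the use of orthogonal Procrustes as the supervised step, entropic Sinkhorn as the unsupervised step, mutual-argmax agreement as a stand-in for a lexicon update, and the synthetic embeddings.

```python
import numpy as np

def normalize(M):
    """Scale each row to unit length."""
    return M / np.linalg.norm(M, axis=1, keepdims=True)

def procrustes(X, Y):
    """Orthogonal W minimizing ||X W - Y||_F for row-aligned X and Y."""
    U, _, Vt = np.linalg.svd(X.T @ Y)
    return U @ Vt

def sinkhorn(cost, reg=0.05, n_iter=200):
    """Entropic optimal transport plan between uniform marginals."""
    n, m = cost.shape
    a, b = np.full(n, 1.0 / n), np.full(m, 1.0 / m)
    K = np.exp(-cost / reg)
    u = np.ones(n)
    for _ in range(n_iter):
        v = b / (K.T @ u)
        u = a / (K @ v)
    return u[:, None] * K * v[None, :]

def alternate(X, Y, seed_pairs, rounds=3):
    """Hypothetical loop: fit on the lexicon, align with OT, grow the lexicon."""
    lexicon = set(seed_pairs)
    for _ in range(rounds):
        src, tgt = map(list, zip(*sorted(lexicon)))
        W = procrustes(X[src], Y[tgt])        # supervised step on known pairs
        cost = 1.0 - normalize(X @ W) @ Y.T   # cosine cost under the current map
        plan = sinkhorn(cost)                 # unsupervised OT step on all words
        fwd, bwd = plan.argmax(axis=1), plan.argmax(axis=0)
        # Keep only pairs on which both directions agree (an illustrative rule).
        lexicon |= {(i, int(j)) for i, j in enumerate(fwd) if bwd[j] == i}
    return W, plan, lexicon

# Synthetic data: the "target" space is a noisy rotation of the "source" space,
# so word i in the source corresponds to word i in the target.
rng = np.random.default_rng(0)
n, d, k = 50, 8, 20
X = normalize(rng.standard_normal((n, d)))
Q, _ = np.linalg.qr(rng.standard_normal((d, d)))
Y = normalize(X @ Q + 0.01 * rng.standard_normal((n, d)))

seed = [(i, i) for i in range(k)]  # assumed seed lexicon of k annotated pairs
W, plan, lexicon = alternate(X, Y, seed)
accuracy = float((plan.argmax(axis=1) == np.arange(n)).mean())
print(f"lexicon size: {len(lexicon)}, row-argmax accuracy: {accuracy:.2f}")
```

In this toy setup the supervised fit shapes the transport cost, and the transport plan in turn proposes new lexicon entries for the next fit, which loosely mirrors the two-way flow of information the abstract describes. The actual mechanisms and feeding routines are defined in the paper itself.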

Comments: 12 pages, 2 figures, 6 tables, accepted as long paper by EMNLP 2020
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2010.07101 [cs.CL]
(or arXiv:2010.07101v1 [cs.CL] for this version)
https://doi.org/10.48550/arXiv.2010.07101

Submission history

From: Xu Zhao
[v1] Wed, 14 Oct 2020 13:59:07 UTC (404 KB)
