Mixture Content Selection for Diverse Sequence Generation

Cho, Jaemin; Seo, Minjoon; Hajishirzi, Hannaneh

arXiv:1909.01953 (cs)

[Submitted on 4 Sep 2019]

Abstract: Generating diverse sequences is important in many NLP applications, such as question generation or summarization, that exhibit semantically one-to-many relationships between the source and target sequences. We present a method to explicitly separate diversification from generation using a general plug-and-play module (called SELECTOR) that wraps around and guides an existing encoder-decoder model. The diversification stage uses a mixture of experts to sample different binary masks on the source sequence for diverse content selection. The generation stage uses a standard encoder-decoder model given each selected content from the source sequence. Due to the non-differentiable nature of discrete sampling and the lack of ground-truth labels for the binary mask, we leverage a proxy for the ground-truth mask and adopt stochastic hard-EM for training. In question generation (SQuAD) and abstractive summarization (CNN-DM), our method demonstrates significant improvements in accuracy, diversity, and training efficiency, including state-of-the-art top-1 accuracy on both datasets, a 6% gain in top-5 accuracy, and 3.7 times faster training than a state-of-the-art model. Our code is publicly available at this https URL.
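
The abstract names the ingredients of the diversification stage (a mixture of experts over binary masks, a proxy for the ground-truth mask, hard-EM training) without giving details. The toy sketch below illustrates one way those pieces could fit together; it is not the authors' implementation. It assumes that the proxy mask marks a source token as selected when the token also appears in the target, and that the hard E-step assigns each example to the expert with the lowest negative log-likelihood on that proxy mask. The function names and the hard-coded expert probabilities are hypothetical; in a real model each expert would be a trained network, and only the chosen expert would receive a gradient update, which is omitted here.

```python
import math
import random


def proxy_mask(source_tokens, target_tokens):
    """Assumed proxy label: 1 if a source token also occurs in the target."""
    target_vocab = set(target_tokens)
    return [1 if tok in target_vocab else 0 for tok in source_tokens]


def mask_nll(probs, mask, eps=1e-9):
    """Negative log-likelihood of a binary mask under per-token Bernoulli probabilities."""
    return -sum(
        math.log(p + eps) if m == 1 else math.log(1.0 - p + eps)
        for p, m in zip(probs, mask)
    )


def hard_em_assign(expert_probs, mask):
    """Hard E-step: pick the single expert that best explains the proxy mask."""
    losses = [mask_nll(probs, mask) for probs in expert_probs]
    best = min(range(len(losses)), key=losses.__getitem__)
    return best, losses


def sample_mask(probs, rng):
    """Sample one binary content-selection mask from an expert's probabilities."""
    return [1 if rng.random() < p else 0 for p in probs]


source = "the cat sat on the mat".split()
target = "where did the cat sit".split()
mask = proxy_mask(source, target)  # [1, 1, 0, 0, 1, 0]

# Hypothetical per-token selection probabilities from K = 2 experts.
expert_probs = [
    [0.9, 0.8, 0.1, 0.1, 0.9, 0.2],
    [0.1, 0.2, 0.9, 0.8, 0.1, 0.9],
]

# Training-time assignment: only expert `best` would be updated on this example.
best, losses = hard_em_assign(expert_probs, mask)

# Inference-time diversification: each expert samples its own mask, and each
# mask would be passed with the source to the encoder-decoder model.
rng = random.Random(0)
sampled_masks = [sample_mask(probs, rng) for probs in expert_probs]
```

Updating only the best-fitting expert is what lets the experts specialize in different parts of the source, so sampling one mask per expert yields different content selections for the generator.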

Comments:	EMNLP-IJCNLP 2019; Code is available at this https URL
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1909.01953 [cs.CL]
	(or arXiv:1909.01953v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1909.01953

Submission history

From: Jaemin Cho
[v1] Wed, 4 Sep 2019 17:23:54 UTC (420 KB)
