The Cascade Transformer: an Application for Efficient Answer Sentence Selection

Soldaini, Luca; Moschitti, Alessandro

arXiv:2005.02534 (cs)

[Submitted on 5 May 2020 (v1), last revised 7 May 2020 (this version, v2)]

Abstract: Large transformer-based language models have been shown to be very effective in many classification tasks. However, their computational complexity prevents their use in applications requiring the classification of a large set of candidates. While previous works have investigated approaches to reduce model size, relatively little attention has been paid to techniques to improve batch throughput during inference. In this paper, we introduce the Cascade Transformer, a simple yet effective technique to adapt transformer-based models into a cascade of rankers. Each ranker is used to prune a subset of candidates in a batch, thus dramatically increasing throughput at inference time. Partial encodings from the transformer model are shared among rerankers, providing further speed-up. When compared to a state-of-the-art transformer model, our approach reduces computation by 37% with almost no impact on accuracy, as measured on two English Question Answering datasets.
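The cascade idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the names `cascade_rank`, `Block`, `Scorer`, and `drop_fraction` are hypothetical, the "hidden state" is a plain list of floats standing in for a transformer encoding, and the fixed per-stage drop rate is an assumption, since the abstract does not say how many candidates each ranker prunes. The sketch only shows the two mechanisms the abstract names: each stage prunes part of the batch, and surviving candidates carry their partial encoding forward instead of being re-encoded.

```python
from typing import Callable, List, Sequence, Tuple

# Hypothetical stand-ins: a "block" maps a hidden state to a deeper hidden
# state (think: a group of transformer layers), and a "scorer" maps a hidden
# state to a relevance score (think: a classification head).
Hidden = List[float]
Block = Callable[[Hidden], Hidden]
Scorer = Callable[[Hidden], float]


def cascade_rank(
    candidates: Sequence[Hidden],
    stages: Sequence[Tuple[Block, Scorer]],
    drop_fraction: float = 0.3,  # assumed fixed pruning rate per stage
) -> List[Tuple[int, float]]:
    """Return (candidate index, final-stage score) pairs, best first."""
    # Track survivors as (original index, current hidden state).
    alive = [(i, list(h)) for i, h in enumerate(candidates)]
    scored: List[Tuple[int, float]] = []
    for stage_no, (block, scorer) in enumerate(stages):
        # Continue from the previous stage's hidden state (shared partial
        # encoding) rather than encoding each candidate from scratch.
        alive = [(i, block(h)) for i, h in alive]
        scored = sorted(
            ((i, scorer(h)) for i, h in alive),
            key=lambda pair: pair[1],
            reverse=True,
        )
        if stage_no < len(stages) - 1:
            # Prune the lowest-scoring candidates before the next stage.
            keep = max(1, len(scored) - int(len(scored) * drop_fraction))
            survivors = {i for i, _ in scored[:keep]}
            alive = [(i, h) for i, h in alive if i in survivors]
    return scored
```

In this toy setting, with four candidates, two stages, and `drop_fraction=0.5`, the second block runs on two candidates rather than four, so six block evaluations replace eight. The savings figure reported in the abstract comes from the authors' own experiments and is not reproduced by this sketch.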

Comments:	Accepted to ACL 2020 (long)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2005.02534 [cs.CL]
	(or arXiv:2005.02534v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2005.02534

Submission history

From: Luca Soldaini
[v1] Tue, 5 May 2020 23:32:01 UTC (188 KB)
[v2] Thu, 7 May 2020 15:07:38 UTC (381 KB)
