Multilingual Code-Switching for Zero-Shot Cross-Lingual Intent Prediction and Slot Filling

Krishnan, Jitin; Anastasopoulos, Antonios; Purohit, Hemant; Rangwala, Huzefa

Computer Science > Computation and Language

arXiv:2103.07792 (cs)

[Submitted on 13 Mar 2021 (v1), last revised 16 Mar 2021 (this version, v2)]

Title:Multilingual Code-Switching for Zero-Shot Cross-Lingual Intent Prediction and Slot Filling

Authors:Jitin Krishnan, Antonios Anastasopoulos, Hemant Purohit, Huzefa Rangwala

View PDF

Abstract:Predicting user intent and detecting the corresponding slots from text are two key problems in Natural Language Understanding (NLU). In the context of zero-shot learning, this task is typically approached by either using representations from pre-trained multilingual transformers such as mBERT, or by machine translating the source data into the known target language and then fine-tuning. Our work focuses on a particular scenario where the target language is unknown during training. To this goal, we propose a novel method to augment the monolingual source data using multilingual code-switching via random translations to enhance a transformer's language neutrality when fine-tuning it for a downstream task. This method also helps discover novel insights on how code-switching with different language families around the world impact the performance on the target language. Experiments on the benchmark dataset of MultiATIS++ yielded an average improvement of +4.2% in accuracy for intent task and +1.8% in F1 for slot task using our method over the state-of-the-art across 8 different languages. Furthermore, we present an application of our method for crisis informatics using a new human-annotated tweet dataset of slot filling in English and Haitian Creole, collected during Haiti earthquake disaster.

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2103.07792 [cs.CL]
	(or arXiv:2103.07792v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2103.07792

Submission history

From: Jitin Krishnan [view email]
[v1] Sat, 13 Mar 2021 21:05:09 UTC (1,550 KB)
[v2] Tue, 16 Mar 2021 16:39:48 UTC (8,628 KB)

Computer Science > Computation and Language

Title:Multilingual Code-Switching for Zero-Shot Cross-Lingual Intent Prediction and Slot Filling

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Multilingual Code-Switching for Zero-Shot Cross-Lingual Intent Prediction and Slot Filling

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators