Online Algorithm for Unsupervised Sequential Selection with Contextual Information

Verma, Arun; Hanawal, Manjesh K.; Szepesvári, Csaba; Saligrama, Venkatesh

arXiv:2010.12353 (cs)

[Submitted on 23 Oct 2020]

Abstract: In this paper, we study Contextual Unsupervised Sequential Selection (USS), a new variant of the stochastic contextual bandits problem in which the loss of an arm cannot be inferred from the observed feedback. In our setup, arms are associated with fixed costs and are ordered, forming a cascade. In each round, a context is presented, and the learner selects arms sequentially up to some depth. The total cost incurred by stopping at an arm is the sum of the fixed costs of the arms selected and the stochastic loss associated with that arm. The learner's goal is to learn a decision rule that maps contexts to arms so as to minimize the total expected loss. The problem is challenging because the setting is unsupervised: the total loss cannot be estimated. Clearly, learning is feasible only if the optimal arm can be inferred (explicitly or implicitly) from the problem structure. We observe that learning is still possible when the problem instance satisfies the so-called 'Contextual Weak Dominance' (CWD) property. Under CWD, we propose an algorithm for the contextual USS problem and demonstrate that it has sub-linear regret. Experiments on synthetic and real datasets validate our algorithm.
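
The cost structure described in the abstract can be illustrated with a minimal Python sketch. This is not the paper's algorithm: it only computes, for one fixed context, the expected total cost of stopping at each arm of the cascade, under the assumption that the expected losses are known. In the actual USS setting these losses are not observable, which is what makes the problem hard. The function names and the numeric values are hypothetical and chosen for illustration only.

```python
from itertools import accumulate


def total_expected_costs(fixed_costs, expected_losses):
    """Expected total cost of stopping at each arm of the cascade.

    Stopping at arm i means arms 0..i were all selected, so the learner
    pays the sum of their fixed costs plus the expected loss of arm i.
    ASSUMPTION: expected_losses are given here for illustration only;
    in the USS setting they cannot be estimated from feedback.
    """
    if len(fixed_costs) != len(expected_losses):
        raise ValueError("need one fixed cost and one expected loss per arm")
    cumulative_costs = accumulate(fixed_costs)
    return [cost + loss for cost, loss in zip(cumulative_costs, expected_losses)]


def best_stopping_arm(fixed_costs, expected_losses):
    """Index of the arm with the smallest expected total cost."""
    totals = total_expected_costs(fixed_costs, expected_losses)
    return min(range(len(totals)), key=totals.__getitem__)


# Hypothetical three-arm cascade for a single context:
# later arms are more accurate (lower loss) but cost more to reach.
fixed_costs = [0.0, 0.125, 0.25]
expected_losses = [0.5, 0.25, 0.125]

print(total_expected_costs(fixed_costs, expected_losses))
print(best_stopping_arm(fixed_costs, expected_losses))
```

In this hypothetical instance, stopping at the middle arm is cheapest: the accuracy gained by the last arm does not justify its additional fixed cost. A contextual decision rule would make this choice separately for each context.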

Comments:	Accepted to NeurIPS 2020
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2010.12353 [cs.LG]
	(or arXiv:2010.12353v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2010.12353

Submission history

From: Arun Verma
[v1] Fri, 23 Oct 2020 12:32:21 UTC (236 KB)
