Domain Adaptation for Statistical Classifiers

Daume III, H.; Marcu, D.

doi:10.1613/jair.1872

Computer Science > Machine Learning

arXiv:1109.6341 (cs)

[Submitted on 28 Sep 2011]

Title:Domain Adaptation for Statistical Classifiers

Authors:H. Daume III, D. Marcu

View PDF

Abstract:The most basic assumption used in statistical learning theory is that training data and test data are drawn from the same underlying distribution. Unfortunately, in many applications, the "in-domain" test data is drawn from a distribution that is related, but not identical, to the "out-of-domain" distribution of the training data. We consider the common case in which labeled out-of-domain data is plentiful, but labeled in-domain data is scarce. We introduce a statistical formulation of this problem in terms of a simple mixture model and present an instantiation of this framework to maximum entropy classifiers and their linear chain counterparts. We present efficient inference algorithms for this special case based on the technique of conditional expectation maximization. Our experimental results show that our approach leads to improved performance on three real world tasks on four different data sets from the natural language processing domain.

Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL)
Cite as:	arXiv:1109.6341 [cs.LG]
	(or arXiv:1109.6341v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1109.6341
Journal reference:	Journal Of Artificial Intelligence Research, Volume 26, pages 101-126, 2006
Related DOI:	https://doi.org/10.1613/jair.1872

Submission history

From: H. Daume III [view email] [via jair.org as proxy]
[v1] Wed, 28 Sep 2011 20:18:30 UTC (68 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2011-09

Change to browse by:

cs
cs.CL

References & Citations

DBLP - CS Bibliography

listing | bibtex

Hal Daumé III
H. Daume III
Daniel Marcu
D. Marcu

export BibTeX citation

Computer Science > Machine Learning

Title:Domain Adaptation for Statistical Classifiers

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Domain Adaptation for Statistical Classifiers

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators