Cross-Modal Similarity Learning : A Low Rank Bilinear Formulation

Kang, Cuicui; Liao, Shengcai; He, Yonghao; Wang, Jian; Niu, Wenjia; Xiang, Shiming; Pan, Chunhong

Computer Science > Multimedia

arXiv:1411.4738 (cs)

[Submitted on 18 Nov 2014 (v1), last revised 17 Dec 2015 (this version, v2)]

Title:Cross-Modal Similarity Learning : A Low Rank Bilinear Formulation

Authors:Cuicui Kang, Shengcai Liao, Yonghao He, Jian Wang, Wenjia Niu, Shiming Xiang, Chunhong Pan

View PDF

Abstract:The cross-media retrieval problem has received much attention in recent years due to the rapid increasing of multimedia data on the Internet. A new approach to the problem has been raised which intends to match features of different modalities directly. In this research, there are two critical issues: how to get rid of the heterogeneity between different modalities and how to match the cross-modal features of different dimensions. Recently metric learning methods show a good capability in learning a distance metric to explore the relationship between data points. However, the traditional metric learning algorithms only focus on single-modal features, which suffer difficulties in addressing the cross-modal features of different dimensions. In this paper, we propose a cross-modal similarity learning algorithm for the cross-modal feature matching. The proposed method takes a bilinear formulation, and with the nuclear-norm penalization, it achieves low-rank representation. Accordingly, the accelerated proximal gradient algorithm is successfully imported to find the optimal solution with a fast convergence rate O(1/t^2). Experiments on three well known image-text cross-media retrieval databases show that the proposed method achieves the best performance compared to the state-of-the-art algorithms.

Subjects:	Multimedia (cs.MM); Information Retrieval (cs.IR); Machine Learning (cs.LG)
Cite as:	arXiv:1411.4738 [cs.MM]
	(or arXiv:1411.4738v2 [cs.MM] for this version)
	https://doi.org/10.48550/arXiv.1411.4738

Submission history

From: Cuicui Kang [view email]
[v1] Tue, 18 Nov 2014 05:53:06 UTC (67 KB)
[v2] Thu, 17 Dec 2015 01:25:27 UTC (2,437 KB)

Computer Science > Multimedia

Title:Cross-Modal Similarity Learning : A Low Rank Bilinear Formulation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Multimedia

Title:Cross-Modal Similarity Learning : A Low Rank Bilinear Formulation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators