Cross-modal similarity learning: A low rank bilinear formulation

Cuicui Kang, Shengcai Liao, Yonghao He, Jian Wang, Wenjia Niu, Shiming Xiang, Chunhong Pan

Research output: Chapter in Book/Report/Conference proceedingConference contribution

20 Citations (Scopus)

Abstract

The cross-media retrieval problem has received much attention in recent years due to the rapid increasing of multimedia data on the Internet. A new approach to the problem has been raised which intends to match features of different modalities directly. In this research, there are two critical issues: how to get rid of the heterogeneity between different modalities and how to match the cross-modal features of different dimensions. Recently metric learning methods show a good capability in learning a distance metric to explore the relationship between data points. However, the traditional metric learning algorithms only focus on single-modal features, which suffer difficulties in addressing the cross-modal features of different dimensions. In this paper, we propose a cross-modal similarity learning algorithm for the cross-modal feature matching. The proposed method takes a bilinear formulation, and with the nuclear-norm penalization, it achieves low-rank representation. Accordingly, the accelerated proximal gradient algorithm is successfully imported to find the optimal solution with a fast convergence rate O(1/t2). Experiments on three well known image-text crossmedia retrieval databases show that the proposed method achieves the best performance compared to the state-of-the-art algorithms.

Original languageEnglish
Title of host publicationCIKM 2015 - Proceedings of the 24th ACM International Conference on Information and Knowledge Management
PublisherAssociation for Computing Machinery
Pages1251-1260
Number of pages10
ISBN (Electronic)9781450337946
DOIs
Publication statusPublished - Oct 17 2015
Externally publishedYes
Event24th ACM International Conference on Information and Knowledge Management, CIKM 2015 - Melbourne, Australia
Duration: Oct 19 2015Oct 23 2015

Publication series

NameInternational Conference on Information and Knowledge Management, Proceedings
Volume19-23-Oct-2015

Conference

Conference24th ACM International Conference on Information and Knowledge Management, CIKM 2015
Country/TerritoryAustralia
CityMelbourne
Period10/19/1510/23/15

Keywords

  • Accelerated proximal gradient
  • Cross-Modality
  • Multimedia retrieval
  • Nuclear norm
  • Similarity learning

ASJC Scopus subject areas

  • General Business,Management and Accounting
  • General Decision Sciences

Fingerprint

Dive into the research topics of 'Cross-modal similarity learning: A low rank bilinear formulation'. Together they form a unique fingerprint.

Cite this