This paper describes our solution to the ICML multi-modal learning challenge. The solution consists of constructing three-level representations in three consecutive stages and then choosing the correct tag words with a data-specific strategy. First, we obtain level-1 representations with standard methods: each image is represented by MPEG-7 and gist descriptors together with additional features released by the contest organizers, and the corresponding word tags are represented by a bag-of-words model over a dictionary of 4000 words. Second, we learn level-2 representations using two stacked RBMs for each modality. Third, we propose a bimodal auto-encoder that learns the similarities and dissimilarities between image-tag pairs as level-3 representations. Finally, at test time, based on an observation about the dataset, we devise a data-specific strategy for choosing the correct tag words, which yields a substantial improvement in overall performance. Our final average accuracy on the private test set is 100%, ranking first in this challenge.
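
The sketch below is a minimal, hypothetical illustration of the three-stage representation pipeline outlined above, not the authors' implementation. It assumes placeholder feature dimensions, uses scikit-learn's BernoulliRBM to stand in for the stacked RBMs, and replaces the bimodal auto-encoder with a simple embedding-distance score purely to show how image-tag pairs could be compared.

```python
# Hypothetical sketch of the three-stage pipeline; dimensions and scoring
# are illustrative assumptions, not the configuration from the paper.
import numpy as np
from sklearn.neural_network import BernoulliRBM
from sklearn.pipeline import Pipeline

rng = np.random.RandomState(0)

# Level-1 representations (placeholders for MPEG-7/gist image features
# and bag-of-words tag vectors over a 4000-word dictionary).
n_pairs = 100
image_feats = rng.rand(n_pairs, 512)                          # image descriptors scaled to [0, 1]
tag_feats = (rng.rand(n_pairs, 4000) > 0.99).astype(float)    # sparse binary bag-of-words

# Level-2 representations: two stacked RBMs per modality.
def stacked_rbms(n_hidden1, n_hidden2):
    return Pipeline([
        ("rbm1", BernoulliRBM(n_components=n_hidden1, learning_rate=0.05,
                              n_iter=10, random_state=0)),
        ("rbm2", BernoulliRBM(n_components=n_hidden2, learning_rate=0.05,
                              n_iter=10, random_state=0)),
    ])

image_l2 = stacked_rbms(256, 128).fit_transform(image_feats)
tag_l2 = stacked_rbms(256, 128).fit_transform(tag_feats)

# Level-3 stand-in: instead of the bimodal auto-encoder, score each
# image-tag pair by the distance between its two level-2 embeddings
# (higher score = more similar).
joint = np.hstack([image_l2, tag_l2])
scores = -np.linalg.norm(image_l2 - tag_l2, axis=1)
print(joint.shape, scores[:5])
```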