Motivation We study the use of everyday words to describe images. The common saying has it that a picture is worth a thousand words; here we ask, which thousand? The proliferation of tagged social multimedia data presents a challenge to understanding collective tag use at large scale. One can ask whether patterns in photo tags help us understand tag-tag relations, and how they can be leveraged to improve visual search and recognition.
Methodology There are three main parts of this work:
Results We analyze over 5 million photos with over 20,000 visual tags. The statistics from this collection lead to good results for image tagging, relationship estimation, and generalizing to unseen tags. This is a first step in analyzing picture tags and everyday semantic knowledge. Other potential applications include generating natural language descriptions of pictures, as well as validating and supplementing knowledge databases.
Demo: Explore the visual informativeness of tags This link will lead to an interactive plot exploring the visual informativeness of photo tags.
Data We release the data file that contains the list of associated Flickr images for each synset in ImageNet/WordNet.
>> head n03437741.txt
n03437741 2517428224 http://farm4.static.flickr.com/3160/2517428224_f0ac83532f.jpg
n03437741 3106498561 http://farm4.static.flickr.com/3150/3106498561_f3e8c5c580.jpg
n03437741 1290786822 http://farm2.static.flickr.com/1051/1290786822_9a9e69e00d.jpg
n03437741 3421957804 http://farm4.static.flickr.com/3629/3421957804_7990e8f7b2.jpg
... ...
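The sample output above suggests that each record holds three whitespace-separated fields: a synset ID, a Flickr photo ID, and an image URL. The following is a minimal sketch of reading such records in Python, assuming that layout and one record per line; neither is confirmed beyond the sample shown. The names `FlickrRecord` and `parse_records` are hypothetical and not part of the released data.

```python
from typing import Iterable, Iterator, NamedTuple


class FlickrRecord(NamedTuple):
    """One record of the data file (field layout assumed from the sample)."""
    synset_id: str  # ImageNet/WordNet synset ID, e.g. "n03437741"
    photo_id: str   # Flickr photo ID
    url: str        # Flickr static image URL


def parse_records(lines: Iterable[str]) -> Iterator[FlickrRecord]:
    """Yield a record for every line that has exactly three fields."""
    for line in lines:
        fields = line.split()
        # Skip blank lines and anything that does not match the assumed layout.
        if len(fields) != 3:
            continue
        yield FlickrRecord(*fields)


# Two lines copied from the sample above, plus one malformed line.
sample = [
    "n03437741 2517428224 http://farm4.static.flickr.com/3160/2517428224_f0ac83532f.jpg",
    "n03437741 3106498561 http://farm4.static.flickr.com/3150/3106498561_f3e8c5c580.jpg",
    "not-a-record",
]
records = list(parse_records(sample))
```

To read a real file, pass an open file handle instead of the list, for example `parse_records(open("n03437741.txt"))`. Photo IDs are kept as strings so they can be matched against the URL without conversion.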
September 2013 Contact: Lexing Xie