Google gets a clearer picture of image search
Posted on 29 Apr 2008 at 08:09
Google researchers have devised a new technique that they claim could vastly improve the quality of image searches.
The company aims to make its image search results as relevant as text searches, but is hampered by a computer's reliance on text cues to decipher the content of a picture.
"Although image search has become a popular feature in many search engines, including Yahoo, MSN, Google etc, the majority of image searches use little, if any, image information to rank the images," claims Google researcher Shumeet Baluja in a research paper.
"Instead, commonly only the text on the pages in which the image is embedded (text in the body of the page, anchor-text, image name, etc) is used."
Such dependency on text can throw up freak results, such as a magazine front cover of Monica Lewinsky dressed as Mona Lisa appearing when people search for the painting.
To address this, the Google researchers have devised a new algorithm called VisualRank that looks for visual themes across a range of photos, with images ranked according to how similar they are to other images that contain that theme.
A search for McDonald's, for example, may look for the famous golden arches, with photos in which the logo has been partially cropped or is not the main focus of the image ranking lower than those in which it features prominently.
"The second challenge is that even after we find the common features in the images, we need a mechanism to utilise this information for the purposes of ranking," writes Baluja.
"Simply counting the number of common visual features will yield poor results. To address this task, we infer a graph between the images, where images are linked to each other based on their similarity. Once a graph is created, we demonstrate how iterative procedures similar to those used in PageRank can be employed to effectively create a ranking of images."
"We implicitly rely on the intelligence of crowds: the image similarity graph is generated based on the common features between images. Those images that capture the common themes from many of the other images are those that will have higher rank."
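The approach Baluja describes, a similarity graph ranked by a PageRank-like iteration, can be illustrated with a minimal sketch. This is not Google's implementation: the function name `visual_rank`, the damping value of 0.85, the iteration count and the hand-written similarity scores are all assumptions made for illustration, and the step of extracting visual features from real images is left out entirely.

```python
def visual_rank(similarity, damping=0.85, iterations=50):
    """Rank images by PageRank-style power iteration over a similarity graph.

    similarity[i][j] is an assumed non-negative visual similarity score
    between image i and image j (symmetric, zero on the diagonal).
    """
    n = len(similarity)
    # Total similarity attached to each image, used to normalise its
    # outgoing weight so it shares its rank among its neighbours.
    col_sums = [sum(similarity[i][j] for i in range(n)) for j in range(n)]
    rank = [1.0 / n] * n  # start with every image equally ranked
    for _ in range(iterations):
        new_rank = []
        for i in range(n):
            # Rank flowing into image i from every image it resembles.
            incoming = sum(
                similarity[i][j] / col_sums[j] * rank[j]
                for j in range(n)
                if col_sums[j] > 0
            )
            new_rank.append(damping * incoming + (1.0 - damping) / n)
        rank = new_rank
    return rank


# Hypothetical scores for four images: images 0, 1 and 2 share a strong
# common theme, while image 3 only faintly resembles image 0.
similarity = [
    [0.0, 0.9, 0.8, 0.1],
    [0.9, 0.0, 0.7, 0.0],
    [0.8, 0.7, 0.0, 0.0],
    [0.1, 0.0, 0.0, 0.0],
]
scores = visual_rank(similarity)
order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
print(order)
```

In this toy graph the image most similar to the others comes out on top and the outlier comes last, which matches the idea that images capturing the common theme earn a higher rank than a simple count of shared features would give them.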
Baluja claims that when 2,000 Google employees were asked to rank the relevance of VisualRank results compared to standard image search, the new algorithm returned 83% fewer irrelevant images.
Author: Barry Collins

Printed from www.pcpro.co.uk