Google gets a clearer picture of image search
Posted on 29 Apr 2008 at 08:09
Google researchers have devised a new technique that they claim could vastly improve the quality of image searches.
The company aims to make its image search results as relevant as text searches, but is hampered by a computer's reliance on text cues to decipher the content of a picture.
"Although image search has become a popular feature in many search engines, including Yahoo, MSN, Google etc, the majority of image searches use little, if any, image information to rank the images," claims Google researcher Shumeet Baluja in a research paper.
"Instead, commonly only the text on the pages in which the image is embedded (text in the body of the page, anchor-text, image name, etc) is used."
Such dependency on text can throw up freak results, such as a magazine front cover of Monica Lewinsky dressed as Mona Lisa appearing when people search for the painting.
So the Google researchers have devised a new algorithm called VisualRank that looks for visual themes across a range of photos, with images ranked according to how similar they are to other images that contain that theme.
So, for example, a search for McDonalds may look for the famous golden arches, with those photos where the logo has been partially cropped or not made the main focus of the image ranking lower than those that do.
"The second challenge is that even after we find the common features in the images, we need a mechanism to utilise this information for the purposes of ranking," writes Baluja.
"Simply counting the number of common visual features will yield poor results. To address this task, we infer a graph between the images, where images are linked to each other based on their similarity. Once a graph is created, we demonstrate how iterative procedures similar to those used in PageRank can be employed to effectively create a ranking of images."
"We implicitly rely on the intelligence of crowds: the image similarity graph is generated based on the common features between images. Those images that capture the common themes from many of the other images are those that will have higher rank."
Baluja claims that when 2,000 Google employees were asked to rank the relevance of VisualRank results compared to standard image search, the new algorithm returned 83% fewer irrelevant images.
Author: Barry Collins
advertisement
- Need a bit of extra Christmas cash? Grass up your boss, says BSA
- Photoshop Mobile on Android review: first look
- ATI Radeon HD 5970: 42% more expensive in the UK
- Office 2010 Beta – 32-bit or 64-bit – The Choice is Clear
- Why Britain's watchdogs have fewer teeth than goldfish
- Tabbed documents: how to make Office 2010 great
- Outlook 2010 People Pane – does it spell death to Xobni
- Microsoft Outlook 2010 screenshots
- Co-Authoring in Word 2010 and SharePoint Foundation 2010
- Microsoft Outlook 2010 screenshots: Backstage view
- Getting to grips with Microsoft's IT Health Environment Scanner
- Virtualise your servers
- The changing face of travel gadgets
- Build your own distributed file system
- The bulletproof Dell that costs an arm and a leg
- Microsoft Office 2010 Technical Preview: Q&A
- Lawnmowers, the TyTN II and one odd insurance request
- There'll never be a bulletproof OS
- How far can we trust apps?
- Five nice touches in Outlook 2010
advertisement
Printed from www.pcpro.co.uk


