Search Engine People - Search Engine Positioning, Placement Service
Home  |  Blog  |  About Us  |  Careers  |  News  |  Contact Us

How Search Really Works: Grabbing Most Red M&M’s

Ruud HeinWelcome! Thanks for visiting!

Subscribe to the full feed

by Ruud Hein
May 2, 2008

This post is part of an ongoing series: How Search Really Works.
Previously: Relevance (2)

Instead of painstakingly grabbing the absolute best matches for your query to then rank those with infinite precision, one time saving strategy has search engines go for “close enough”.

Painstaking Precision

sorted-mm

Given all the time, money and resources in the world, here’s what we’d normally do.

Word by word you go through a search. You look in your documents and see which has word one…. word two… word three…. You get the picture.

You give some plus points for every time a searched word appears in a document. How many points? Depends on the TFxIDF score for that specific word.

You add up the scoring into a sum, measured by relevance again. Do the same for the query itself (treating it like a very short document basically). In short: you’re calculating their vector space scores.

Measure the mathematical similarity between document 1 and the query, document 2 and the query, document 3 and… Yup.

And then you can’t just slap all those on the screen. You have to tailor to the searcher’s need and pick and sort the top scoring documents!

Now you could sort all your scored documents at once or just go for the top number of documents needed; say the first or next 10 because the searchers has that set as maximum results per page.

Instead of doing this HUGE sorting routine you throw all values together in a big black hat (mathematicians call this a “heap”), come up with the top 10 documents or so and only then sort them.

Perceived Relevance

mms-in-bowl

What struck me as funny is that this high precision, high-cost way of doing things doesn’t necessarily mean you get the most bang for you buck, the best quality results for your searcher’s patience.

No, the mathematical similarity between our search and those documents is something we perceive as relevant.

That’s a low payback to work for when the cost is so high; comparing a huge number of documents, calculating mathematical similarities…. grabbing the top of the heap…

The perception of relevance is something a search engine can use though by going for “good enough”.

The Inexact Sort Of Top 10-ish Documents You Might Want

mixed-mm

Instead of calculating the top 10 with high precision, why not grab a bundle of documents that will most probably be in that top 10?

Just grab a bunch of documents that are in the race to be the answer to the searcher’s query and take the top 10 of that bunch!

Even though this top 10 is not The Top 10 we would have found using our Painstaking Precision method, it will contain many documents that would have been in that top 10 or near it.

It’s like having a bowl of M&M’s and wanting to eat red ones. You could sort them out painstakingly and then go for the red ones… or you could grab in that area where you see most of the red ones seem to be.

In order of appearance, images courtesy of westpark, Irina Souiki and jacalynsnana

I hang out at Twitter where I enjoy the company, the buzz, the nuggets of info and opinion we pass along.
Join me on Twitter!
• Get Search Engine People delivered by email

As posted in How Search Really Works.

You're welcome to join the conversation; add your response. You can track the conversation using the RSS 2.0 feed.
You can also trackback from your own site.

Leave a Reply

« Friday Funnies: When World’s Collide
SEO is about to Grow Up »

Subscribe

Full Feed
Email Updates

Recent Posts

  • Let Me Count The Ways: Enumerated Sphinn Wisdom
  • Friday Funnies: Independence Day
  • Google versus Les Pages Jaunes
  • 50 Sites et + Pour Vous Aider à Enterrer les Commentaires Négatifs sur Vous ou Votre Compagnie!
  • 50 + Sitios que Ayudarán a Ocultar Publicaciones Negativas Acerca de Usted o de su Compañía
  • Key Elements of an Online Community Strategy
  • The Art of Eluding Google: Is It Even Possible?
  • Using the Cross Pollination Concept to Aid With Social Media Success!
  • Perpetuum Mobile SEO : Reaping The Benefits
  • Website Transition Planning Critical When Making Changes

Most Popular Ever

  • 50 Sites to help your bury negative posts about you or your company
  • What is authority and how do you build it?
  • How to sell your client on a blog strategy?
  • Dude I'm phaaaaaat
  • Google vs. Yellow Pages

Most Popular this Month

  • Friday Funnies: The Last Judgement
  • Indiana Jones and the Age of an SEO
  • Link Request Strategies for Blogs, Edu’s & .Gov’s: Respect My Authoritah!
  • 50+ Sites To Help You Bury Negative Posts About You or Your Company!
  • Friday Funnies: Warning Labels For Bloggers

Subjects

  • Affiliate Marketing
  • Authority Building
  • Blogging
  • Branding
  • Canada
  • Content
  • Coupons
  • eBooks
  • En Español
  • En français
  • En fran栩s
  • Events
  • Experiments
  • Francophone
  • Funnies
  • Google
  • Guest Post
  • How Search Really Works
  • Local Search
  • Mobile Search
  • MSN/Live
  • News
  • Online Marketing
  • Online Retailing
  • Online Shopping
  • Opinion
  • Pages Jaunes
  • PPC
  • Quebec
  • Reputation Management
  • SEM
  • SEO
  • Social Media
  • Spanish
  • Stats
  • Technology
  • The Algorithm is Human
  • Tips
  • Tools
  • video
  • Yahoo
  • Yellow Pages

Archive

  • July 2008
  • June 2008
  • May 2008
  • April 2008
  • March 2008
  • February 2008
  • January 2008
  • December 2007
  • November 2007
  • October 2007
  • September 2007
  • August 2007
  • July 2007
  • June 2007
  • May 2007
  • April 2007
  • March 2007
  • February 2007
  • January 2007
  • September 2006
  • July 2006
  • May 2006
  • March 2006

Search


Recent Readers

The Writers

  • Jeff Quipp
  • Jennifer Osborne
  • Ruud Hein
  • Tom Tsinas

Top Commentators

  • Lily (5)
  • Utah SEO (5)
  • Comparison Shopping (4)
  • Chelle (4)
  • Paul (3)
  • Metaspring (3)
  • Wii Boy (3)
  • Phil Benwell (2)
  • Yossarian (2)
  • Marketing Man (2)

Blogroll

  • AbleReach Blog
  • aimClear Blog
  • Bill Hartzer
  • Blah Blah Tech
  • Courtney Tuttle's Blog
  • DoshDosh
  • Geyser Marketing
  • Gray Wolf's SEO Blog
  • Justilien - Link Building
  • Learning SEO Basics
  • Matt Cutts Blog
  • New Orleans Internet Marketing
  • NorthSouthMedia
  • Nowsourcing
  • Profectio - Dave Forde
  • Quiddity - Essence SEO Blog
  • Search Engine College
  • Search Engine Jounal
  • Search Engine Land
  • Search Engine Watch
  • SEO by the SEA
  • SEO Design Solutions
  • SEOco UK Blog
  • SEOPittfall
  • SexySEO
  • Small Business SEM
  • Social Desire
  • Sphinn
  • Stepforth.com - Ross Dunn
  • Stephan Spencer's Scatterings
  • Stuntdubl
  • Techipedia
  • Tim Nash
  • Top Rank Blog
  • Trail of the Fire Horse
  • Utah SEO Blog
  • Yeepage Blogging Tips

SEO Toronto - Search Engine Optimization Specialists
Copyright © Search Engine People - All Rights Reserved.
Contact Us at 1-877-486-7875 or 905-426-9340 - contact@searchenginepeople.com