Search Engine People Blog

Can scraper sites hurt us now?

About a year ago, there was some discussion about whether scrapers could be hurting legitimate sites' rankings. One such thread is here and blog posts are here and here. Even the Washington Post got a word in. The consensus seemed to be that scrapers would not hurt your rankings, although opinions differed.

I wonder whether the issue should be raised again, based on the information we've received from Matt Cutts in his Big Daddy Indexing Timeline. A couple of things he mentioned were:

"It’s true that if you had N backlinks and some fraction of those are considered lower quality, we’d crawl your site less than if all N were fantastic."

"Off-topic links wouldn’t cause a penalty by themselves. Now if the off-topic links are spammy, that could cause a problem."

I bring this up because I'm watching one of my sites get more and more scraper backlinks every day. It would be impossible for me to obtain enough natural backlinks (or even "arranged" backlinks) to outweigh them. Could these scraper links be hurting me? They are certainly "spammy", I would think, and no doubt they are considered "lower quality". And if a bot were to measure them against my total backlinks, it would probably determine that my site has many more low-quality backlinks than high-quality ones.
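To put a number on that worry, here is a rough sketch of the "fraction of N backlinks" idea from the quote above. It is purely illustrative: nobody outside Google knows how (or whether) such a ratio is computed, and the `low_quality` flag, the `low_quality_fraction` helper, and the sample data are all assumptions of mine.

```python
def low_quality_fraction(backlinks):
    """Return the share of backlinks flagged as low quality (0.0 if there are none)."""
    if not backlinks:
        return 0.0
    flagged = sum(1 for link in backlinks if link["low_quality"])
    return flagged / len(backlinks)


# Hypothetical sample: 3 natural links and 9 scraper links.
# The "low_quality" flag is hand-assigned here; a real system would have to infer it.
sample = (
    [{"source": f"natural-{i}.example", "low_quality": False} for i in range(3)]
    + [{"source": f"scraper-{i}.example", "low_quality": True} for i in range(9)]
)

print(low_quality_fraction(sample))  # 0.75
```

With numbers like these, three quarters of the backlink profile would count as low quality, and each new scraper link pushes the ratio higher no matter what the site owner does.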

In addition, nearly all of these scraper links use the same anchor text, which could in itself cause some sort of "too similar" penalty. We usually believe that our anchor text should be natural and varied. Are these scrapers, by using exactly the same anchor text, making it look as though I am creating an unnatural backlink pattern?
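If you want to check how lopsided your own anchor text is, a minimal sketch along these lines would do it. It assumes you already have a list of anchor texts exported from some backlink report; the `top_anchor_share` helper and the sample anchors are hypothetical, and whether any search engine applies a threshold to a figure like this is pure speculation.

```python
from collections import Counter


def top_anchor_share(anchor_texts):
    """Return (most common anchor text, its share of all links)."""
    if not anchor_texts:
        return None, 0.0
    # Normalize case and whitespace so trivially different anchors are grouped together.
    normalized = [text.strip().lower() for text in anchor_texts]
    anchor, count = Counter(normalized).most_common(1)[0]
    return anchor, count / len(normalized)


# Hypothetical sample: 8 scraper links with identical anchor text, 2 varied ones.
anchors = ["Blue Widgets"] * 8 + ["widget reviews", "example.com"]

print(top_anchor_share(anchors))  # ('blue widgets', 0.8)
```

A single anchor accounting for most of the links is the kind of pattern I'm worried about, even though in this case I had nothing to do with creating it.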

Perhaps, a year ago, this situation wasn't causing a problem. But now? With Big Daddy? Could scraper sites be causing harm now? I hope not, but I think it's possible. And unfortunately, if it's true, there's nothing any of us can do about it. My hope is that Google simply disregards those scraper links and counts them neither for nor against a site. But I'm a bit worried that the algorithm isn't quite that bright. Ah well, just something to think about, as I adjust my tinfoil hat.