Site icon Search Engine People Blog

International SEO: How to Fix & Prevent Duplicate Content Issues

international-seo

International SEO is becoming increasingly important for online companies to meet their ultimate aim of growth. Due to the Internet, expanding your business to any country in the world is fortunately a click away. All you need is an optimized website that caters to audience across national borders. However, doing so without duplicate content can be easier said than done.
First, there's the major problem of a language barrier. There are also international search engines that go well beyond Google and Bing. For example, Yandex is the preferred search engine in Russia and Baidu is popular in China. Even when you simply translate your content into different languages, you risk being penalized by Google and other search engines for duplicate content. In this article, I'll look closely at the common content issues businesses commonly face with international SEO. 

Debunking Myths About International Seo

Let's start by bursting some popular myths circulating around the international SEO realm so that you can stay on the right track.

Summarized from Martin Kura's excellent post on the subject:

The Major Issue In International SEO


First and foremost, the biggest problem that comes in international SEO is finding the right approach to target different countries. Second, most businesses face challenges in avoiding the problem of duplicate content. Writing unique content for each page per the location isn't the best approach. This will affect your budget costs and squeeze valuable time. 

Avoiding Duplicate Content in International SEO

Before I get into the topic of duplicate content, you must understand that you can create either a multilingual website or multi-regional website. Here are the differences:

How to Manage Multilingual Versions on Your Website?

Content in different languages is not considered duplicate content if it's done manually with correct grammar and intent. However, using auto-translated text from tools like Google Translator may create duplicate content issues. You can use robot.txt to block the search engines from indexing automatically translated pages, which can help avoid duplicate content. When Google indexes incoherent text, it might regard such content as spam and block the page. Steer clear of translating boilerplate content into different languages as it negatively affects user experience and isn't acceptable by search engines.

For international SEO, it's best to use a single language with a navigation tab for visitors to choose any language they speak. Text should be manually translated. Ensure that each language version can be discovered easily. Refrain from using cookies to guide a user from one language to another. Automatic redirection is also a bad idea because it prevents visitors from exploring the website the way he or she wants. It can also limit the search engines from indexing your website in its entirety. Use interlinking between languages so that users can land on the right language with a simple click.

Preferably, your URL should tell the user what language they're getting. If you want a French user to click on the French version of the text, the URL must contain French words without any English. Google doesn't use code-level information like the "lang" attribute to understand a website's langauge. The search engine reads the content of the page to understand it. I'll cover how to use the hreflang tag instead to guarantee 100 percent safety. 

Targeting a Specific Country

Google encourages website owners to inform them of their targeted countries for enhancing search results. The search engine has set aside the following elements of international SEO for this purpose:

Tackling Duplicate Content on International Sites

It's common for websites to provide similar or the same content in different languages when targeting different regions while having different URLs. Google is okay with this as long as the users are from different countries. Your website will not be penalized when translation is manual and accurate. Even though Google still prefers unique content for each version, it understands that having unique content can be quite tough. Google clearly states that you don't need to hide such content by not allowing Google to crawl it using a robots.txt file or no index robots meta tag.

The circumstances are entirely different if you're providing the same content to the same audience through two URLs. Let me explain this with an example. Imagine you've created yourbusiness.com and yourbusiness.com.au. One targets the USA and other targets Australia respectively. Since both are in English, this will cause duplicate content. Luckily, it can be easily solved using a hreflang tag, which is widely accepted by all search engines globally. 

Using the "Hreflang" Tag

As I mentioned earlier, the hreflang tag protects international SEO campaigns from being penalized with duplicate content. It's usually required by businesses that cater to different languages or countries through sub-domains, subfolders, or ccTLD. The hreflang tag also is important if you have multiple languages for one single targeted country. Here's how you can go about implementing it:

Should only contain the alternate versions like

<link rel="alternate" hreflang="en-us" href="http://en-us. xyz.com/page.html" />

and for page http://en-us. xyz.com/page.html/, the tag should be.

<link rel="alternate" hreflang="en-us" href=" http://en-gb. xyz.com/page.html" />

After implementation you can check that what you've done works properly by logging into your Google Webmaster Tool account. Proceed to "Search Traffic" and then "International Targeting." If the hreflang tags were placed properly, you'll be able to test them utilizing the feature presented there. When problems ensue, try using the hreflang tag generator tool to make things easy. 

Common Mistakes to Avoid

Challenges with Canonical Tags

The purpose of the canonical tag or rel=canonical is simple. Consider you have two pages with different URLs that have exactly the same content. Here you would place a canonical tag on one page so that Google only indexes that. Yet, there are several problems that can arise when using the canonical tag.

From what you've read so far, there's no doubt that the hreflang tag is used for geo-targeting. Canonical tags are used to solve duplication issues. However, with canonical, you must have a preferred version of the web page. For you, this means spending a lot of time, effort, and likely cost. Rel=alternate hreflang tags have an advantage. They can be keyed in with ccTLDs to inhbit users from getting the notion that they need a .com. ccTLDs are more apt at achieving better results.

What the Hreflang and Canonical Test Tells Us

State of Digital put hreflang and canonical tags to the test. Here's what their study found:

As the person in charge of international SEO, there's no doubt that you'd be quite busy the year round. Hopefully the information gained here will be useful in steering clear of costly penalties and de-rankings from duplicate content. If you have additional international SEO tips or have unanswered questions, please feel free to use the comment box below.