Correct Your URL Canonicalization

Published November 10, 2008
 
URL canonicalization, also known as URL normalization, is the process of modifying your URLs so that they all follow one consistent, standard form. Why would you want to do this? Clean and consistent URLs matter for your website's SEO because they help search engines avoid indexing the same page more than once.
 
Typically, URL canonicalization is most important for home pages. When a page address is written in several different ways, every link on your website gives search engines another variant to choose from. For instance, compare the syntax of the following home page addresses:
 
http://www.yourhomepage.com
http://www.yourhomepage.com/index.html
http://yourhomepage.com
http://yourhomepage.com/
 
You may notice that every one of the above examples is a valid link to the domain “yourhomepage.com.” In practice, however, a search engine could be confused and index each of these URLs as a separate page, possibly with different content.
 
Search Engine URL Normalization
 
Most search engines use URL normalization to organize and sort the different URLs that point to the same web page. When a search returns a set of duplicate listings for what is really one page, the search engine's ‘normalization’ tries to pick the best representative from the set to display.
 
Google tries to limit the negative effects of duplicate content for webmasters and keyword searchers alike. It has publicized the following policies:
 
1. When we detect duplicate content, such as through variations caused by URL parameters, we group the duplicate URLs into one cluster.
 
2. We select what we think is the "best" URL to represent the cluster in search results.
 
3. We then consolidate properties of the URLs in the cluster, such as link popularity, to the representative URL.
 
However, search engines cannot always normalize properly. A search algorithm may find several syntactically different links to the same web page, fail to recognize them as one page, and subsequently rank your website lower in search results. Why is that a problem? The search engine views your page as being published at many different URLs, which results in “duplicate content” that can dilute your link popularity.
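
The three steps quoted above (cluster, pick a representative, consolidate) can be illustrated with a small Python sketch. This is not Google's algorithm: the grouping rule in `cluster_key`, the idea of choosing the URL with the most inbound links, and the link counts themselves are all assumptions made up for illustration.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def cluster_key(url):
    """Hypothetical grouping rule: same host (ignoring 'www.') and same
    path once a trailing 'index.html' and trailing slash are dropped."""
    parts = urlsplit(url)
    host = parts.netloc.lower()
    if host.startswith("www."):
        host = host[4:]
    path = parts.path
    if path.endswith("/index.html"):
        path = path[: -len("index.html")]
    return host, path.rstrip("/")


def consolidate(link_counts):
    """link_counts maps URL -> inbound link count (illustrative numbers).

    Returns {representative URL: combined link count} per cluster."""
    clusters = defaultdict(list)
    for url in link_counts:                      # step 1: group duplicates
        clusters[cluster_key(url)].append(url)
    result = {}
    for urls in clusters.values():
        # step 2: assume the most-linked variant is the "best" URL
        best = max(urls, key=lambda u: link_counts[u])
        # step 3: fold the link popularity of the cluster into it
        result[best] = sum(link_counts[u] for u in urls)
    return result


links = {
    "http://www.yourhomepage.com": 40,
    "http://www.yourhomepage.com/index.html": 5,
    "http://yourhomepage.com": 12,
    "http://yourhomepage.com/": 3,
}
print(consolidate(links))  # {'http://www.yourhomepage.com': 60}
```

When the grouping rule fails to recognize two variants as the same page, they end up in separate clusters and their link counts stay split, which is the dilution described above.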
 
How to Canonicalize your Website
 
What is the correct link structure and syntax to use on your website? Professionals recommend various conventions, but the main thing to remember is to use consistent URL syntax across your entire site to reduce potential search engine problems. Here are some specific strategies to consider:
 
Use a trailing slash – A directory URL should end with a trailing slash, as in the following: http://www.yourhomepage.com/. There are several reasons for this strategy:
  • It lets the server return the correct file after a single request, rather than first redirecting the browser to the version with the slash.
  • Should you ever change your page files from .html to .php, you can keep the directory structure the same.
  • If you ever switch to a new programming language, your directory URLs can remain the same.

Convert all to lowercase – Although the scheme and host components of a URL are case-insensitive, most normalizers will convert them to lowercase. Paths can be case-sensitive on some servers, so avoid the extra work and keep your URLs consistent by writing them in lowercase letters throughout.
 
Remove the directory index – Directory index file names, as in yourhomepage.com/index.html, are generally not required in the URLs you link to. Be sure to link only to the directory itself, such as the main home page address.
 
Choosing www vs. non-www – Many domains use just the simple “example.com” without the “www”. However, both URLs should still lead to your home page. Whichever one you choose as your default home URL, be sure to set up a 301 redirect from the other form to that default.
 
Use a 301 Redirect – A 301 redirect forces every variant of an address to resolve to one URL. For instance, in the above example, many sites can be accessed by both “www.yourhomepage.com” and “yourhomepage.com”. A 301 (permanent) redirect tells search engines like Google which URL you prefer to be canonical. A 301 redirect is also a good idea for websites that are updated often with dynamic content or blogs.
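
As a rough illustration of how the strategies above fit together, here is a minimal Python sketch that computes a canonical URL and decides whether a request should be answered with a 301. The names `PREFERRED_HOST`, `INDEX_FILES`, `canonicalize` and `redirect_for` are hypothetical, the choice of the www form as the default is an assumption, and treating any last path segment without a dot as a directory is only a heuristic. A real site would normally configure this in the web server rather than in application code.

```python
from urllib.parse import urlsplit, urlunsplit

PREFERRED_HOST = "www.yourhomepage.com"    # assumption: the www form is the default
INDEX_FILES = ("index.html", "index.php")  # assumption: directory index file names


def canonicalize(url):
    """Return the canonical form of url under the assumptions above."""
    scheme, host, path, query, fragment = urlsplit(url)
    scheme = scheme.lower()
    host = host.lower()                    # scheme and host are case-insensitive
    if host == PREFERRED_HOST[len("www."):]:
        host = PREFERRED_HOST              # non-www -> www
    head, _, last = path.rpartition("/")
    if last in INDEX_FILES:
        path = head + "/"                  # remove the directory index
    last = path.rpartition("/")[2]
    if last and "." not in last:
        path += "/"                        # heuristic: no extension means a directory
    if not path:
        path = "/"                         # bare domain gets a trailing slash
    return urlunsplit((scheme, host, path, query, fragment))


def redirect_for(requested_url):
    """Return (301, target) if the request is not canonical, else None."""
    target = canonicalize(requested_url)
    if target != requested_url:
        return 301, target
    return None


for url in (
    "http://www.yourhomepage.com",
    "http://www.yourhomepage.com/index.html",
    "http://yourhomepage.com",
    "http://yourhomepage.com/",
):
    print(url, "->", redirect_for(url))
```

All four home page variants listed earlier end up redirected to http://www.yourhomepage.com/, while a request for that exact address returns None and is served directly. The path is deliberately left in its original case, since lowercasing it could point to a different file on a case-sensitive server.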
 
If you have a large website, you may not be able to catch every canonicalization issue on your own. You can use a web-based tool, such as http://training.seobook.com/website-health-check, that can search your site and use date-based filter options to detect problems.
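
For a short list of internal links, a simple check can also be scripted by hand. The Python sketch below does not reproduce how the tool above works; `audit_links` and its rules are hypothetical, and it assumes that the www form is your preferred host and that every link passed in is an internal one.

```python
from urllib.parse import urlsplit


def audit_links(links, preferred_host="www.yourhomepage.com"):
    """Return {url: [issues]} for internal links that break the conventions."""
    report = {}
    for url in links:
        parts = urlsplit(url)
        issues = []
        if parts.netloc != parts.netloc.lower():
            issues.append("uppercase characters in host")
        if parts.netloc.lower() != preferred_host:
            issues.append("not the preferred host")
        if parts.path.rsplit("/", 1)[-1].startswith("index."):
            issues.append("directory index in URL")
        if parts.path == "":
            issues.append("missing trailing slash")
        if issues:
            report[url] = issues
    return report


site_links = [
    "http://www.yourhomepage.com/",
    "http://www.yourhomepage.com/index.html",
    "http://yourhomepage.com",
]
for url, issues in audit_links(site_links).items():
    print(url, issues)
```

Links that already follow the conventions are left out of the report, so an empty result means the list is consistent under these particular rules.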
 
If you want to improve your ranking with search engines, you need to provide them with clean and consistent links to your website. Be sure to perform a full canonicalization evaluation of your website so that you give search engines less to be confused by and avoid potential normalization issues.
 
Resources:
 
http://www.mattcutts.com/blog/seo-advice-url-canonicalization/
 
http://en.wikipedia.org/wiki/URL_normalization
 
http://googlewebmastercentral.blogspot.com/2007/09/google-duplicate-content-caused-by-url.html
 
http://www.drostdesigns.com/url-canonicalization-correct-your-home-page-urls/
 
http://training.seobook.com/website-health-check
 
