SEO headaches: Duplicate Content
Author:
Nick Rinylo
SEO
Yesterday’s WebProNews Video is all about duplicate content, and it got me thinking. We all know duplicate content isn’t a good thing, and this video drew my attention with the title: “How to Avoid Duplicate Content”. Shari Thurow of Omni Marketing Interactive gives some interesting and straightforward advice about how to avoid duplicate content.
The first thing that struck me was her assertion that you must ‘be proactive’ and ‘be consistent’ – this is such common sense advice that I can immediately see the merit in it. The advice to be consistent with your information architecture makes perfect sense to me too, by ensuring that you follow a logical and consistent structure to your sites you can ensure that they will not only be easier to crawl, but you can also lead the search engine to crawl the pages you want them to. This leads nicely to the third point from Thurow – “Link consistently to the same URLs” – if you link to the shortest, most keyword rich URL then you are giving the search engines the best URL and don’t forget this is also the one your visitors will want too.
You should always send consistent messages to the search engines regarding which pages they should index, because they will undoubtedly choose the wrong file to show in the SERPs.
Thurow makes the point that one could upload several files with almost identical content, but with the use of CSS these pages can look totally different to one another, (CSS Zen Garden is a prime example of this), a human visitor can look at the pages and see that they look different, but the search engine only has the html code to go on. So the search engine uses the only information they have available and class the pages as duplicate content. You should make sure that each page on your website has content that is dramatically different, as this is the only way to avoid falling into the duplicate content trap.
Thurow goes on to say that the canonical tag is both a blessing and a curse. It is a curse because SEO spammers regularly abuse it. It is a blessing when used correctly, and for the purpose it was designed for, it allows users to filter CMS sites easily, which you’ll appreciate is a blessing in itself.
Takeaway
So, to avoid duplicate content, the very best thing you can do is apply your common sense, look at your site logically and be consistent and proactive. Give the search engines as much information and help as you can and they will index the pages you want them to index. At all costs, avoid giving the search engines conflicting information though, as this will only result in disaster.

