APLawrence.com -  Resources for Unix and Linux Systems, Bloggers and the self-employed

Canonical tag for duplicate content

Google, Yahoo and Microsoft Live recently announced support for a new "canonical" tag that their spiders will recognize. It's purpose is simple: when you have deliberately created duplicate content, this tag tells the search engine where the "canonical" content lives - that is, the page you'd prefer they pay attention to for listing results.

As an example, back in the days when tracking SCO Unix stuff was important to me, I created a "Bofcusm" directory and would put copies of relevant and interesting newsgroups posts there. However, there was one poster, "Bela", who made a lot of useful posts, so I duplicated his posts to /Bela. People who only wanted to read his stuff could find it all there but obviously this is dreaded "duplicate content".

The new tag solves that. In the Bela articles, I can tell the search engines that they should look to /Bofcusm for the "real" content. If the Bela article is 1735.html, the tag looks like this:

<link rel="canonical" href="//news-posts.aplawrence.com/1735.html" />

I also did the same thing with the pages generated by the "Printer Friendly" links. That shouldn't be necessary as my robots.txt tells spiders not to go to those pages anyway, but who knows if this will help some other thing at some other time.. I figure it can't hurt.

Now I just need to figure out if there are any other conditions where I've duped content, on purpose or otherwise.

Got something to add? Send me email.

(OLDER)    <- More Stuff -> (NEWER)    (NEWEST)   

Printer Friendly Version

-> -> Canonical tag for duplicate content

1 comment

Increase ad revenue 50-250% with Ezoic

More Articles by

Find me on Google+

© Anthony Lawrence

Tue Apr 7 17:02:54 2009: 6040   TonyLawrence

And I forgot to add:

I'm looking forward to the day when those articles are only of historical interest..

Kerio Samepage

Have you tried Searching this site?

Support Rates

This is a Unix/Linux resource website. It contains technical articles about Unix, Linux and general computing related subjects, opinion, news, help files, how-to's, tutorials and more.

Contact us

Getting information off the Internet is like taking a drink from a fire hydrant. (Mitchell Kapor)

This post tagged: