I hate finding 404's in my web logs. There was a time when my logs were full of them. This was from carelessness and forgetfulness on my part as well as from simple typing mistakes by both me and others. I've cleaned up most of my own mistakes and done links or redirects for the mistakes of others. This has cut my error log down to about a third of 1% - that is, there's one entry in my error log for every 300 entries in the access log. Most of those are script-kiddie attacks: people looking for exploitable php or cgi scripts.
The tool that has been most helpful to me in cleaning up the real errors has been the Web Crawl report at Google's Webmaster Tools. They have offered the "not found" report for some time, but recently they added info about the source page - the page that led them to the 404 on your site. This makes all the difference in the world: if the offending page is on your own site, you can immediately fix it. If it's someone else's page, you can decide whether to put in a link or a redirect to handle it.
While the relative importance of inbound links for SEO has decreased, you certainly don't want to throw away whatever value a link could give you. As Matt Cutts said at Free links to your site:
If you found something useful today, please consider a small donation.
Got something to add? Send me email.
More Articles by Anthony Lawrence © 2009-11-07 Anthony Lawrence
We are questioning more than the philosophy behind our dependence upon limited and limiting systems. We question the power structures that have grown up around such systems. (Frank Herbert)