# # Word HTML Cleanup
APLawrence.com -  Resources for Unix and Linux Systems, Bloggers and the self-employed

Word HTML Cleanup

I've removed advertising from most of this site and will eventually clean up the few pages where it remains.

While not terribly expensive to maintain, this does cost me something. If I don't get enough donations to cover that expense, I will be shutting the site down in early 2020.

If you found something useful today, please consider a small donation.



Some material is very old and may be incorrect today

© December 2004 Tony Lawrence

Referencing: http://textism.com/wordcleaner/

Boy do I hate it when somebody sends me an HTML document created by Microsoft Word. It is bloated junk, and a major pain to publish in a format that is useful here. But that's the Microsoft Way: eschew simplicity, embrace complexity, excelsior!

I usually just refuse such offerings unless the content is so good that I just can't bring myself to turn it down. If I do take it, I fuss and fume and annoy my wife by complaining about "Microsoft idiots". Was I referring to the author or the corporation? Maybe both.

I haven't test driven this tool, and who knows if it will still be there when I need it, but if you publish web pages and your contributors are wont to submit in this abominable manner, maybe you can use this.


If you found something useful today, please consider a small donation.



Got something to add? Send me email.





(OLDER)    <- More Stuff -> (NEWER)    (NEWEST)   

Printer Friendly Version

->
-> Word HTML Cleanup


Inexpensive and informative Apple related e-books:

Sierra: A Take Control Crash Course

Take control of Apple TV, Second Edition

Are Your Bits Flipped?

Take Control of Pages

Take Control of Apple Mail, Third Edition





More Articles by © Tony Lawrence




---December 4, 2004

Next time, tell them to simply download openoffice.org, open their .doc file in that and then use that program's facilities to convert to html.

OO.org produces much nicer code. Doesn't do a perfect job, but it does it better then MS Office does!

--Drag


---January 2, 2005
The XML output is much cleaner, as you would just need a SAX parser to strip out the excess namespaces, which are built cleanly.

2 cents.




Printer Friendly Version

Have you tried Searching this site?

This is a Unix/Linux resource website. It contains technical articles about Unix, Linux and general computing related subjects, opinion, news, help files, how-to's, tutorials and more.

Contact us


Printer Friendly Version





You learn about life by the accidents you have, over and over again, and your father is always in your head when that stuff happens. (Kurt Vonnegut)




Linux posts

Troubleshooting posts


This post tagged:

Blog

Microsoft

Web/HTML



Unix/Linux Consultants

Skills Tests

Unix/Linux Book Reviews

My Unix/Linux Troubleshooting Book

This site runs on Linode