APLawrence.com -  Resources for Unix and Linux Systems, Bloggers and the self-employed

Word HTML Cleanup

© December 2004 Tony Lawrence

Referencing: https://textism.com/wordcleaner/

Boy do I hate it when somebody sends me an HTML document created by Microsoft Word. It is bloated junk, and a major pain to publish in a format that is useful here. But that's the Microsoft Way: eschew simplicity, embrace complexity, excelsior!

I usually just refuse such offerings unless the content is so good that I just can't bring myself to turn it down. If I do take it, I fuss and fume and annoy my wife by complaining about "Microsoft idiots". Was I referring to the author or the corporation? Maybe both.

I haven't test driven this tool, and who knows if it will still be there when I need it, but if you publish web pages and your contributors are wont to submit in this abominable manner, maybe you can use this.

Got something to add? Send me email.

(OLDER)    <- More Stuff -> (NEWER)    (NEWEST)   

Printer Friendly Version

-> Word HTML Cleanup

Inexpensive and informative Apple related e-books:

Sierra: A Take Control Crash Course

Take control of Apple TV, Second Edition

Are Your Bits Flipped?

Take Control of Pages

Take Control of Apple Mail, Third Edition

More Articles by © Tony Lawrence

---December 4, 2004

Next time, tell them to simply download openoffice.org, open their .doc file in that and then use that program's facilities to convert to html.

OO.org produces much nicer code. Doesn't do a perfect job, but it does it better then MS Office does!


---January 2, 2005
The XML output is much cleaner, as you would just need a SAX parser to strip out the excess namespaces, which are built cleanly.

2 cents.

Printer Friendly Version

Have you tried Searching this site?

This is a Unix/Linux resource website. It contains technical articles about Unix, Linux and general computing related subjects, opinion, news, help files, how-to's, tutorials and more.

Contact us

Printer Friendly Version

You learn about life by the accidents you have, over and over again, and your father is always in your head when that stuff happens. (Kurt Vonnegut)

Linux posts

Troubleshooting posts

This post tagged:




Unix/Linux Consultants

Skills Tests

Unix/Linux Book Reviews

My Unix/Linux Troubleshooting Book

This site runs on Linode