Advertisement
If you have a new account but are having problems posting or verifying your account, please email us on hello@boards.ie for help. Thanks :)
Hello all! Please ensure that you are posting a new thread or question in the appropriate forum. The Feedback forum is overwhelmed with questions that are having to be moved elsewhere. If you need help to verify your account contact hello@boards.ie
Hi there,
There is an issue with role permissions that is being worked on at the moment.
If you are having trouble with access or permissions on regional forums please post here to get access: https://www.boards.ie/discussion/2058365403/you-do-not-have-permission-for-that#latest

Best Word HTML Cleaner?

  • 17-01-2006 4:56pm
    #1
    Registered Users, Registered Users 2 Posts: 55,571 ✭✭✭✭


    Everyone knows that Microsoft Word produces junk HTML if you save a doc as HTML (usually tripling the size of the file in the process).

    What do people recommend for stripping the junk out of it?

    I googled, and there are lots of suggestions, but I want to know what you use.

    BTW, ideally the solution would be free... (I use dreamweaver myself, but the person who needs it doesn't have access to DW)


Comments

  • Moderators, Recreation & Hobbies Moderators, Science, Health & Environment Moderators, Technology & Internet Moderators Posts: 93,567 Mod ✭✭✭✭Capt'n Midnight


    Would be better to not garbage it up first.

    OpenOffice does some HTML, I haven't tried publishing a word document with it but it can't be worse.

    EDIT - Picked first doc google found on Microsoft.COM
    saved as HTML by OO
    http://research.microsoft.com/users/marycz/el.doc


  • Registered Users, Registered Users 2 Posts: 19,396 ✭✭✭✭Karoma


    I haven't tested but I would imagine HTML Tidy (OSS) would do a very good job..


  • Registered Users, Registered Users 2 Posts: 469 ✭✭thetourist


    my html kit ( which is an awsome - and free- program in the first place ) has in the actions-> tools menu "strip surplus tags from word 2000 pages" -- my version is 5.1 - i have used it in the past and as far as i can remember it works ok

    http://www.chami.com/html-kit/


  • Registered Users, Registered Users 2 Posts: 469 ✭✭thetourist


    Would be better to not garbage it up first.

    OpenOffice does some HTML, I haven't tried publishing a word document with it but it can't be worse.

    EDIT - Picked first doc google found on Microsoft.COM
    saved as HTML by OO
    http://research.microsoft.com/users/marycz/el.doc

    very interesting

    had a look at the source - there's no where near the ammount of crap ms spits out


  • Closed Accounts Posts: 12,382 ✭✭✭✭AARRRGH


    Don't forget http://validator.w3.org

    Lovely for cleaning up your HTML.


  • Advertisement
Advertisement