(Msg. 1) Posted: Tue Aug 18, 2009 6:05 pm
Post subject: WordprocessingML (.docx) to HTML converter Archived from groups: microsoft>public>word>conversions (more info?)
I want to convert Word 2007 (.docx) files
containing OMML equations into clean well-formed XHTML format.
Equations should be converted into GIF image format, or
failing that, either MathML or TeX (I'm not sure which is
best).
Are there any tools to help me do so, other than
the "Save As..." feature in Word itself, which
generates dirty HTML format with lumps of XML
thrown in (inside comment delimiters).
So far, I have come across the following potential
candidates
2. DocBook XSLT stylesheets
(the roundtrip component supports conversion of WordML files
exported from Word 2003, not Word 2007 files or subfiles (document.xml)
cf. http://docbook.sourceforge.net/
3. Logictran or UpCast
These tools convert Word documents first into
RTF, then convert RTF into XML.
If anyone knows of anything else, especially something
that converts into XML without the intermediate RTF format,
I would be interested to find out about them.
All times are: Eastern Time (US & Canada) (change)
Page 1 of 1
You can post new topics in this forum You can reply to topics in this forum You cannot edit your posts in this forum You cannot delete your posts in this forum You cannot vote in polls in this forum