freelanceprogrammers.org Forum Index » XML / XSL
Word 2002 to XML
Joined: 14 Sep 2005
Posts: 2
Word 2002 to XML
Hi,
I`ll be grateful if people in this forum could give me some tips on how to
export content as XML in Word 2002. I`m aware that Microsoft has come out
with Word 2003 which provides XML export feature out of the box. However, I
would like to know how we can achieve this in Word 2002.
I have a nested table structure (actually microchip register data) which I
want to export as XML and then transform the XML data via XSLT routines.
I`ve googled for this information and found that some commercial solutions,
for example the xDoc Converter <http://www.cambridgedocs.com/> transforms
doc format to XML. I`m sure there are some more commercial solutions around.
But what I want to know is if anybody has had any luck customizing Word to
wrap content in XML tags. I`m also interested in knowing if anyone knows how
to add new hidden fields in Word that can be potentially used for XML
markup.
Thanks,
Alok
[Non-text portions of this message have been removed]
Joined: 13 Jun 2003
Posts: 39
Word 2002 to XML
> I`ll be grateful if people in this forum could give me some tips on
> how to export content as XML in Word 2002. ...
I`ve had to do something similar, although not with nested tables. I
used OpenOffice to open the file and save it as HTML (Word`s "HTML"
output is too dirty to use without a lot of busy work). After running
the result through HTML Tidy, I had reasonably clean HTML that I
could use directly or transform to the target DTD. I would recommend
this method for a one-time conversion.
OpenOffice also has plug-ins to export directly to the DTD of choice --
DocBook is supported out of the box, and Sebastian Rahtz modified it
to output TEI. This might be the way to go if you have to maintain the
document in Word (or OpenOffice, preferably) and export it as needed.
Finally, there are tools to convert RTF to XML -- Paul Trembley
maintains one at http://rtf2xml.sourceforge.net/ (Google will point
you to some others)
> But what I want to know is if anybody has had any luck customizing
> Word to wrap content in XML tags. I`m also interested in knowing
> if anyone knows how to add new hidden fields in Word that can be
> potentially used for XML markup.
You could write a script to wrap tags around each paragraph and
character style; I did that back in the Word 6.0 days to create
useable HTML. (Word has always had a problem exporting decent
HTML, it seems.) Using fields as an aid is an interesting idea
I`d never thought of; the Placeholder field might be what you
want. Bookmarks could be another useful tool as well.
--
Larry Kollar, Senior Technical Writer, ARRIS CPE Products
"Content creators are the engine that drives
value in the information life cycle."
-- Barry Schaeffer, on XML-Doc
Joined: 02 May 2005
Posts: 10
Word 2002 to XML
This link might be helpful...
http://msdn.microsoft.com/office/understanding/word/codesamples/default.aspx?pul
l=/library/en-us/dnword2k/html/odc_expwordtoxml.asp
--- Alok Narula <tuxwarrior@...> wrote:
> Hi,
>
> I`ll be grateful if people in this forum could give
> me some tips on how to
> export content as XML in Word 2002. I`m aware that
> Microsoft has come out
> with Word 2003 which provides XML export feature out
> of the box. However, I
> would like to know how we can achieve this in Word
> 2002.
>
> I have a nested table structure (actually microchip
> register data) which I
> want to export as XML and then transform the XML
> data via XSLT routines.
> I`ve googled for this information and found that
> some commercial solutions,
> for example the xDoc Converter
> <http://www.cambridgedocs.com/> transforms
> doc format to XML. I`m sure there are some more
> commercial solutions around.
>
> But what I want to know is if anybody has had any
> luck customizing Word to
> wrap content in XML tags. I`m also interested in
> knowing if anyone knows how
> to add new hidden fields in Word that can be
> potentially used for XML
> markup.
>
> Thanks,
> Alok
>
>
> [Non-text portions of this message have been
> removed]
>
>
>
>
> Yahoo! Groups Links
>
>
> xml-doc-unsubscribe@yahoogroups.com
>
>
>
>
>
>
__________________________________
Yahoo! Mail - PC Magazine Editors` Choice 2005
http://mail.yahoo.com
All times are GMT
Page 1 of 1
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum
Freelace Website Designer - Customer web design and software building.
Booking Calendar - reservation calendar script
Land Surveying - total station instruments and equipments
China Wholesale - Electronics Products
Character Studio - Tutorials and Help
Booking Calendar - reservation calendar script
Land Surveying - total station instruments and equipments
China Wholesale - Electronics Products
Character Studio - Tutorials and Help







