
Hello,

Version 9.3, which will be released in a couple of weeks, will include an Archive Browser view that can open and browse Word and Excel documents saved in XML format, that is, .docx and .xlsx files. In the Archive Browser view, the files contained in such a Word or Excel document can be opened and edited in Oxygen, so migrating the data to a DITA document will be easy: just apply an XSLT stylesheet to the XML file containing the data that must be imported.

To import from an Excel document saved in the old format (.xls file), try the action for importing MS Excel files from the File -> Import menu. The action creates an XML file with a configurable structure. You can merge this file into a DITA document with a custom XSLT stylesheet that you write.

The DITA Open Toolkit includes a stylesheet for converting an HTML document to a DITA one. It is called h2d.xsl and is located in [Oxygen-folder]/frameworks/dita/DITA-OT/demo/h2d. Just create a transformation scenario with this stylesheet and apply it to your HTML documents. The type of the DITA result is configurable with a stylesheet parameter: concept, reference, task, or topic. The DITA result may contain <required-cleanup> elements, which you should edit manually. You can find more details at: http://dita-ot.sourceforge.net/doc/DITA-h2d.html

Regards,
Sorin

Seraphim Larsen wrote:
Some XML authoring tools can copy-and-paste from Word / Excel / HTML and intelligently re-tag into DITA.
Oxygen doesn't seem to be able to do that yet. Can you add it to your list of feature requests?
Thanks very much!
Seraphim
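
For anyone who wants to see what is inside a .docx or .xlsx file before writing a stylesheet, here is a minimal Python sketch. It is not part of Oxygen. It only relies on the fact that these files are ZIP archives of XML parts, and that a Word document keeps its main text in word/document.xml. The function names and the in-memory sample document are hypothetical, made up for illustration only.

```python
import io
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML main namespace used by word/document.xml
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def list_xml_parts(archive):
    """Return the names of the XML parts stored in a .docx/.xlsx archive."""
    with zipfile.ZipFile(archive) as zf:
        return [name for name in zf.namelist() if name.endswith(".xml")]


def paragraph_texts(archive, part="word/document.xml"):
    """Return the text of each w:p paragraph found in the given part."""
    with zipfile.ZipFile(archive) as zf:
        root = ET.fromstring(zf.read(part))
    paragraphs = []
    for p in root.iter(f"{{{W_NS}}}p"):
        # A paragraph's text may be split across several w:t runs.
        paragraphs.append("".join(t.text or "" for t in p.iter(f"{{{W_NS}}}t")))
    return paragraphs


def build_sample_docx():
    """Build a tiny in-memory stand-in for a .docx (hypothetical sample data,
    not a complete Word document)."""
    document = (
        f'<w:document xmlns:w="{W_NS}"><w:body>'
        "<w:p><w:r><w:t>First paragraph</w:t></w:r></w:p>"
        "<w:p><w:r><w:t>Second </w:t></w:r><w:r><w:t>paragraph</w:t></w:r></w:p>"
        "</w:body></w:document>"
    )
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as zf:
        zf.writestr("[Content_Types].xml", "<Types/>")
        zf.writestr("word/document.xml", document)
    return buffer.getvalue()


if __name__ == "__main__":
    data = build_sample_docx()
    print(list_xml_parts(io.BytesIO(data)))
    print(paragraph_texts(io.BytesIO(data)))
```

With a real file you would pass its path instead of the in-memory sample, for example list_xml_parts("report.docx"), where the file name is a placeholder. The part you pick from that list is the XML file an XSLT stylesheet would then transform into DITA.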