On Jun 19, 8:14 am, Saurabh Nandu
<SaurabhNa... DeleteThis @discussions.microsoft.com> wrote:
> Hi,
>
> I am building a Content Management System that allows users to upload Word
> 2007 documents to be published. On the backend we convert the Word 2007
> document toHTMLwith Stylesheets and store it.
>
> I am using the MS Word 2007 API and calling doing a Save ASHTMLto extractHTML.
>
> The problem that I am facing is that things like Lists are not being
> converted to OL/UL tags inHTMLgenerated. Only P tags are generated with
> styles.
>
> How I can generate correctHTMLtags?
>
> I was reading in the articles on MSDN that they have a publish to Blog
> feature where the claim to generate correctHTML(I have not tried it yet)
> but if Word 2007 can generate validHTMLwith CSS for the blogging feature,
> why does it not do the same when it's API's are used? Any help ..direction
> will be helpful.
>
> I have looked at couple of 3rd Libraries - they too do not convertHTML
> correctly.
>
> PS: I have tried searching this NG using the web interface but it did not
> provide any results, so I am asking this question again.
>
> Regards,
> Saurabh
I too am curious about the MSWord to HTML conversion but from a
different angle. You see I don't have or use a current version of
MSWord but my customers do. So I need to know what a typical MSWord-
produced html file would look like. For example, does it retain the
paper size if the document is e.g., 6 x 9?
John Culleton