Convert Word to Correct HTML

 
  

Saurabh Nandu
External


Since: Jun 19, 2007
Posts: 1



Posted: Tue Jun 19, 2007 5:14 am    Post subject: Convert Word to Correct HTML
Archived from groups: microsoft>public>word>conversions

Hi,

I am building a Content Management System that allows users to upload Word
2007 documents to be published. On the backend we convert the Word 2007
document to HTML with Stylesheets and store it.

I am using the MS Word 2007 API and calling Save As HTML to extract the
HTML.

The problem I am facing is that things like lists are not converted to
OL/UL tags in the generated HTML. Only P tags with styles are generated.

How can I generate correct HTML tags?

I read in the articles on MSDN that Word 2007 has a Publish to Blog
feature that claims to generate correct HTML (I have not tried it yet).
If Word 2007 can generate valid HTML with CSS for the blogging feature,
why does it not do the same when its APIs are used? Any help or direction
would be appreciated.

I have looked at a couple of third-party libraries; they too do not convert
to HTML correctly.

PS: I have tried searching this NG using the web interface but it did not
provide any results, so I am asking this question again.

Regards,
Saurabh
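
One possible workaround for the list problem described above is to post-process the saved HTML. The following is only a minimal sketch, not a Word feature. It assumes that Word marks list items as paragraphs whose class starts with MsoListParagraph and wraps the visible bullet or number in a `<![if !supportLists]>` ... `<![endif]>` block; the filtered HTML format may not contain these markers. The function name `word_lists_to_html` and the sample markup are made up for illustration.

```python
import re
from html import unescape

# Assumption: list items are <p class=MsoListParagraph...> and the visible
# bullet or number sits inside <![if !supportLists]> ... <![endif]>.
PARA = re.compile(r"<p\b([^>]*)>(.*?)</p>", re.S | re.I)
MARKER = re.compile(r"<!\[if !supportLists\]>(.*?)<!\[endif\]>", re.S)
TAG = re.compile(r"<[^>]+>")
NUMBERED = re.compile(r"[0-9A-Za-z]+[.)]$")  # "1.", "a)", "iv." and similar


def word_lists_to_html(html):
    """Rewrite runs of Word list paragraphs as <ul>/<ol> blocks (hypothetical helper)."""
    out = []
    current = None  # tag name of the list that is currently open, if any
    pos = 0
    for m in PARA.finditer(html):
        gap = html[pos:m.start()]  # text between the previous paragraph and this one
        pos = m.end()
        if "MsoListParagraph" not in m.group(1):
            # Ordinary paragraph: close any open list and copy it through unchanged.
            if current:
                out.append(f"</{current}>\n")
                current = None
            out.append(gap + m.group(0))
            continue
        marker = MARKER.search(m.group(2))
        marker_text = unescape(TAG.sub("", marker.group(1))).strip() if marker else ""
        kind = "ol" if NUMBERED.match(marker_text) else "ul"
        text = MARKER.sub("", m.group(2)).strip()
        # Start a new list when the kind changes or other content intervenes.
        if current != kind or gap.strip():
            if current:
                out.append(f"</{current}>\n")
            out.append(gap)
            out.append(f"<{kind}>\n")
            current = kind
        out.append(f"<li>{text}</li>\n")
    if current:
        out.append(f"</{current}>\n")
    out.append(html[pos:])
    return "".join(out)


# Illustrative input only; real Word output carries many more attributes.
SAMPLE = (
    "<p class=MsoNormal>Intro</p>\n"
    "<p class=MsoListParagraphCxSpFirst style='mso-list:l0 level1 lfo1'>"
    "<![if !supportLists]><span style='mso-list:Ignore'>1.&nbsp;&nbsp;</span>"
    "<![endif]>First</p>\n"
    "<p class=MsoListParagraphCxSpLast style='mso-list:l0 level1 lfo1'>"
    "<![if !supportLists]><span style='mso-list:Ignore'>2.&nbsp;&nbsp;</span>"
    "<![endif]>Second</p>\n"
)

converted = word_lists_to_html(SAMPLE)
print(converted)
```

The sketch ignores nested list levels and guesses ordered versus unordered from the marker text alone, so it would need extending before use on real documents.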
wexfordpress
External


Since: Jul 27, 2007
Posts: 1



Posted: Fri Jul 27, 2007 7:30 am    Post subject: Re: Convert Word to Correct HTML
Archived from groups: per prev. post

On Jun 19, 8:14 am, Saurabh Nandu
<SaurabhNa... DeleteThis @discussions.microsoft.com> wrote:
> Hi,
>
> I am building a Content Management System that allows users to upload Word
> 2007 documents to be published. On the backend we convert the Word 2007
> document to HTML with Stylesheets and store it.
>
> I am using the MS Word 2007 API and calling doing a Save AS HTML to extract HTML.
>
> The problem that I am facing is that things like Lists are not being
> converted to OL/UL tags in HTML generated. Only P tags are generated with
> styles.
>
> How I can generate correct HTML tags?
>
> I was reading in the articles on MSDN that they have a publish to Blog
> feature where the claim to generate correct HTML (I have not tried it yet)
> but if Word 2007 can generate valid HTML with CSS for the blogging feature,
> why does it not do the same when it's API's are used? Any help ..direction
> will be helpful.
>
> I have looked at couple of 3rd Libraries - they too do not convert HTML
> correctly.
>
> PS: I have tried searching this NG using the web interface but it did not
> provide any results, so I am asking this question again.
>
> Regards,
> Saurabh

I too am curious about the MSWord to HTML conversion, but from a
different angle. I don't have or use a current version of MSWord, but
my customers do, so I need to know what a typical MSWord-produced
HTML file looks like. For example, does it retain the paper size if
the document is, say, 6 x 9?

John Culleton
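
On the paper-size question above, one way to check a given file is to look for a CSS @page rule in its style block. The sketch below assumes the exported HTML carries the page dimensions in a rule such as `@page Section1 {size:6.0in 9.0in; ...}`; that is an assumption about Word's output, not something confirmed in this thread, and the function name `page_sizes` and the sample are invented for illustration. Note that an @page rule applies to printing rather than to on-screen layout.

```python
import re

# Assumption: the page size appears as "size:<width> <height>" inside an @page rule.
PAGE_RULE = re.compile(r"@page\b[^{]*\{([^}]*)\}", re.S)
SIZE = re.compile(r"size\s*:\s*([\d.]+)(in|cm|mm|pt)\s+([\d.]+)(in|cm|mm|pt)")


def page_sizes(html):
    """Return (width, unit, height, unit) for every @page rule that sets a size."""
    sizes = []
    for rule in PAGE_RULE.finditer(html):
        m = SIZE.search(rule.group(1))
        if m:
            sizes.append((float(m.group(1)), m.group(2), float(m.group(3)), m.group(4)))
    return sizes


# Illustrative style block only; not copied from a real Word export.
SAMPLE_STYLE = (
    "<style>\n"
    "@page Section1\n"
    "\t{size:6.0in 9.0in;\n"
    "\tmargin:.75in .75in .75in .75in;}\n"
    "div.Section1\n"
    "\t{page:Section1;}\n"
    "</style>"
)

found = page_sizes(SAMPLE_STYLE)
print(found)
```

An empty result would mean the file either has no @page rule or states the size in a form this pattern does not cover.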
All times are Eastern Time (US & Canada).