|
|
Author: * Sankira Qin -
3 Posts
on this thread out of
1,340 Posts
sitewide.
Date: Jun 9, 2005 - 10:29
Ladee, there is a DTD for HTML in XML syntax. What you're seeing in a Word document converted to HTML is an implementation of that. All that information is there because Microsoft uses XML to retain what they call "round-trip" functionality in the page. If their customer is using Word as their html editor, it needs that XML information to be able to convert back to Word format for subsequent editing. (Remember I mentioned portability and incompatible systems? This is an example of that.) So, that's what all that XML stuff is for.
If you create that same page and never again intend to edit the html in Word, try saving as "Web Page Filtered" in Word 2002 or higher. If you're on an earlier version of Word, you can download a web page filter from Microsoft. Filtering the page while converting it strips out all the XML information, so all you see in your code is standard HTML and an embedded stylesheet...and a much shorter one than you see in the XML version.
|
|