![]() |
![]() |
![]() |
Equivalent of mshtml for Linux? Is there any equivalent of mshtml for Linux? Preferably, I should be able to use it from Python?
Banana Man
Gnome's libxml might help you.
ice
You can do this with a combination of TidyLib (to clean up non-well formed HTML and convert to XHTML) and libxml2. TidyLib itself has uses a DOM-like structure as well, so you may not need libxml2, depending on what you're trying to do.
joev
There's also Tag Soup ( http://mercury.ccil.org/~cowan/XML/tagsoup/ ) and NekoHTML ( http://www.apache.org/~andyc/neko/doc/html/ )
matt
There are many native Python libraries you can use, starting with thebuilt-in SGMLParser and HTMLParser modules. You might also take a look at Beautiful Soup:
Abe Fettig
If I recall correctly, KHTML (Konqueror and Safari's rendering engine) has Python bindings.
Simon Perreault
Can you use MSHTML to parse a document without hosting IE?
|