The Chilkat HTML-to-XML component is designed for the purpose of transforming HTML into well-formed XML for parsing. If effect, it is designed to be an HTML parser / scraper. Once HTML is converted to XHTML (i.e. well-formed XML), the plethora of existing XML parsing components and libraries can be leveraged for HTML parsing and scraping. Also includes HTML to plain-text conversion. The internal conversion process is much more sophisticated than can be accomplished with the simple regular-expression freeware codes found in the Internet. It is more than simply removing HTML tags from an HTML document.
|License||Free to try|
|File Size||790.5 kB|
|Operating System||Windows XP Windows Windows 2000|