Jericho HTML Parser

Jericho HTML Parser is a Java library for analysing and manipulating parts of an HTML document, including server-side tags, while reproducing any unrecognised or invalid HTML verbatim. The parser explicitly recognises ASP, JSP, PSP, PHP and Mason server tags, so ordinary HTML is still parsed correctly even when server tags appear inside it, as is common when element attributes are set dynamically. Built-in functionality extracts all text from HTML markup, renders HTML markup with simple text formatting, formats HTML source code by indenting elements according to their depth in the document element hierarchy, and compacts HTML source code by removing all unnecessary white space.
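To illustrate why recognising server tags matters when extracting text, here is a minimal sketch in plain Java. It does not use the Jericho API and is not how the library is implemented; the class name ServerTagSketch, both method names, and the regular expressions are assumptions made purely for illustration, and the patterns cover only `<% ... %>` and `<? ... ?>` style blocks.

```java
import java.util.regex.Pattern;

// Hypothetical illustration only: not part of Jericho and far simpler than a real parser.
class ServerTagSketch {
    // Assumed patterns: ASP/JSP-style <% ... %> and PHP-style <? ... ?> blocks.
    private static final Pattern SERVER_TAG =
            Pattern.compile("<%.*?%>|<\\?.*?\\?>", Pattern.DOTALL);
    // Naive HTML tag pattern: everything from '<' to the next '>'.
    private static final Pattern HTML_TAG = Pattern.compile("<[^>]*>");
    private static final Pattern WHITESPACE = Pattern.compile("\\s+");

    // Removes server tags first, so a '>' inside them cannot end an HTML tag early.
    static String extractText(String html) {
        String withoutServerTags = SERVER_TAG.matcher(html).replaceAll("");
        return stripTags(withoutServerTags);
    }

    // Strips tags without any knowledge of server tags, for comparison.
    static String naiveExtractText(String html) {
        return stripTags(html);
    }

    // Replaces each tag with a space, then collapses runs of white space.
    private static String stripTags(String html) {
        String withoutTags = HTML_TAG.matcher(html).replaceAll(" ");
        return WHITESPACE.matcher(withoutTags).replaceAll(" ").trim();
    }

    public static void main(String[] args) {
        // An attribute set dynamically by a JSP/ASP-style expression.
        String html = "<p class=\"<%= cssClass %>\">Hello <b>world</b></p>";
        System.out.println(extractText(html));
        System.out.println(naiveExtractText(html));
    }
}
```

In this sketch the naive version treats the `>` that closes the server tag as the end of the `<p>` tag, so part of the attribute leaks into the extracted text, whereas the server-tag-aware version returns only the visible text.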
Price USD 0
License Free
File Size 2.77 MB
Version 3.3
Operating System Windows 2000, Windows Vista, Windows, Windows 7, Windows XP
System Requirements None