Emsa HTML Tag Remover is a software utility that removes HTML tags from an HTML file, with extra control over how the HTML and the whitespace are removed. It provides several options for removing different types of data from the HTML page, and it can also strip whitespace, condensing the resulting text output as needed. It works in both interactive mode and command line mode, which is useful for users who want to call this functionality from other programs or batch files.
When the program is run for the first time, all options are checked by default. This is the most thorough removal setting, and the result is a single line of text with no carriage returns. We recommend taking the time to experiment with the options to see the effect of each setting, until the output looks the way you want it.
Users perform the HTML removal by selecting an input HTML file, selecting or typing an output file (usually a text file), and then clicking the 'remove html' button. Please also view the command line mode below.
Some advanced functions:
Remove all tags strips all remaining HTML data enclosed between < and > characters.
Generate foreign & special characters renders foreign and special HTML sequences as their corresponding characters. For example, the '&pound;' sequence will be rendered as the '£' pound character, the '&gt;' sequence will be rendered as the '>' character, and so on.
Remove spaces trims all unnecessary spaces from the output, so there is at most one space between any two words.
Remove blank lines cuts out all unneeded carriage returns and line feeds, so that all lines containing text follow one another with no blank lines in between.
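The four functions above can be approximated in a few lines of Python. This is only an illustrative sketch of the same ideas, not the program's actual implementation: the function names are hypothetical, and the tag pattern is deliberately naive (it assumes no '>' character appears inside attribute values or script content).

```python
import html
import re


def remove_all_tags(text: str) -> str:
    # Drop everything enclosed between '<' and '>' (naive pattern, see assumptions above).
    return re.sub(r"<[^>]*>", "", text)


def generate_special_characters(text: str) -> str:
    # Turn sequences such as '&pound;' or '&gt;' into the characters they stand for.
    return html.unescape(text)


def remove_spaces(text: str) -> str:
    # Collapse runs of spaces and tabs to one space and trim each line.
    lines = [re.sub(r"[ \t]+", " ", line).strip() for line in text.splitlines()]
    return "\n".join(lines)


def remove_blank_lines(text: str) -> str:
    # Keep only lines that still contain some text.
    return "\n".join(line for line in text.splitlines() if line.strip())


def html_to_text(source: str) -> str:
    # Tags are stripped before entities are decoded, so that a decoded
    # '&lt;b&gt;' is kept as literal text instead of being removed as a tag.
    text = remove_all_tags(source)
    text = generate_special_characters(text)
    text = remove_spaces(text)
    return remove_blank_lines(text)


if __name__ == "__main__":
    sample = "<p>Price:   &pound;5 &gt; &pound;3</p>\n\n\n<p>Second   line</p>"
    print(html_to_text(sample))
```

Each step is a separate function so that any one of them can be left out, in the same spirit as unchecking an individual option in the program.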
|File Size||209.96 kB|
|System Requirements||Windows 95/98/ME/NT4/2000/XP/2003|