This software scans an MS Word docx file or a text file (including HTML and XML files) with text encoded via ANSI or UTF-8 and counts the frequencies of different words. The words which are found and displayed can be ordered alphabetically or by frequency. Characters which can appear in words can be specified, so the program can be told to allow or disallow words with numerals, hyphens, apostrophes, underscores or colons, to ignore words which are short or which occur infrequently, to treat upper/lower case as significant or not, and to ignore words (e.g., common words such as 'this') contained in a specified file. This software may be used with text in languages other than English, in particular, with French, German, Italian and Spanish text. Language of the text is detected automatically and the corresponding 'common words' file (with words to be ignored) is optionally loaded. Results can be written to an output file and can then be read into Excel for further processing.
||Free to try
Windows Server 2008