PageScrape is a command-line utility for screen scraping specified data from a Web page. For example, you could use it to retrieve your company's current stock price from your favorite stock quote page. To tell PageScrape what to scrape, the user supplies two parameters: a target URL (with optional HTTP GET request parameters) and a regular expression. A regular expression is a powerful and fairly standard way to express textual search criteria. PageScrape connects to the Web server, submits a GET request, and waits for the resulting Web page (an HTML text stream). As the page arrives, PageScrape searches it with the supplied regular expression. If a match occurs, the matched data is written to the output and the page download stops, because the screen scrape is complete.
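The fetch-and-match flow described above can be sketched in Python. This is only an illustration of the technique, not PageScrape's actual implementation or command-line syntax: the function names `scrape_chunks` and `read_page`, the chunk size, and the URL in the usage comment are all assumptions made for the example.

```python
import codecs
import re
import urllib.parse
import urllib.request
from typing import Iterable, Iterator, Optional


def scrape_chunks(chunks: Iterable[str], pattern: str) -> Optional[str]:
    """Search text as it arrives and stop at the first regex match.

    Hypothetical helper: returns the first capture group if the pattern
    has one, otherwise the whole match, or None if nothing matched.
    """
    regex = re.compile(pattern, re.DOTALL)
    buffer = ""
    for chunk in chunks:
        buffer += chunk
        match = regex.search(buffer)
        if match:
            # Returning here means no further chunks are requested.
            return match.group(1) if regex.groups else match.group(0)
    return None


def read_page(url: str, params: Optional[dict] = None,
              chunk_size: int = 4096) -> Iterator[str]:
    """Send a GET request and yield the response body piece by piece."""
    if params:
        url = url + "?" + urllib.parse.urlencode(params)
    with urllib.request.urlopen(url) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        # An incremental decoder copes with characters split across chunks.
        decoder = codecs.getincrementaldecoder(charset)(errors="replace")
        while True:
            data = response.read(chunk_size)
            if not data:
                break
            yield decoder.decode(data)


# Example call (placeholder URL and parameter, not a real quote service):
# price = scrape_chunks(
#     read_page("http://example.com/quote", {"symbol": "ACME"}),
#     r"Last price:\s*([\d.]+)",
# )
```

One caveat of matching against a partial page: a pattern that ends in an open-ended quantifier (such as `\d+`) can match before the rest of the value has arrived, so in a sketch like this it helps to anchor the pattern on text that follows the value, for example a closing tag.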
|File Size||157.41 kB|
|Operating System||Windows XP, Windows 2000|
|System Requirements||Windows 2000/XP|