The Fusion PDF Image Extractor has two purposes:
1. To extract all of the individual images from a PDF (to gather the images from brochures etc) (limited to JPG images so far)
2. To extract all of the pages of a PDF as JPEG image representations of the original page
We have released a zip file containing all of the program files and the source code to do with as you please. We have also released a windows installation image for anyone not comfortable handling zip files. The product is Open Source under the GPL license.
We created the PDF Image Extractor after a painful experience trying to extract scanned images from a PDF financial report so that they could be put through an OCR process and then translated using Google docs. We therefore decided to bundle up the work we did and the small product we wrote as open source and release to all. It is written using C# and if you want to use the code it will work with the Visual Studio C# Express product which is free from Microsoft. Just add the included iTextSharp library to your project.
|File Size||10.59 MB|
|System Requirements||Microsoft .NET Framework 2.0|