File Juicer

File Juicer for Mac OS X

Overview of Formats

Search & Extract

Images

jpg jpeg 2000 gif png pdf wmf emf tiff eps pict bmp

Video

mov mpeg avi wmv

Sound

mp3 wav System 7 au aiff

Text

ascii rtf html

From:

avi cab cache chm dmg doc emlx exe ithmb m4p mht mp3 pdf pps ppt raw swf wps xls zip and other formats


Download File Juicer 4.8.4


Drop Files into DoubleTake

Try DoubleTake

Images and Text in PDF files


PDF files can contain any type of file, just as an email can have attachments. File Juicer can extract most of the bitmap images.
File Juicer can also extract PDF which are embedded in other files.

JPEG

Compressed images are stored in PDF as embedded JPEG files which can be extracted.

EPS and PostScript

Are converted to PDF the same way Preview does before they are "Juiced".

Losslessly compressed images

File Juicer extract them as PDF to preserve the ICC color information with the file.

Vector Graphics

Extract Vector Graphics from PDF Is an integrated part of the PDF files, and there is no boundary between text and graphics. To extract vector graphics from a PDF, you can use Tiger & Preview which let you place a crop and copy the contents to a new PDF.

One exception is if an EPS file is embedded in a Word document and then printed to PDF. File Juicer can extract the EPS from this PDF.

Images Mirrored, Inverted or Cut Into Strips

File Juicer extracts images, exactly as they are, no changes, from the files you search in. PDF files are made by many applications, and some of those cut images into stripes, inverts, rotates or mirrors them, scales them or covers some of them up. What ends in the PDF is decided by the application which creates the PDF.

You can deal with such PDF files by "rendering" them to a pixel based file format with Preview.

  • Select the image you wish to save
  • Copy it
  • Pick New from the File menu
  • Save as TIFF, JPG or PNG as you desire
Adobe has professional applications which can do this too, like Adobe Photoshop CS5 Adobe Photoshop but that is probably more than you need.

Text

File Juicer can extract the text both as plain text and as RTF. In the plain text case, you enable the "ascii" checkbox in the preferences. Note that the encoding of the extracted text is UTF-8 which preserve the "exotic" characters which can be found in PDF files. TextEdit understand UTF-8 if you chose it in the preferences.

RTF can also be a good format if you wish to convert simple PDF files to Word.

Scanned Text

If you have a scanned document, File Juicer can extract the images from it, but it does not convert the images to text. You need an Optical Character Recognition application.

ICC profiles

PDF files may result in File Juicer finding ICC profiles. They are stored separate from the images but when File Juicer saves images from PDF, it will include the ICC profiles correctly.

Encrypted PDF files

Are not searched or decoded by File Juicer. If the PDF allow printing you can print to PDF and juice that one instead. Otherwise you need a PDF password recovering utility.

Troublesome PDF files

Some PDF files may be encoded in unusual ways where images are not stored in any of the standard formats File Juicer can recognize (see File Juicer's preferences for a list). Sometimes Preview can help out as it can "normalize" the PDF into a more streamlined format if you "Print" the PDF to PDF.