File Juicer: Byte by Byte search, find and extractFile Juicer searches the files byte by byte for the formats it knows.
JPEG, PNG, GIF, PDF, BMP, WMF, EMF, PICT, TIFF, Flash, ZIP, HTML, WAV, AVI, MOV, MPEG, WMV, MP3, MP4, AU and AIFF.
The only requirement for this to work, is that the file to recover is stored in exactly one of the above formats.
To see how this works, you can try to download and extract images from a Windows screensaver. The screensaver in this example contains 20 JPEG images from Tetons National Park inside an .EXE file.
File Juicer does not decode and reencode data. For example: dropping an MP3 file on File Juicer will not make it into a WAV or AAC file. It will however extract album cover art, because MP3 files can contain images.
For conversion of from one video file format to another, I recommend looking at ffmpegX and QuickTime Pro
A new piece of software for WMV conversion I have not tried is WMV studio.
Text is slightly different. Here the requirement is "fuzzy": there should not be too much binary data close by. More about text below.
File Juicer can search for several file formats, and it takes more time to search for all formats. Decoding email attachments is roughly twice the work of searching for the other file formats.
Email attachments are a special case, as it requires File Juicer to do base64 decoding, which is used for sending any kind of file as email. File Juicer just decodes the attachment and tries to figure out what kind of file it has found, in case the file name or file type is not immediately available.
File Juicer has one requirement of the files it searches: that the images should be stored in their original format inside the files. In Flash, QuickTime and Windows Media Files, the original files are sometimes converted into an internal format, which File Juicer does not understand.
File Juicer can squeeze a lot of files from browser caches or if you feed it fit folders containing a lot of images. The results are normally saved in a folder on the Desktop, but you can configure File Juicer to save the results next to the juiced files (as Stuffit Expander), or choose a place somewhere else.
Crating thumbnails can take some time, but for an overview of the images, this is convenient. You can enable this check box, and skip the process manually if it takes too long.
Don't save duplicates, compare the contents of the images it finds, and if it find the same image twice, it will not save the copies, even if they have different names of dates.
The checkbox "Organize files in folders for each format" is helpful when extracting many files of different formats. If you are generating icons to test which files may be corrupt, File Juicer will place files where it could not generate icons inside yet another folder.
If (when!) I learn about bugs, I will fix them and update the web site.
File Formats List. I have added information about more than 100 common and not so common file formats, what they may contain, and hints to other applications which are relevant.
Here is a short list of the formats File Juicer handles:
For more info about which formats File Juicer have extracted files from, see the File Juicer Formats page.
disk images. This is useful for un-erasing files deleted by accident.
The situation where I have needed this mostly, is to recover photos on the flash card from my digital camera.
You can try this on your flash card now, when you have read the images in your normal preferred way. That way you get an idea of what to expect the day you need it. File juicer uses Apples's "Disk Utility" tool to make a disk image of your flash card.
To make a disk image from a flash card, choose "Flash Card..." from the File menu, and File Juicer will read the flash card, and extract the files from the result, and keep the disk image.
If you use Mac OS X 10.2.8, you can use Apple's Disk Copy to make a disk image, which you then drag into File Juicer. Screen shot.
When you recover images from flash cards, you may get many tiny JPG files which are remainders of images. To sort out which are good and which are not, you can set File Juicer's preferences to generate thumbnail icons, or just sort by size.
If you have erased single images, shot new ones, and erased again on the flash card, the results may be worse as the images gets broken up and the parts stored different places on the card. I usually empty my flash card completely when I connect to my Mac, so I have found one year old images on my cards.
You can make disk images of any type of disk, and try on those. It may be very slow if you try to juice disk images larger than the amount of RAM in your Mac.
More about flash card recovery
On this screen shot, I have filtered an mp3 file, and the tiny amounts of text inside gets extracted. File Juicer names the file with a percentage in the name, this percentage tells you how large a part of the file is text. For the mp3 file it was about 1%. Even with files made by word processors this may be well below 50%.
If you filter files from Microsoft Office, or files of Windows origin, you may benefit from using the Preferences in TextEdit which lets you interpret those files better.
When File Juicer extract text from PDF and Word documents, its encoding is UTF-8 which TextEdit handles nicely even if it is not the default.
Two options from the preferences are:
Other options for handling lots of images are:
Screen shot of how Preview is showing the files
selected in Finder in the background
(click image to enlarge)
Google search: illusions.pps
If you followed the above Google link, and tried to download
the illusions.pps file, you may get gibberish like this:
Solution: control-click on the link and chose to save the file.
Choose "Don't append" if Safari asks about adding ".txt" to the file name.
File Juicer does not decrypt PDF files which are encrypted. This will result in white images.
Before Mac OS X 10.4, extracting text from PDF files is not practical, although it can work. On Mac OS X 10.4, PDF Text extraction use the same converter as copy and paste from Preview.
File Juicer recognize .ZIP, ,bz2, .rar and "deflate" compressed data and will extract it so you can decompress it with Finder, but not other compression algorithms. It does not decrypt encrypted data.
If File Juicer should crash and you wish to tell me about it, I would appreciate if you send me the file:
A few tips about: reporting bugs.
Log FilesFile Juicer saves two log files in your Library > Logs folder. "FileJuicerLog.txt" and "FileJuicerResultsLog.txt". They are created for every "juicing", and contain the names of the files juiced and the files found. The log files are emptied every time you juice a new set of files, so you don't need to delete them as they don't grow.
Special Names on found filesSome files get special names. If they contain "", it means that compressed data was found 1346 bytes from the beginning of the file.
If the end is ".inflated" or ".bz2 extracted", it refers to the name of decompression algorithm used when extracting the file, and that the extracted data was not one of the formats File Juicer can identify. See .inflated for more info.
Text files can have something like "(6%)" in the file name. It means that only 6% of the juiced file was text.
Display of Images while JuicingYou can turn this off by the little checkbox. This makes File Juicer a bit faster, but what is more important, is that it becomes more stable under very large "juicings". An example is if you have a disk image of a PC hard disk, a lot of remainders of deleted files can be found, and Mac OS 10.3.9 has rare issues when trying to display damaged PDF or TIFF files.
defaults write com.echoone.FileJuicer skipRawConversion -boolean yesIf you wish to extract only the largest JPEG inside use this command
defaults write com.echoone.FileJuicer returnOnlyLargestFile -boolean yesThis will also apply if you extract images from PDF files only the largest image will get extracted (and you may like them all).
defaults write com.echoone.FileJuicer skipWMVConversion -boolean yesThis will turn off conversion of WMV to MOV conversion which you may now wish if you use File Juicer as a forensics tool where the search anf extract function is all you need.
AppleScriptFile Juicer understands a bit of AppleScript which can be useful if you wish to do a folder action setup. Feedback and bug reports are welcome.