File Juicer
File Juicer for macOS
Overview of Formats

Search & Extract

Images

jpg jpeg 2000 gif png pdf wmf emf tiff eps pict bmp

Video

mov mpeg avi wmv

Sound

mp3 wav System 7 au aiff

Text

ascii rtf html

From:

avi cab cache chm dmg doc emlx exe ithmb m4p mht mp3 pdf pps ppt raw swf wps xls zip and other formats

Pages files

Is a document which may contain images and video which File Jucier can extract. It has been created by Pages a part of Apple's iWork office suite.

Text In Pages Files

Pages is mostly for text, however to allow to all the layout features of Pages, the file format is mostly about layout

The text is mixed in with the layout information in the gz file extracted by File Juicer

Doubleclick the gz file to get the xml file inside.

The XML file has everything which is much more than most will ever need. The text is in there and to find it use TextMate and its strip html function and perhaps its reformat function.