Extracting raw images from PDF

If you just screen-shot a PDF to get the images, you are losing quality since you’re not extracting at the native resolution. Let’s use PDFtk to extract the original images from the PDF.

Related: extracting a page(s) from PDF

List all images in a PDF

Some images are vector art and will not export with this program

pdfimages -list in.pdf

Dump all images from PDF

pdfimages -all in.pdf out

That will dump all the images in mydoc.pdf to the same directory, filenames starting with out-. There might be a lot of images.

PDFtk extract images from PDF specific pages

say you want page 3 only:

pdfimages -all -f 3 -l 3 in.pdf out

-f first page to extract

-l last page to extract

Install PDFtk

PDFtk installs on any operating system easily, here’s how.

Linux PDFtk install

apt install poppler-utils

Windows PDFtk install

https://www.pdflabs.com/tools/pdftk-the-pdf-toolkit/

Mac PDFtk install

Follow the discussion at

https://stackoverflow.com/questions/20804441/how-to-install-pdftk-on-mac-os-x

Leave a Comment