Extract Images

Extract all images from PDF

PDFs often serve as archives for visual content — product catalogs, illustrated reports, marketing materials, technical manuals with diagrams, and photo books all embed images inside the PDF container. When you need to reuse those images in other projects, re-edit them in image editing software, upload them to a website, or submit them to a print vendor, you need to extract them from the PDF at their original full resolution. LazyPDF's Extract Images tool scans every page of your document and recovers all embedded image objects at their native resolution — not the scaled-down version you see on screen, but the actual image data as it was embedded. This is particularly important for high-resolution photography or print-quality graphics where the original resolution may be 300 DPI or higher. The tool also handles PDF transparency correctly by reading SMask data, which means images with transparent backgrounds (such as logos, product cutouts, or diagrams) are extracted with their full alpha channel intact rather than being flattened onto a white background. All processing happens in your browser using pdfjs-dist — the same engine used by Firefox's built-in PDF viewer — so your files never touch a server. Extracted images are delivered as PNG files for maximum quality and compatibility. You can download each image individually or batch-download all of them at once, making it easy to recover an entire collection of product photos from a catalog or all the diagrams from a technical manual in one operation.

How It Works

Extract Images scans every page of your PDF and pulls out all embedded image objects in their original resolution using pdfjs-dist — the same PDF rendering engine that powers Firefox's built-in viewer. The tool reads each image's raw data stream directly from the PDF's internal structure rather than rendering the page to screen, which means you get the actual embedded image at its full embedded DPI, not a screen-resolution screenshot. The tool also reads SMask data — the PDF standard for transparency/alpha channel information — and combines it with the main image data to reconstruct images with their original transparent backgrounds intact. All images are saved as PNG for lossless quality. Processing happens entirely in your browser with no server upload.

Key Features

Full Original Resolution

Images are extracted at their original embedded resolution — typically 300 DPI or higher for print-quality PDFs — not at the lower screen-display quality.

Transparency Preservation

Correctly reads SMask transparency data to preserve alpha channels, so logos and diagrams with transparent backgrounds are extracted without a white background.

Batch Download

Download all extracted images in one operation, or select individual images to save. All images are delivered in high-quality PNG format.

Complete Privacy

All extraction runs in your browser using pdfjs-dist. Your PDF and its embedded images are never sent to any server at any point.

All Pages Processed

The tool scans every page of your document automatically, extracting every embedded raster image object regardless of its position or the page it appears on.

No Image Count Limit

Extract 5 images or 500 images from the same PDF — there is no restriction on how many images can be recovered from a single document.

Works on Any PDF

Extract images from product catalogs, technical manuals, illustrated reports, scanned documents, or any PDF that contains embedded raster images.

Frequently Asked Questions

What format are the extracted images saved in?

Images are extracted and saved as PNG files. PNG is a lossless format that preserves full quality and supports transparency, making it suitable for photographs, diagrams, logos, and graphics with transparent backgrounds extracted from PDFs.

Why do some images look different from how they appear in the PDF?

PDFs can apply display-level transformations like cropping, scaling, or color space adjustments when rendering. The extraction tool pulls the original embedded image data, which may be larger, differently proportioned, or in a different color space than what appears in the PDF viewer.

Can I extract images from a scanned PDF?

Yes. A scanned PDF stores the scan as one large raster image per page. The tool extracts each page's scan image as a full-page PNG. These are full-page images of the scanned content rather than individual photos or graphics from the original physical document.

How many images can I extract at once?

There is no limit on image count. The tool processes every page and extracts all embedded image objects. For documents containing hundreds of images — like large product catalogs — extraction may take a minute but completes fully without any caps.

Will the extracted images be at their original print resolution?

Yes. The tool reads the raw image stream data from the PDF as originally embedded. Images embedded at 300 DPI for print purposes are extracted at that full resolution — not the lower resolution shown on screen. This is critical for recovering print-quality photography.

Can I extract only images from specific pages?

The current tool processes all pages. To extract images from specific pages only, use the Split tool first to extract those pages into a separate PDF, then run the image extractor on that smaller file.

What if a PDF contains vector graphics rather than raster images?

Vector graphics drawn with PDF path operators (lines, curves, fills) are not stored as image objects and cannot be extracted as image files. Only raster images (embedded JPEGs, PNGs, etc.) are recovered. To capture vector content, render the page to an image using the PDF to JPG tool.

Do extracted images include EXIF or metadata from the originals?

EXIF metadata is typically stripped when images are embedded into a PDF — the PDF format does not preserve EXIF data. Extracted PNG files will not contain the original camera or creation metadata even if it was present before the PDF was created.

Can I use this tool to recover images from a corrupted PDF?

If the PDF file can be loaded and parsed at all by the browser, the tool will attempt to extract all readable image objects. Severely corrupted PDFs may not load at all, but PDFs with localized corruption may still yield some recoverable images.

What is SMask data and why does it matter?

SMask (Soft Mask) is the PDF format's mechanism for storing transparency information. Without reading SMask data, extracted images with transparent areas — logos, cutout product photos, diagrams — would have their transparent pixels replaced with solid white or black. Reading SMask data correctly produces images with their original transparent backgrounds.

Try more free PDF tools

No signup needed, no watermarks, free to use.