Format GuidesMarch 24, 2026
Meidy Baffou·LazyPDF

Why Are Scanned PDFs So Large? (And How to Fix It)

You scan a two-page letter and end up with a 15 MB PDF. You email it and get a 'file too large' bounce. You try to upload it and the progress bar crawls. Sound familiar? Scanned PDFs are notorious for being enormous compared to their content. A document with three paragraphs of text, scanned to PDF, can be ten to fifty times larger than the same document saved digitally from Word. This is not a bug or a scanner malfunction — it's the natural result of how scanning works. Understanding why scanned PDFs are large helps you make smarter decisions: when to scan at lower resolution, when to use grayscale instead of color, and how to compress after the fact without losing legibility. This guide covers the technical reasons in plain English, compares different scan types, and gives you actionable steps to shrink these files to a manageable size.

The Core Reason: Images Instead of Text

A digitally created PDF — one made from Word, Google Docs, or any word processor — stores text as actual text. The letter 'A' is stored as a few bytes representing the character, its font, and its position. This is extraordinarily compact. A full page of text digitally might be 50–100 KB. A scanned PDF stores each page as a raster image. The scanner or camera photographs the physical paper and saves every pixel of that image. The letter 'A' becomes thousands of pixels of varying shades. At 300 DPI on a standard letter-size page (8.5" × 11"), that's 2,550 × 3,300 pixels = roughly 8.4 million pixels per page. In color (24-bit), that's about 25 MB of raw data per page before any compression. Even with JPEG compression applied, you're looking at 1–5 MB per page depending on the compression level and image complexity. That's why a 10-page scanned document can be 15–50 MB.

  1. 1Open a digitally created PDF (e.g., exported from Word) and note its file size.
  2. 2Scan the same document with your phone scanner and note the resulting PDF size.
  3. 3Compare — the scanned version is typically 10–50x larger.
  4. 4Upload the scanned PDF to LazyPDF's compress tool.
  5. 5Download the compressed version — typically 60–85% smaller while remaining legible.

DPI: The Biggest Factor in Scan File Size

DPI (dots per inch) is the resolution of your scan. It directly determines how many pixels are captured per inch of the original document. Higher DPI means more detail — and dramatically more data. Here's the relationship: doubling the DPI quadruples the number of pixels (because you're doubling in both dimensions). A page scanned at 300 DPI contains 4× more pixels than the same page at 150 DPI, so it's roughly 4× larger before compression. **Common DPI settings and their practical uses:** - 72–100 DPI: Web display only — text may not be crisp enough for printing - 150 DPI: Acceptable for most office documents — text readable, smaller file - 200 DPI: Good balance for most scanning needs — recommended for forms and letters - 300 DPI: Professional standard — required for OCR to work accurately - 600 DPI: Fine detail scanning — for photographs, engineering drawings, or archival copies Most scanner apps default to 200–300 DPI, which is appropriate for quality but produces larger files. If you don't need print-quality output, scanning at 150 DPI can cut file size by 75% compared to 300 DPI.

Color vs. Grayscale vs. Black-and-White

The color mode of your scan is the second major factor in file size. **Color (RGB)**: Each pixel stores three values (red, green, blue), typically 8 bits each = 24 bits per pixel. This is the largest format and the default for most modern scanner apps because phone cameras are inherently color sensors. **Grayscale**: Each pixel stores a single brightness value (0–255). This is 8 bits per pixel — one-third of the data compared to color. For text documents and black-and-white forms, grayscale looks identical to color but produces files about 2–3× smaller. **Black-and-white (1-bit)**: Each pixel is either black or white. This is 1 bit per pixel, making it 24× smaller than color. For text-only documents, 1-bit scanning is perfectly legible and dramatically smaller. Many fax machines and document management systems use 1-bit TIFF or PDF because of this efficiency. For a typical office letter, switching from color to grayscale halves the file size, and switching to black-and-white can reduce it by 90% with no perceptible quality loss in the text.

How to Shrink Oversized Scanned PDFs

Once you have an oversized scanned PDF, you have several options to reduce its size. The most practical for most users is post-scan compression using an online tool. **Option 1 — Web-based compression**: Upload to LazyPDF's compress tool. The tool resamples images to a lower effective resolution (typically 150 DPI for text, retaining original for photos) and re-encodes them with JPEG or mixed compression. Results: 60–85% reduction, text remains fully legible. **Option 2 — Re-scan at lower settings**: If you haven't scanned yet, choose grayscale or black-and-white mode and 150–200 DPI. This prevents the problem rather than fixing it afterward. **Option 3 — OCR + compression**: Run OCR to add a searchable text layer, then compress. The text layer is tiny; the image layer still gets compressed. Benefit: the PDF becomes searchable (Ctrl+F works) and is also smaller. **Option 4 — Convert to JPEG first, then back to PDF**: This approach downgrades quality significantly and isn't recommended for official documents. Stick with proper PDF compression tools. For ongoing workflows — like scanning receipts, contracts, or forms regularly — consider setting your scanner app to medium quality and enabling auto-compression to avoid accumulating large files.

Comparison: File Size by Scan Type

To illustrate how dramatically these factors affect file size, here's a comparison for a typical 1-page letter-size document: **Digital PDF (Word export)**: ~80 KB **Scanned at 150 DPI, grayscale**: ~300 KB **Scanned at 200 DPI, grayscale**: ~600 KB **Scanned at 300 DPI, color**: ~2–4 MB **Scanned at 600 DPI, color**: ~8–16 MB **After compression (300 DPI color source)**: ~400–800 KB These are approximate figures that vary with document content. Text-heavy pages compress more than photo-heavy pages because text has high redundancy (large white areas, repeating patterns). For most business use cases — emails, HR forms, contracts, invoices — the sweet spot is 200 DPI grayscale (or post-compress a 300 DPI scan). This gives professional quality at email-friendly sizes. Reserve 600 DPI for archival storage where future enlargement or high-fidelity printing may be needed.

Frequently Asked Questions

Is it safe to compress a scanned PDF that contains signatures?

Yes, with appropriate compression settings. A signature in a scanned document is stored as image pixels. Moderate compression (targeting 150 KB per page) preserves signatures clearly enough for legal purposes. However, for court submissions or documents that will be scrutinized closely, use lighter compression (70–80% size reduction rather than 90%) to retain fine pen strokes. Always keep a full-resolution backup of the original scan.

Does DPI affect OCR accuracy in scanned PDFs?

Yes, significantly. OCR (optical character recognition) algorithms perform best on scans of 200–300 DPI minimum. Below 150 DPI, letter recognition accuracy drops sharply, especially for small fonts, handwriting, or documents with noise. If you plan to run OCR on your scanned PDFs, always scan at 300 DPI first, then compress the resulting file. This preserves OCR accuracy while reducing storage size.

Why does my scanner app make PDFs larger than my desktop scanner?

Smartphone scanner apps typically use the phone's camera, which captures images at native camera resolution (often 8–48 megapixels) before downsampling. Some apps also apply HDR processing or multiple exposures, increasing processing time and file size. Desktop scanners with dedicated glass beds and controlled lighting tend to produce cleaner, more compressible images. If phone scan files are too large, check the app's quality settings and set them to 'Medium' or '200 DPI'.

What is the maximum PDF file size most email providers accept?

Gmail supports attachments up to 25 MB. Outlook.com and Office 365 allow 20 MB. Yahoo Mail allows 25 MB. Many corporate email servers restrict attachments to 10 MB or less. For safe universal delivery, keep scanned PDF attachments under 5 MB. For multi-page documents, compressing after scanning typically achieves this easily.

Shrink your oversized scanned PDF in seconds — free, no signup required.

Compress My PDF

Related Articles