Why Is My PDF So Large?
It's a familiar surprise: a document that's "just a few pages of text" turns out to be 40 MB. PDFs balloon for a handful of predictable reasons. Once you know which one you're dealing with, the fix is usually quick.
Cause #1: Scanned pages (the biggest offender)
This is by far the most common reason. When you scan a document, each page is captured as a photograph â a grid of pixels â not as text. A single page scanned at 600 dpi in color can be several megabytes on its own. Ten such pages and you're at 30 MB+ before you've added anything else.
The fix: compress the file. Because scanned pages are already images, lossy compression shrinks them dramatically while keeping them perfectly legible. Re-scanning at a lower resolution (150â200 dpi for on-screen reading) prevents the problem at the source.
Cause #2: High-resolution or uncompressed images
Photos dropped into a document at full camera resolution, or screenshots saved as large PNGs, add up fast. A modern phone photo can be 5â10 MB each; a few of them dominate the whole file. PNG, while great for crisp graphics, is often far larger than JPEG for photographic content.
The fix: compression re-encodes those images efficiently. If you're building the document yourself, resize and compress images before placing them.
Cause #3: Embedded fonts
To guarantee a document looks identical everywhere, PDFs can embed the fonts they use. Embedding entire font families â especially large or multiple fonts â adds weight. Usually modest, but it adds up in font-heavy designs.
The fix: font subsetting (embedding only the characters used) is handled automatically by good PDF software; re-exporting with subsetting enabled helps.
Cause #4: Redundant and leftover data
PDFs accumulate cruft, especially after repeated edits:
- The same image stored multiple times instead of referenced once.
- Old revisions retained inside the file (incremental save history).
- Unused objects, bulky metadata, and form-field data.
- Comments, annotations, and layers you no longer need.
The fix: re-saving or rebuilding the PDF discards much of this. Lossless "optimize" passes clean up duplicates and unused objects.
Cause #5: Vector complexity
Occasionally the weight isn't images at all but extremely complex vector artwork â a map, a CAD export, or a chart with hundreds of thousands of points. These are rarer and harder to shrink without simplifying the artwork itself.
How to tell which cause you have
If the file is mostly scanned or photo content, it's almost certainly cause #1 or #2 â and compression will help enormously. If it's a text document that's mysteriously huge, suspect redundant data (#4) and try re-saving or optimizing.
The quick fix that covers most cases
For the overwhelming majority of oversized PDFs â scans and image-heavy files â a single compression pass solves the problem. CrunchyPDF's Compress tool re-renders and re-encodes each page in your browser, typically cutting file size by 40â80%. And because it runs locally, you can compress sensitive documents without uploading them anywhere.
One thing to remember: this kind of compression converts pages to images, so selectable text is lost in the output. If you need the text to stay searchable, keep an uncompressed master copy and compress only the version you're sending. For the full picture, see How PDF compression actually works.