PDF Text Not Selectable: 3 Causes and Step-by-Step Fixes
<p>When you click on text in a PDF and nothing highlights, the document is almost certainly one of three things: a scanned image with no text layer, a document with copy permissions disabled, or a file with a corrupted or misaligned text layer. Each of these problems looks identical from the outside — you click, nothing selects — but each requires a completely different fix. Applying the wrong fix wastes time and sometimes makes the problem harder to resolve.</p><p>This guide identifies which problem you are actually dealing with and gives you the specific steps to fix it. For scanned PDFs — the most common cause of unselectable text, accounting for roughly 70% of cases — the solution is OCR (optical character recognition), which reads the image content and generates a searchable text layer without altering the document's appearance. For locked PDFs with copy restrictions, the solution depends on whether you have authorization to remove those restrictions. For font and text layer issues, the fix involves re-exporting or reprocessing the original source document. Each scenario is covered in full below, along with a diagnostic step to help you identify which problem you have before you spend time on the wrong solution.</p>
How to Diagnose Why Your PDF Text Is Not Selectable
<p>Before you apply any fix, spend 90 seconds diagnosing the cause. The three root causes are visually similar but structurally different, and the correct diagnosis determines which fix you need. Applying an OCR solution to a locked PDF, for example, will produce a second image layer on top of the existing content without solving the copy restriction problem.</p><p>The fastest diagnostic is to open the PDF and check three things in sequence. First, <strong>zoom in to 400% and look at the text closely</strong>. If the letters appear slightly pixelated, show scan artifacts (specks, uneven edges, inconsistent baseline), or look like a photograph of printed text, the PDF is almost certainly image-based — a scanned document with no text layer. This covers approximately 70% of unselectable-text cases.</p><p>Second, if the text looks clean and sharp at high zoom (smooth letter edges, consistent baseline, no scan artifacts), the PDF was created from a digital source. In this case, right-click on the PDF (or check the document properties) and look for a security indicator. Most PDF readers show a padlock icon, a 'Secured' label in the title bar, or a 'Document Properties → Security' tab listing the restrictions applied. If you see 'Content Copying: Not Allowed', the document is digitally locked — the text is there, but copying it is restricted. This covers approximately 20% of unselectable-text cases.</p><p>Third, if the text looks sharp and the security settings show no copy restriction, but text still does not select reliably, you are dealing with a font or text layer issue: the PDF was exported with embedded fonts that are not properly mapped to Unicode characters, with a text layer that does not align with the visible text, or with a rendering problem in the specific PDF viewer you are using. This is the least common cause — approximately 10% of cases — and requires a different diagnostic approach detailed in the section below on font and text layer issues.</p><p>Understanding which problem you have before you start fixing will save significant time. Each of the three root causes is entirely fixable using free tools, but the workflows are different, and applying the wrong one produces no improvement. The sections below walk through each fix in detail, starting with the most common cause: scanned PDFs without a text layer.</p>
Fix 1: Run OCR to Add a Text Layer to Scanned PDFs
<p>A scanned PDF is not a text document — it is a collection of images, one per page, stored in a PDF container. When you look at the text in a scanned PDF, you are looking at a photograph of text. The PDF viewer has no idea that the pixels represent letters; it just renders the images. There is no underlying text to select because, technically, there is none — only the visual representation of text as image data.</p><p>OCR (optical character recognition) solves this by analyzing the images and generating a text layer that maps recognized characters to their positions on each page. After OCR is applied, the PDF looks identical to the original scan, but the PDF viewer now has a text layer to work with: you can select text, copy it, search for terms within the document, and use screen readers to access the content. High-quality OCR tools achieve 98–99% accuracy on clean, high-resolution scans and 90–95% accuracy on lower-quality or handwritten documents.</p><p>For OCR to work well, the source scan needs to meet minimum quality thresholds. A scan at 300 DPI (dots per inch) or higher produces reliable OCR results. Scans below 150 DPI produce poor character recognition — the letters are too low-resolution for the algorithm to reliably distinguish similar shapes (like 'l', 'I', and '1', or 'rn' and 'm'). If your PDF is a very low-resolution scan, improving the source scan quality before running OCR will produce dramatically better results than running OCR on a poor-quality image.</p><p>Page orientation also affects OCR accuracy significantly. Text that is rotated 90 or 180 degrees produces much lower accuracy than properly oriented text. Most OCR tools include automatic orientation detection, but you should visually verify that pages are correctly oriented before running OCR on a large document. Our guide on <a href='/en/blog/ocr-pdf-offline-without-cloud'>how to run OCR on PDFs offline without cloud upload</a> covers the specific tools and settings that produce the best results for different document types — including forms, legal documents, and multi-column layouts where standard OCR can struggle. Use <a href='/en/ocr'>LazyPDF's OCR tool</a> to add a searchable text layer to scanned PDFs directly in the browser, without uploading to a third-party cloud service.</p>
- 1Verify the PDF is actually a scanned imageOpen the PDF and zoom in to 300–400%. If the text shows pixelation, scan artifacts, or uneven edges — if it looks like a photograph of printed text rather than clean digital text — it is a scanned PDF. Confirm this by pressing Ctrl+A (or Cmd+A) to select all: if nothing selects, or if the entire page selects as a single image object rather than individual text characters, you have a scanned document without a text layer. This is the starting condition for the OCR fix.
- 2Check the scan resolution before running OCRGo to the PDF's document properties (File → Properties in most PDF readers) and look for the image resolution of the embedded scans. If the resolution is below 200 DPI, OCR accuracy will be poor. Re-scan the original document at 300 DPI or higher if possible. If you only have the low-resolution PDF and no access to the original document, run OCR anyway — 90% accuracy on a poor scan is still better than 0% selectable text — but be prepared to manually verify any extracted text before using it in critical workflows.
- 3Run OCR using LazyPDF's OCR toolUpload the scanned PDF to LazyPDF's OCR tool. The tool processes each page, recognizes the text using optical character recognition, and embeds a text layer in the PDF without altering its visual appearance. Download the resulting file. Open it in your PDF reader and test text selection: click and drag across a line of text. If text highlights correctly, OCR was successful. Search for a term you know is in the document (Ctrl+F or Cmd+F) to confirm the text is fully searchable.
- 4Verify OCR accuracy on a sample of pagesAfter OCR, copy a paragraph of text and paste it into a plain text editor. Scan for obvious errors: numbers mistaken for letters (0 vs O, 1 vs l), punctuation misread as characters, or words that look obviously wrong. OCR accuracy above 98% on clean scans is normal, but errors cluster in low-contrast areas, handwritten annotations, or tables. Correct critical errors manually. For legal or financial documents where exact text matters, budget time for a full accuracy review before relying on OCR output.
Fix 2: Remove Copy Restrictions From a Locked PDF
<p>A PDF with copy restrictions is a digital document where the text exists and is perfectly legible, but the document's permissions settings have specifically disabled the ability to select and copy content. This protection is applied using the PDF standard's permissions model, which allows document creators to allow reading while restricting printing, copying, modifying, and commenting independently. If you click on text and the cursor changes to a 'no' symbol or nothing highlights despite the text looking sharp and clean, permissions restriction is the likely cause.</p><p>Identifying this is straightforward: in Adobe Acrobat Reader, go to File → Properties → Security. The Security tab lists the permissions applied to the document. If 'Content Copying' shows 'Not Allowed', copy restriction is confirmed. Other PDF readers display this information differently — look for a padlock icon, a 'Restricted' or 'Secured' label, or a document information panel with a permissions section. The presence of a permissions password (also called the owner password in PDF terminology) is what enforces these restrictions.</p><p>Before proceeding with any restriction-removal approach, consider the legal and ethical context. Copy restrictions are applied intentionally by the document's creator. If you created the document yourself and simply forgot the permissions password, removing the restriction is unambiguously appropriate. If the document was shared with you by another party, removing copy restrictions may violate the terms under which you received the document, digital rights management agreements, or copyright protections. Use judgment: extracting text for accessibility purposes, for your own research notes, or from a document you own is generally reasonable; bypassing restrictions to reproduce or redistribute commercial content is not.</p><p>For documents you have authorization to unlock, <a href='/en/unlock'>LazyPDF's unlock tool</a> removes the permissions password and restores full access to the document's content. Once the restrictions are removed, text selection, copying, printing, and modification work normally. For a complete explanation of PDF password types, the difference between open passwords and permissions passwords, and how to handle each, see our guide on <a href='/en/blog/remove-pdf-password-free-without-adobe'>removing PDF passwords free without Adobe</a>. Note that LazyPDF's unlock tool removes permissions restrictions — it cannot unlock documents protected with an open password (the password required to open the document at all) without the correct passphrase, as this would undermine legitimate document security.</p>
- 1Confirm that copy restrictions are the causeOpen the PDF and go to File → Properties → Security (in Adobe Acrobat Reader) or the equivalent document information panel in your PDF viewer. Look for 'Content Copying: Not Allowed' or a similar restriction indicator. If you see this, permissions restriction is confirmed as the cause. If the security tab shows 'No Security' or 'Content Copying: Allowed' but text still does not select, the problem is a text layer issue — proceed to Fix 3 instead.
- 2Verify you have authorization to remove the restrictionsConfirm that you have a legitimate reason to remove the copy restrictions before proceeding. Documents you created yourself, documents shared with you for unrestricted use, or documents whose restrictions are preventing legitimate accessibility needs are appropriate candidates. If the document is a copyrighted publication, commercially licensed content, or a document shared with explicit restrictions that you agreed to, removing the restrictions may violate those terms. When in doubt, contact the document's creator and ask for an unrestricted version.
- 3Remove the permissions restriction using LazyPDF's unlock toolUpload the restricted PDF to LazyPDF's unlock tool. The tool removes the permissions password and outputs an unrestricted PDF with identical content and appearance. Download the unlocked version and test text selection: click and drag across a line of text in your PDF reader. If text highlights correctly, the restriction has been successfully removed. The file size will be identical or nearly identical to the original — unlocking does not reprocess the content, only the permissions metadata.
Fix 3: Repair PDFs With Corrupted or Misaligned Text Layers
<p>Font embedding issues and corrupted text layers are the least common cause of unselectable text, but they are the trickiest to resolve because the problem is less obvious. A PDF with a text layer issue typically looks perfectly normal — the text is sharp, digital-looking, and appears to have no restrictions — but selecting text produces garbled characters, selects the wrong region, or highlights text that does not correspond to what is visible on screen. This happens because the PDF's text layer (which the viewer uses for selection) does not correctly map to the visible glyph rendering.</p><p>The most common cause is a font encoding problem. PDF files can use several different font encodings to map character codes to glyphs. When a PDF is created with a non-standard or improperly configured encoding — particularly common with some design tools, legacy document converters, and PDFs created from certain Asian-language document systems — the mapping between the visible characters and the underlying text data breaks down. The viewer renders the correct visual glyphs but cannot extract the correct characters when the user selects text.</p><p>A related problem occurs with PDFs that have been compressed, optimized, or modified by tools that do not fully preserve the text layer. Aggressively compressing a PDF to reduce file size can corrupt the font subsetting information, resulting in a document where text appears correctly but selection and copying extract incorrect characters or fail entirely. This is one reason our guide on <a href='/en/blog/compress-pdf-without-losing-quality'>compressing PDFs without losing quality</a> emphasizes using compression tools that preserve the document structure rather than aggressively re-encoding content.</p><p>For PDFs with text layer issues, the most reliable fix is to re-export the document from its original source. If you have access to the Word document, spreadsheet, or design file that the PDF was created from, re-export it as a PDF using a different PDF generator. The built-in PDF export in Microsoft Office, Google Docs, and macOS print dialog all produce correctly encoded text layers for standard documents. If you do not have access to the original source file, running the PDF through a PDF repair or re-optimization tool can sometimes correct encoding issues by regenerating the internal document structure. For documents where the original source is completely unavailable, OCR is the fallback: running OCR on the PDF generates a new, correctly encoded text layer from the visible content, effectively replacing the broken text layer with a fresh one.</p>
How to Test Whether PDF Text Extraction Will Work
<p>Before sharing a PDF with colleagues, clients, or in a public submission, it is worth verifying that text extraction works correctly in the final file. A PDF that looks perfect in your PDF viewer but contains unselectable text will create accessibility problems for recipients using screen readers, cause failures in document processing systems that extract text automatically, and prevent recipients from copying content they need to reference or quote.</p><p>The simplest test is the copy-paste check: select all text in the document (Ctrl+A or Cmd+A), copy it, and paste it into a plain text editor. If the pasted text corresponds correctly to the visible document content — in approximately the right order, with roughly the right words — the text layer is functional. If the pasted result is blank, garbled, or contains only a fraction of the visible content, the text layer has a problem that needs to be addressed before sharing.</p><p>For documents that will be processed by automated systems — forms submitted to government portals, documents uploaded to legal case management systems, invoices processed by accounting software — the standard is higher than human readability. These systems extract text programmatically and expect specific characters at specific positions. Test by extracting the text using a command-line tool or a PDF text extraction API to confirm that the machine-readable content matches the visual content exactly. Documents with any discrepancy between visual content and extracted text will produce errors in automated processing systems.</p><p>Accessibility is a second important consideration. Screen readers used by visually impaired users rely entirely on the PDF's text layer. A PDF with no text layer (scanned) or a broken text layer (encoding issues) is effectively inaccessible to screen reader users. If your documents may be accessed by users with visual impairments, or if you are required to meet accessibility standards (Section 508 in the US, EN 301 549 in the EU), ensuring text selectability is a compliance requirement, not just a convenience. Our guide on <a href='/en/blog/pdf-accessibility-checklist-2026'>the PDF accessibility checklist for 2026</a> covers text layer requirements alongside other accessibility standards including headings, alt text, reading order, and color contrast.</p>
- 1Run the select-all copy-paste testOpen the PDF in your standard viewer. Press Ctrl+A (Windows) or Cmd+A (macOS) to select all content. Press Ctrl+C or Cmd+C to copy. Open a plain text editor (Notepad on Windows, TextEdit on macOS) and paste. Examine the result: is the text present and readable? Does it correspond to the visible document content? If the paste is empty, the PDF is image-based (scanned) and needs OCR. If the paste is garbled, there is a font encoding issue. If the paste is correct, the text layer is functional.
- 2Test search functionalityPress Ctrl+F or Cmd+F to open the PDF viewer's search function. Search for a specific word or phrase that you can see on page 1 of the document. If the term is found and highlighted correctly, text search is working. If the search returns no results despite the term being visibly present, the text layer is either absent (scanned) or incorrectly encoded. Note that some PDFs pass the copy-paste test but fail search due to subtle encoding differences — testing both is worth the extra 30 seconds.
Preventing Unselectable Text When Creating PDFs
<p>The best fix for unselectable PDF text is preventing it from happening in the first place. Whether you are creating PDFs from Word documents, design tools, scanned documents, or web pages, a few specific practices ensure the output has a fully functional text layer that works reliably across all PDF viewers and text extraction tools.</p><p>For PDFs created from office documents (Word, Excel, Google Docs), always use the application's native PDF export rather than a print-to-PDF driver. Native export in Microsoft Office (File → Save As → PDF) preserves fonts, text encoding, and document structure. Print-to-PDF drivers — including Microsoft Print to PDF and macOS print-to-PDF — sometimes produce PDFs with font rendering issues that affect text selection, particularly for documents with non-standard fonts or complex formatting. When exporting from Word, check that the 'ISO 19005-1 compliant (PDF/A)' option is selected for documents that need long-term archival-quality text access.</p><p>For scanned documents, establish a scanning protocol that guarantees OCR-ready input quality. Scan at 300 DPI minimum (600 DPI for documents with very small text), use a flatbed scanner for precision and avoid phone camera scans for formal documents (camera distortion degrades OCR accuracy by 8–15% compared to flatbed scans), ensure consistent lighting without shadows across the page, and apply automatic deskew and contrast adjustment in the scanner software. Running OCR immediately at the point of scanning — rather than scanning first and running OCR later — catches quality problems before the original document is out of reach.</p><p>For PDFs created from design tools (InDesign, Illustrator, Figma-exported PDFs), always embed all fonts rather than outlining them. Outlined fonts convert text to vector paths, which look identical on screen but are invisible to text extraction tools — selecting outlined text produces the same result as selecting a scanned image. When exporting from design tools, look for options labeled 'Include fonts', 'Embed fonts', or 'Do not outline text'. For documents that will be read by others, outlined text is appropriate only for stylized headlines or decorative elements where exact text extraction is not required.</p><p>Finally, always test a sample PDF after changing your creation workflow. Export one document using the new settings, run the select-all copy-paste test, and confirm that the text layer is intact before producing the full batch. A 2-minute test catches problems that would otherwise affect every document in the production run. Combine text verification with a file size check using <a href='/en/compress'>LazyPDF's compress tool</a> — an excessively large PDF is sometimes a sign of an encoding problem that is also causing text layer issues, and compressing with structure preservation often resolves both problems simultaneously.</p>
Frequently Asked Questions
Why can't I select text in my PDF even though it looks normal?
If the text looks sharp and digital but will not select, the most likely causes are: copy permissions disabled by the document creator (check File → Properties → Security for 'Content Copying: Not Allowed'), or a font encoding issue where the text layer exists but is incorrectly mapped. Use LazyPDF's unlock tool to remove copy restrictions, or re-export the document from its original source with correct font embedding settings.
How do I know if my PDF is scanned or digital?
Zoom in to 300–400% and examine the text closely. Scanned text shows pixelation, scan artifacts (specks, uneven letter edges, slightly blurry strokes), and looks like a photograph of printed text. Digital text has perfectly smooth, clean edges at any zoom level. Alternatively, press Ctrl+A to select all — if the entire page selects as a single image block rather than individual characters, it is a scanned PDF without a text layer.
Will running OCR on a scanned PDF change how it looks?
No. OCR adds an invisible text layer behind the visible content — the scanned images remain unchanged and the PDF looks identical before and after. The only difference is that text can now be selected, copied, and searched. The file size increases slightly because the text layer adds data, typically 5–15% for a standard document. The visual appearance, page layout, and image quality are completely unaffected by the OCR process.
Is it legal to remove copy restrictions from a PDF?
It depends on the document and your jurisdiction. For documents you created yourself, removing your own restrictions is unambiguously legal. For documents received from others, the answer depends on the terms under which you received it, copyright status, and local law. Removing restrictions for legitimate accessibility needs or personal research is generally reasonable. Bypassing restrictions to reproduce or distribute copyrighted commercial content is not. When in doubt, ask the document creator for an unrestricted version.
Why does copying text from my PDF produce garbled characters?
Garbled copied text is a font encoding problem. The PDF renders the correct visual glyphs but the underlying character mapping is broken, so copying extracts incorrect characters. This happens with PDFs from certain design tools that outline fonts, legacy converters with encoding bugs, or aggressively compressed PDFs that corrupt font data. The fix is to re-export from the original source document using a different PDF generator with correct font embedding.
Can I make a scanned PDF searchable without changing its appearance?
Yes. OCR (optical character recognition) adds a searchable text layer to scanned PDFs without altering their visual appearance. The scanned images remain as-is; the tool adds an invisible text layer that enables text selection, search, and copying. After OCR, the PDF is visually identical to the original scan but fully searchable. Use LazyPDF's OCR tool to add a text layer in the browser without uploading to a third-party cloud service.