Scanned books and articles
Pull selectable text out of scanned chapters for citation, search, and quote extraction.
Free PDF tool
Recognize text on scanned PDF pages in your browser. Download a plain-text transcript or a new PDF with an invisible text layer that makes the document searchable and copyable. Runs locally using Tesseract.js — no upload.
Upload a scanned PDF, choose language and output, then run OCR.
OCR runs in your browser using Tesseract.js. Language data downloads the first time you run it and is cached after.
Useful for copying into Word, sending to a translation tool, or feeding into search and analysis pipelines. One file with each page separated by a header line.
The page image stays as-is, with recognized text placed underneath as an invisible layer. PDF viewers can search, select, and copy the text while users still see the original scan.
Drop a scanned or image-based PDF onto the upload area.
Choose a language pack, plain text or searchable PDF, and an optional page range.
The first run downloads language data into your browser cache. Subsequent runs reuse it.
Pull selectable text out of scanned chapters for citation, search, and quote extraction.
Convert meeting whiteboard photos into searchable text so action items become findable.
Extract totals and reference numbers from receipts for expense reports and bookkeeping.
Make multi-page exhibits searchable so reviewers can jump to relevant passages.
Add a search layer to scanned letters, certificates, and other heritage documents.
Searchable PDFs work better with screen readers and assistive technology than pure image PDFs.
No. OCR runs entirely in your browser using Tesseract.js. The PDF and the recognized text stay on your device.
Tesseract downloads language data on first use. The browser caches it so subsequent runs are much faster.
Accuracy depends on scan quality, font, and language mix. Sharp scans of standard print typically yield 95%+ accuracy. Cursive, low-contrast, or poorly-scanned pages perform worse.
A PDF where the visible page image is paired with an invisible text layer in the same position. Users see the scan, but the document is searchable and the text can be selected and copied. The invisible layer is drawn with full transparency, which works in Chrome, Firefox, Adobe Reader, and Preview. A small number of older or minimal PDF viewers may not index it.
Match the dominant script in the document. Multi-language entries cover English plus the second script, which usually improves accuracy on mixed-language pages.
Plain text is downloaded as a .txt file. Searchable PDFs can be uploaded to PDFtoLink for an instant shareable URL with optional password protection.