PDF to Text Extractor
Pull all text out of a PDF in one shot and save as .txt. Multilingual fonts supported; everything happens in your browser.
How to Use
Drag or click to load. Works with multi-page PDFs (large files may take a few seconds).
Toggle page separators ("--- Page N ---") on or off. Off = continuous text.
See the extracted text on screen. Hit "Download .txt" or "Copy" to grab it.
FAQ
Does it handle scanned PDFs?
No. The tool reads the PDF's text layer. Scanned paper documents are pixel images without a text layer, so the output will be empty or garbled. Use the OCR tool for scans.
Korean / Japanese characters look broken
If the PDF embeds proper Unicode-mapped fonts, output is clean. Some PDFs rasterize CJK glyphs or use custom encodings — for those, use the OCR tool.
Does it preserve order?
Yes, page-by-page in order. Within a page, text is ordered by visual position (top-left → bottom-right).
Encrypted PDFs?
Password-protected PDFs must be unlocked first; this tool does not handle passwords.
Is anything sent to a server?
No. PDF.js parses the file locally; nothing leaves your browser.