Convert PDF to Text: Difference between revisions

From LemonWiki共筆
Jump to navigation Jump to search
No edit summary
m (Redirected page to Convert pdf to txt)
Tag: New redirect
 
Line 1: Line 1:
Converting PDF to Plain Text (TXT)
#REDIRECT [[Convert pdf to txt]]
 
{{LanguageSwitcher | content = [[Convert PDF to Text | EN]], [[Document_converter#PDF_轉換成純文字_(TXT) | 漢字]]  }}
 
* {{Gd}} utility: [https://www.xpdfreader.com/pdftotext-man.html pdftotext] on {{Win}}, {{Mac}}<ref>[https://brewinstall.org/install-pdftotext-mac-osx/ Install pdftotext on Mac OSX - Brew Cask | BrewInstall]</ref> & {{Linux}} ([https://en.wikipedia.org/wiki/Pdftotext pdftotext - Wikipedia | Wikipedia]) {{access | date=2023-06-02}}
** Usage: {{kbd | key=<nowiki>pdftotext [options] [PDF-file [text-file]]</nowiki>}}
** Note: [[Install pdftotext on windows]]
 
* {{Gd}} [http://pdftextonline.com/ PDF Text Extraction In Your Browser - PDFTextOnline] online service for extracting text from PDF files. (File size cannot exceed 10M) ([http://www.box.net/shared/pyve05ss4c Chinese test successful], test date: 2008-12-07.)
 
* [https://www.ghostscript.com/ Ghostscript] on {{Win}} & {{Linux}}<ref>[https://apple.stackexchange.com/questions/2487/how-to-convert-a-pdf-file-into-a-text-file How to convert a pdf file into a text file? - Ask Different]</ref>
 
* [http://www.convertpdftotext.net/ Convert pdf to Text - Convert pdf to txt - Convert online pdf to Text] ([http://www.box.net/shared/pyve05ss4c Chinese text becomes garbled], test date: 2008-12-07.)
 
* [http://www.zamzar.com/ Zamzar - Free online file conversion] online service: PDF to TXT conversion ([http://www.box.net/shared/pyve05ss4c Chinese text becomes garbled], test date: 2008-12-07.)
 
* [https://pdfcandy.com/tw/pdf-ocr.html PDF to Text - Free Online OCR Conversion Tool] Free version limitations: one file conversion per hour, file size is also limited {{access | date=2022-10-07}}
 
* [https://github.com/py-pdf/pdfly py-pdf/pdfly: CLI tool to extract (meta)data from PDF and manipulate PDF files] "A {{Acronym| acronym=CLI| def=Command-line interface}} application that uses [https://github.com/py-pdf/pdfly pypdf] to interact with PDFs."
** Usage: {{kbd | key=<nowiki>pdfly extract-text PDF-file > text-file</nowiki>}}
** Requirement: [https://www.python.org/downloads/ Python]
 
== Related pages ==
* [[OCR]] (Optical character recognition)
 
== References ==
<references />
 
[[Category: Tool]]

Latest revision as of 10:29, 6 January 2026

Redirect to: