==== Convert PDF to plain text (TXT) ====
* {{Gd}} [http://pdftextonline.com/ PDF Text Extraction In Your Browser - PDFTextOnline]: online service that extracts the text from a PDF file. File size is limited to 10 MB. ([http://www.box.net/shared/pyve05ss4c Chinese text test OK], tested 2008-12-07.)
* [https://www.ghostscript.com/ Ghostscript] on {{Win}} & {{Linux}}<ref>[https://apple.stackexchange.com/questions/2487/how-to-convert-a-pdf-file-into-a-text-file How to convert a pdf file into a text file? - Ask Different]</ref>
* [http://www.convertpdftotext.net/ Convert pdf to Text - Convert pdf to txt - Convert online pdf to Text] ([http://www.box.net/shared/pyve05ss4c Chinese text comes out garbled], tested 2008-12-07.)
* [http://www.zamzar.com/ Zamzar - Free online file conversion]: online service, PDF to TXT ([http://www.box.net/shared/pyve05ss4c Chinese text comes out garbled], tested 2008-12-07.)
* ([[OCR]]) [https://pdfcandy.com/tw/pdf-ocr.html PDF to Text - Free Online OCR Conversion Tool]: the free version is limited to one file per hour, and file size is also limited. {{access | date=2022-10-07}}

[https://www.xpdfreader.com/pdftotext-man.html pdftotext] ([[Convert pdf to txt|Quick Start]])
* On {{Win}}: Part of [https://www.xpdfreader.com/index.html XpdfReader], dual licensed under GPL v2 and GPL v3<ref>[https://www.xpdfreader.com/opensource.html Xpdf Open Source]</ref>
* On {{Mac}} & {{Linux}}: Part of [https://poppler.freedesktop.org/ Poppler]<ref>[https://brewinstall.org/install-pdftotext-mac-osx/ Install pdftotext on Mac OSX - Brew Cask | BrewInstall]</ref> (GPL v2 or later), historically derived from Xpdf ([https://en.wikipedia.org/wiki/Pdftotext Wikipedia]) {{access | date=2023-06-02}} {{Gd}}
* Usage: {{kbd | key=<nowiki>pdftotext [options] [PDF-file [text-file]]</nowiki>}} e.g. {{kbd | key=<nowiki>pdftotext -enc UTF-8 example.pdf example.txt</nowiki>}}

[https://github.com/py-pdf/pdfly py-pdf/pdfly: CLI tool to extract (meta)data from PDF and manipulate PDF files] "A {{Acronym| acronym=CLI| def=Command-line interface}} application that uses [https://github.com/py-pdf/pdfly pypdf] to interact with PDFs."
* License: [https://github.com/py-pdf/pdfly/blob/main/LICENSE BSD 3-Clause License] {{Gd}}
* Usage: {{kbd | key=<nowiki>pdfly extract-text PDF-file > text-file</nowiki>}}
* Known issues: On Windows systems with a Chinese locale (code page cp950), text extraction may fail with a Unicode encoding error when the PDF contains characters that cp950 cannot represent, such as '\u25aa' (BLACK SMALL SQUARE). This is a limitation of the default Windows code page, not of the PDF itself.<ref>[https://stackoverflow.com/questions/50933194/how-do-i-set-the-pythonutf8-environment-variable-to-enable-utf-8-encoding-by-def How do I set the PYTHONUTF8 environment variable to enable UTF-8 encoding by default in Python? - Stack Overflow]</ref>
* Requirement: [https://www.python.org/downloads/ Python]

[https://github.com/ArtifexSoftware/mupdf mutool] (part of [https://mupdf.com/core MuPDF])
* License: [https://github.com/ArtifexSoftware/mupdf?tab=AGPL-3.0-1-ov-file GNU Affero General Public License]
* Usage: {{kbd | key=<nowiki>mutool draw -F txt -o text-file PDF-file</nowiki>}}
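The cp950 issue noted for pdfly can be illustrated with a short Python sketch. It is not part of pdfly; the helper names <code>unencodable_chars</code>, <code>utf8_env</code> and <code>extract_text_utf8</code> are hypothetical, and the sketch assumes that pdfly writes the extracted text to standard output (as the redirect in the usage line suggests) and that setting <code>PYTHONUTF8=1</code>, the approach discussed in the Stack Overflow reference, is enough to avoid the error.

```python
import os
import subprocess
from pathlib import Path


def unencodable_chars(text: str, encoding: str) -> list[str]:
    """Return the distinct characters in `text` that `encoding` cannot represent."""
    bad = []
    for ch in text:
        try:
            ch.encode(encoding)
        except UnicodeEncodeError:
            if ch not in bad:
                bad.append(ch)
    return bad


def utf8_env() -> dict[str, str]:
    """Copy the current environment and enable Python's UTF-8 mode."""
    env = dict(os.environ)
    env["PYTHONUTF8"] = "1"
    return env


def extract_text_utf8(pdf_path: str, txt_path: str) -> None:
    """Hypothetical helper: run `pdfly extract-text` in UTF-8 mode.

    Assumes the `pdfly` command is installed and on PATH.
    """
    result = subprocess.run(
        ["pdfly", "extract-text", pdf_path],
        env=utf8_env(),
        capture_output=True,
        check=True,
    )
    # Store the bytes from stdout unchanged, so no locale re-encoding happens.
    Path(txt_path).write_bytes(result.stdout)
```

For example, <code>unencodable_chars(text, "cp950")</code> can be used to check which characters of an extracted string would trigger the error on an affected system, and <code>extract_text_utf8("example.pdf", "example.txt")</code> would run the extraction with UTF-8 mode enabled.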