OCR: Difference between revisions

Revision as of 11:07, 20 April 2022

OCR (optical character recognition), 光學字元辨識

OCR tools

Google DOCs: 上傳文件後，檔案名稱點選右鍵，「選擇開啟工具」 --> 「Google 文件」^[1] 英文可以順利辨識、簡體中文遇到問題。
- 教學: 不要浪費時間 key 資料啦！拍照上傳 Google 雲端，按個右鍵就自動幫你轉文字 | TechOrange

免費通話、免費傳訊的應用程式「LINE」
- 【教學】LINE 透過 OCR 文字辨識功能，直接讓圖片轉成文字技巧 - 瘋先生

Free Online OCR - convert scanned PDF and images to Word, JPEG to Word 不註冊有前 5 頁的額度、註冊會員有總共 50 頁的免費額度 [Last visited: 2018-02-06]

Google Keep內建辨識功能將圖片內容轉文字輸出 | 社群網路 | 數位 | 聯合新聞網 [Last visited: 2018-08-27]

免費在線OCR - 在線圖片識別 - 免費OCR軟件 - 免費OCR轉換成Word - 在線文字識別轉換 - 圖片文字識別軟件 - 圖片轉文字免費版限制每次可以批次上傳 3 頁、每天轉換 10 頁的 PDF 檔。 [Last visited: 2018-10-24]

MS Office 2003 需額外安裝的Office 工具: Microsoft Office Document Imaging (你也可以輕鬆做文字辨識(OCR))
1. (.pdf檔案轉為.mdi) PDF列印到 MS Office 2003 Document Imaging
2. (.mdi檔案轉為word檔) MS Office 2003 Document Imaging(.mdi) -> 使用OCR辨識/傳送文字到Word

Microsoft Office Document Imaging 中文簡體OCR辨識引擎

Optical Character Recognition tool that extracts text from major image format - Online Image, PDF, Latex, OCR Converter 繁體中文辨識結果不佳。[Last visited: 2020-10-29]

$ Free Online OCR - Convert JPEG, PNG, GIF, BMP, TIFF, PDF, DjVu to Text 可指定語言。線上網頁轉換，需要逐頁下載轉換後的檔案。使用 API 免費額度 20 頁^[2]

Best Free OCR API, Online OCR, Searchable PDF - Fresh 2022 On-Premise OCR Software 可指定語言

$ Vision AI | 透過機器學習技術取得圖片的深入分析結果 | Cloud Vision API | Google Cloud

講個秘訣：因為線上服務免費版會限制 PDF 檔案頁數，可使用切割軟體 PDF split and merge tools

OCR scripts

thiagoalessio/tesseract-ocr-for-php: A wrapper to work with Tesseract OCR inside PHP. 繁體中文辨識結果不佳。[Last visited: 2022-04-20]
ocropus/ocropy: Python-based tools for document analysis and OCR

常用文件的解析度設定

常用用途的解析度設定

文字辨識 75~150 dpi
圖文交雜 100~150 dpi
圖檔(螢幕上觀看) 150~250 dpi 個人經驗: 簡報掃描的圖檔，如果是小字 300 dpi 可以辨識，但建議調整到 600 dpi。
圖檔(有列印需求) 300 dpi以上
名片 150~200 dpi

出處：PCHome 2005/8

References

(請問) 有沒有可以大量OCR(圖文轉換)的軟體 - 看板 EZsoft - 批踢踢實業坊 [Last visited: 2018-02-15]

[1] Uploading and exporting: Uploading image files with text to Google Docs、將 PDF 和相片檔案轉換為文字 - 電腦 - Google 雲端硬碟說明

[2] Free Online OCR - OCR API

[1]

[2]

@@ Line 5: / Line 5: @@
 * {{Gd}} [https://docs.google.com/ Google DOCs]: 上傳文件後，檔案名稱點選右鍵，「選擇開啟工具」 --> 「Google 文件」<ref>[http://docs.google.com/support/bin/answer.py?answer=176692&hl=en Uploading and exporting: Uploading image files with text to Google Docs]、[https://support.google.com/drive/answer/176692?hl=zh-Hant&visit_id=1-636534874969716350-2978233269&rd=1 將 PDF 和相片檔案轉換為文字 - 電腦 - Google 雲端硬碟說明]</ref> 英文可以順利辨識、簡體中文遇到問題。
 ** 教學: [https://buzzorange.com/techorange/2019/12/09/convert-picture-into-word/ 不要浪費時間 key 資料啦！拍照上傳 Google 雲端，按個右鍵就自動幫你轉文字 | TechOrange]
 * {{Gd}} [https://line.me/zh-hant/ 免費通話、免費傳訊的應用程式「LINE」]
@@ Line 30: / Line 29: @@
 * ''$'' [https://cloud.google.com/vision/?hl=zh-tw Vision AI | 透過機器學習技術取得圖片的深入分析結果  |  Cloud Vision API  |  Google Cloud]
-因為線上服務免費版會限制 PDF 檔案頁數，可使用切割軟體 [[PDF split and merge tools]]
+: [[Image:Owl icon.jpg]] 講個秘訣：因為線上服務免費版會限制 PDF 檔案頁數，可使用切割軟體 [[PDF split and merge tools]]
+== OCR scripts ==
+* [https://github.com/thiagoalessio/tesseract-ocr-for-php thiagoalessio/tesseract-ocr-for-php: A wrapper to work with Tesseract OCR inside PHP.] 繁體中文辨識結果不佳。{{access | date=2022-04-20}}
+* [https://github.com/ocropus/ocropy ocropus/ocropy: Python-based tools for document analysis and OCR]
 == 常用文件的解析度設定 ==

OCR: Difference between revisions

Revision as of 11:07, 20 April 2022

Contents

OCR tools

OCR scripts

常用文件的解析度設定

References

Navigation menu

OCR: Difference between revisions

Revision as of 11:07, 20 April 2022

OCR tools

OCR scripts

常用文件的解析度設定

References

Navigation menu

Search