OCR: Difference between revisions
Jump to navigation
Jump to search
No edit summary |
|||
| Line 37: | Line 37: | ||
* [[Document_converter#PDF.E8.BD.89.E6.8F.9B.E6.88.90.E7.B4.94.E6.96.87.E5.AD.97 | PDF轉換成文字]] | * [[Document_converter#PDF.E8.BD.89.E6.8F.9B.E6.88.90.E7.B4.94.E6.96.87.E5.AD.97 | PDF轉換成文字]] | ||
== OCR scripts == | == OCR scripts & API == | ||
[https://github.com/ocropus/ocropy ocropus/ocropy: Python-based tools for document analysis and OCR] | [https://github.com/ocropus/ocropy ocropus/ocropy: Python-based tools for document analysis and OCR] | ||
| Line 49: | Line 49: | ||
* License: [https://github.com/tesseract-ocr/tesseract/blob/main/LICENSE Apache License 2.0]. PHP Fork: [https://github.com/thiagoalessio/tesseract-ocr-for-php/blob/main/MIT-LICENSE MIT License] | * License: [https://github.com/tesseract-ocr/tesseract/blob/main/LICENSE Apache License 2.0]. PHP Fork: [https://github.com/thiagoalessio/tesseract-ocr-for-php/blob/main/MIT-LICENSE MIT License] | ||
OCR {{Acronym| acronym=API| def=應用程式介面(英語:application programming interface)}} | OCR {{Acronym| acronym=API| def=應用程式介面(英語:application programming interface)}} | ||
Azure AI Vision/[https://azure.microsoft.com/zh-tw/services/cognitive-services/computer-vision/ 電腦視覺 | Microsoft Azure]: [https://docs.microsoft.com/zh-tw/azure/cognitive-services/Computer-vision/quickstarts-sdk/client-library?pivots=programming-language-rest-api&tabs=visual-studio 快速入門:光學字元辨識 (OCR) 用戶端程式庫或 REST API - Azure Cognitive Services | Microsoft Docs] | |||
* Support Language: 支援中文<ref>[https://docs.microsoft.com/zh-tw/azure/cognitive-services/computer-vision/language-support#optical-character-recognition-ocr 語言支援 - 電腦視覺 - Azure Cognitive Services | Microsoft Docs]</ref> | |||
[https://cloud.google.com/vision Vision AI | 透過機器學習技術取得圖片的深入分析結果 | Cloud Vision API | Google Cloud] | |||
* Support Language: 支援繁體中文 ({{kbd | key=zh-Hant}})<ref>[https://cloud.google.com/vision/docs/languages OCR Language Support | Cloud Vision API | Google Cloud]</ref> | |||
[https://ocr.space/OCRAPI Free OCR API] | |||
* Support Language: 支援繁體中文 ({{kbd | key=cht}}) | |||
[https://aws.amazon.com/tw/rekognition/?blog-cards.sort-by=item.additionalFields.createdDate&blog-cards.sort-order=desc Amazon Rekognition – 影片與影像 – AWS]: [https://docs.aws.amazon.com/rekognition/latest/dg/text-detection.html?pg=ln&sec=ft Detecting text - Amazon Rekognition]: | |||
* Support Language: {{exclaim}} 不支援中文<ref>[https://aws.amazon.com/tw/about-aws/whats-new/2021/11/amazon-rekognition-text-detection-7-new-languages-accuracy/ Amazon Rekognition text detection supports 7 new languages and improves accuracy] " Amazon Rekognition is designed to detect words in English, Arabic, Russian, German, French, Italian, Portuguese and Spanish."</ref><ref>[https://docs.aws.amazon.com/rekognition/latest/dg/text-detection.html Detecting text - Amazon Rekognition]</ref> {{access | date=2022-04-20}} | |||
Revision as of 11:55, 6 May 2024
OCR (optical character recognition), 光學字元辨識、圖片轉文字
OCR tools
圖片轉換成文字
Google DOCs: 上傳文件後,檔案名稱點選右鍵,「選擇開啟工具」 --> 「Google 文件」[1] 英文可以順利辨識、簡體中文遇到問題。
- Google Photos 將圖片上傳到 Google Photos,再點選「複製圖像中的文字」[Last visited: 2022-09-30]
- Free Online OCR - convert scanned PDF and images to Word, JPEG to Word 不註冊有前 5 頁的額度、註冊會員有總共 50 頁的免費額度 [Last visited: 2018-02-06]
- Google Keep內建辨識功能 將圖片內容轉文字輸出 | 社群網路 | 數位 | 聯合新聞網 [Last visited: 2018-08-27]
- 免費在線OCR - 在線圖片識別 - 免費OCR軟件 - 免費OCR轉換成Word - 在線文字識別轉換 - 圖片文字識別軟件 - 圖片轉文字 免費版限制每次可以批次上傳 3 頁、每天轉換 10 頁的 PDF 檔。 [Last visited: 2018-10-24]
- MS Office 2003 需額外安裝的Office 工具: Microsoft Office Document Imaging (你也可以輕鬆做文字辨識(OCR))
- (.pdf檔案轉為.mdi) PDF列印到 MS Office 2003 Document Imaging
- (.mdi檔案轉為word檔) MS Office 2003 Document Imaging(.mdi) -> 使用OCR辨識/傳送文字到Word
- Optical Character Recognition tool that extracts text from major image format - Online Image, PDF, Latex, OCR Converter 繁體中文辨識結果不佳。[Last visited: 2020-10-29]
- $ Free Online OCR - Convert JPEG, PNG, GIF, BMP, TIFF, PDF, DjVu to Text 可指定語言。線上網頁轉換,需要逐頁下載轉換後的檔案。使用 API 免費額度 20 頁[2]
講個秘訣:因為線上服務免費版會限制 PDF 檔案頁數,可使用切割軟體 PDF split and merge tools
PDF轉換成文字
OCR scripts & API
ocropus/ocropy: Python-based tools for document analysis and OCR
- Script Language: Python
- Support Language: < 10.
沒有提供中文 model 檔案 [Last visited: 2022-04-20] More on Models · ocropus-archive/DUP-ocropy Wiki - License: Apache License 2.0
tesseract-ocr/tesseract: Tesseract Open Source OCR Engine (main repository) [Last visited: 2022-06-19]
- Script Language: C++; Fork on PHP thiagoalessio/tesseract-ocr-for-php: A wrapper to work with Tesseract OCR inside PHP. [3],
- Support Language: 100+ contains Traditional Chinese 但是繁體中文辨識結果不佳。 [Last visited: 2022-04-20]. More on Languages/Scripts supported in different versions of Tesseract | tessdoc
- License: Apache License 2.0. PHP Fork: MIT License
OCR API
Azure AI Vision/電腦視覺 | Microsoft Azure: 快速入門:光學字元辨識 (OCR) 用戶端程式庫或 REST API - Azure Cognitive Services | Microsoft Docs
- Support Language: 支援中文[4]
Vision AI | 透過機器學習技術取得圖片的深入分析結果 | Cloud Vision API | Google Cloud
- Support Language: 支援繁體中文 (zh-Hant)[5]
- Support Language: 支援繁體中文 (cht)
Amazon Rekognition – 影片與影像 – AWS: Detecting text - Amazon Rekognition:
相關頁面
常用文件的解析度設定
常用用途的解析度設定
- 文字辨識 75~150 dpi
- 圖文交雜 100~150 dpi
- 圖檔(螢幕上觀看) 150~250 dpi
個人經驗: 簡報掃描的圖檔,如果是小字 300 dpi 可以辨識,但建議調整到 600 dpi。 - 圖檔(有列印需求) 300 dpi以上
- 名片 150~200 dpi
出處:PCHome 2005/8
References
- ↑ Uploading and exporting: Uploading image files with text to Google Docs、將 PDF 和相片檔案轉換為文字 - 電腦 - Google 雲端硬碟說明
- ↑ Free Online OCR - OCR API
- ↑ LANGUAGES AND SCRIPTS
- ↑ 語言支援 - 電腦視覺 - Azure Cognitive Services | Microsoft Docs
- ↑ OCR Language Support | Cloud Vision API | Google Cloud
- ↑ Amazon Rekognition text detection supports 7 new languages and improves accuracy " Amazon Rekognition is designed to detect words in English, Arabic, Russian, German, French, Italian, Portuguese and Spanish."
- ↑ Detecting text - Amazon Rekognition
相關文章
- (請問) 有沒有可以大量OCR(圖文轉換)的軟體 - 看板 EZsoft - 批踢踢實業坊 [Last visited: 2018-02-15]