OCR: Difference between revisions

From LemonWiki共筆
Line 37: Line 37:
* [[Document_converter#PDF.E8.BD.89.E6.8F.9B.E6.88.90.E7.B4.94.E6.96.87.E5.AD.97 | Convert PDF to text]]


== OCR scripts ==
== OCR scripts & API ==


[https://github.com/ocropus/ocropy ocropus/ocropy: Python-based tools for document analysis and OCR]  
Line 49: Line 49:
* License: [https://github.com/tesseract-ocr/tesseract/blob/main/LICENSE Apache License 2.0]. PHP Fork: [https://github.com/thiagoalessio/tesseract-ocr-for-php/blob/main/MIT-LICENSE MIT License]


== OCR API ==
OCR {{Acronym| acronym=API| def=application programming interface}}
* Azure AI Vision/[https://azure.microsoft.com/zh-tw/services/cognitive-services/computer-vision/ Computer Vision | Microsoft Azure]: [https://docs.microsoft.com/zh-tw/azure/cognitive-services/Computer-vision/quickstarts-sdk/client-library?pivots=programming-language-rest-api&tabs=visual-studio Quickstart: Optical character recognition (OCR) client library or REST API - Azure Cognitive Services | Microsoft Docs] supports Chinese<ref>[https://docs.microsoft.com/zh-tw/azure/cognitive-services/computer-vision/language-support#optical-character-recognition-ocr Language support - Computer Vision - Azure Cognitive Services | Microsoft Docs]</ref>

* [https://cloud.google.com/vision Vision AI | Derive in-depth insights from images with machine learning  |  Cloud Vision API  |  Google Cloud]: supports Traditional Chinese ({{kbd | key=zh-Hant}})<ref>[https://cloud.google.com/vision/docs/languages OCR Language Support  |  Cloud Vision API  |  Google Cloud]</ref>
Azure AI Vision/[https://azure.microsoft.com/zh-tw/services/cognitive-services/computer-vision/ Computer Vision | Microsoft Azure]: [https://docs.microsoft.com/zh-tw/azure/cognitive-services/Computer-vision/quickstarts-sdk/client-library?pivots=programming-language-rest-api&tabs=visual-studio Quickstart: Optical character recognition (OCR) client library or REST API - Azure Cognitive Services | Microsoft Docs]
* [https://ocr.space/OCRAPI Free OCR API] supports Traditional Chinese ({{kbd | key=cht}})
* Supported languages: supports Chinese<ref>[https://docs.microsoft.com/zh-tw/azure/cognitive-services/computer-vision/language-support#optical-character-recognition-ocr Language support - Computer Vision - Azure Cognitive Services | Microsoft Docs]</ref>
* [https://aws.amazon.com/tw/rekognition/?blog-cards.sort-by=item.additionalFields.createdDate&blog-cards.sort-order=desc Amazon Rekognition – Video and Image – AWS]: [https://docs.aws.amazon.com/rekognition/latest/dg/text-detection.html?pg=ln&sec=ft Detecting text - Amazon Rekognition]: {{exclaim}} Chinese is not supported<ref>[https://aws.amazon.com/tw/about-aws/whats-new/2021/11/amazon-rekognition-text-detection-7-new-languages-accuracy/ Amazon Rekognition text detection supports 7 new languages and improves accuracy] " Amazon Rekognition is designed to detect words in English, Arabic, Russian, German, French, Italian, Portuguese and Spanish."</ref><ref>[https://docs.aws.amazon.com/rekognition/latest/dg/text-detection.html Detecting text - Amazon Rekognition]</ref> {{access | date=2022-04-20}}

[https://cloud.google.com/vision Vision AI | Derive in-depth insights from images with machine learning  |  Cloud Vision API  |  Google Cloud]
* Supported languages: supports Traditional Chinese ({{kbd | key=zh-Hant}})<ref>[https://cloud.google.com/vision/docs/languages OCR Language Support  |  Cloud Vision API  |  Google Cloud]</ref>

[https://ocr.space/OCRAPI Free OCR API]
* Supported languages: supports Traditional Chinese ({{kbd | key=cht}})

[https://aws.amazon.com/tw/rekognition/?blog-cards.sort-by=item.additionalFields.createdDate&blog-cards.sort-order=desc Amazon Rekognition – Video and Image – AWS]: [https://docs.aws.amazon.com/rekognition/latest/dg/text-detection.html?pg=ln&sec=ft Detecting text - Amazon Rekognition]:
* Supported languages: {{exclaim}} Chinese is not supported<ref>[https://aws.amazon.com/tw/about-aws/whats-new/2021/11/amazon-rekognition-text-detection-7-new-languages-accuracy/ Amazon Rekognition text detection supports 7 new languages and improves accuracy] " Amazon Rekognition is designed to detect words in English, Arabic, Russian, German, French, Italian, Portuguese and Spanish."</ref><ref>[https://docs.aws.amazon.com/rekognition/latest/dg/text-detection.html Detecting text - Amazon Rekognition]</ref> {{access | date=2022-04-20}}





Revision as of 11:55, 6 May 2024

OCR (optical character recognition), also described as image-to-text conversion


OCR tools

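Converting images to text

  • Google Photos: upload the image to Google Photos, then click "Copy text from image" [Last visited: 2022-09-30]
  • MS Office 2003: requires a separately installed Office tool, Microsoft Office Document Imaging (see "You can easily do text recognition (OCR) too")
    1. (Convert .pdf to .mdi) Print the PDF to MS Office 2003 Document Imaging
    2. (Convert .mdi to a Word file) In MS Office 2003 Document Imaging (.mdi), use the OCR recognition / send text to Word command
Tip: the free tier of online services limits the number of pages per PDF file, so split the file first with a splitting tool (see PDF split and merge tools).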
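
The page-limit workaround in the tip above comes down to cutting a long PDF into consecutive page ranges and uploading each piece separately. The following is a minimal sketch of that arithmetic only. The limit of 10 pages is a made-up example value (the real limit depends on the service), and `chunk_page_ranges` is a hypothetical helper, not part of any PDF tool:

```python
def chunk_page_ranges(total_pages: int, max_pages: int) -> list[tuple[int, int]]:
    """Return 1-based inclusive (first, last) page ranges of at most max_pages each."""
    if total_pages < 0:
        raise ValueError("total_pages must not be negative")
    if max_pages < 1:
        raise ValueError("max_pages must be at least 1")
    return [
        (first, min(first + max_pages - 1, total_pages))
        for first in range(1, total_pages + 1, max_pages)
    ]


if __name__ == "__main__":
    # Assumed example: a 23-page PDF and a service that accepts 10 pages per file.
    for first, last in chunk_page_ranges(23, 10):
        print(f"split pages {first}-{last} into a separate file")
```

The resulting ranges can then be entered into whichever split tool is used.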

Converting PDF to text

OCR scripts & API

ocropus/ocropy: Python-based tools for document analysis and OCR

tesseract-ocr/tesseract: Tesseract Open Source OCR Engine (main repository) [Last visited: 2022-06-19]
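
As an illustration of scripting Tesseract, the sketch below calls its command-line program from Python. It assumes the `tesseract` executable is on the PATH and that the Traditional Chinese language data (`chi_tra`) is installed; the file names are placeholders, and `build_tesseract_command` and `run_ocr` are hypothetical helper names:

```python
import shutil
import subprocess


def build_tesseract_command(image_path: str, output_base: str, lang: str = "chi_tra") -> list[str]:
    """Build the argument list for: tesseract <image> <outputbase> -l <lang>."""
    return ["tesseract", image_path, output_base, "-l", lang]


def run_ocr(image_path: str, output_base: str, lang: str = "chi_tra") -> str:
    """Run Tesseract and return the path of the text file it writes."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    subprocess.run(build_tesseract_command(image_path, output_base, lang), check=True)
    # Tesseract appends ".txt" to the output base name for plain-text output.
    return output_base + ".txt"


if __name__ == "__main__":
    # Placeholder file names; nothing is executed here, the command is only shown.
    print(" ".join(build_tesseract_command("scan.png", "scan_text")))
```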

OCR API

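Azure AI Vision/Computer Vision | Microsoft Azure: Quickstart: Optical character recognition (OCR) client library or REST API - Azure Cognitive Services | Microsoft Docs

  • Supported languages: supports Chinese[4]

Vision AI | Derive in-depth insights from images with machine learning  |  Cloud Vision API  |  Google Cloud

  • Supported languages: supports Traditional Chinese (zh-Hant)[5]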

Free OCR API

  • Supported languages: supports Traditional Chinese (cht)
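
The `cht` code above is passed as the `language` field of the request. The sketch below shows one way such a request might be assembled with the Python standard library; it only builds the request and parses a response, and does not send anything. The endpoint and the field names (`apikey`, `url`, `language`, `ParsedResults`, `ParsedText`) reflect my understanding of the OCR.space API and should be checked against its documentation. The API key and image URL are placeholders:

```python
import json
import urllib.parse
import urllib.request

# Assumed endpoint; verify against the OCR.space API documentation.
OCR_SPACE_ENDPOINT = "https://api.ocr.space/parse/image"


def build_ocr_space_request(image_url: str, api_key: str, language: str = "cht") -> urllib.request.Request:
    """Build (but do not send) a form-encoded POST request for a remote image."""
    form = urllib.parse.urlencode(
        {"apikey": api_key, "url": image_url, "language": language}
    ).encode("ascii")
    return urllib.request.Request(OCR_SPACE_ENDPOINT, data=form, method="POST")


def extract_text(response_body: str) -> str:
    """Join the ParsedText of every entry in ParsedResults (assumed response shape)."""
    payload = json.loads(response_body)
    results = payload.get("ParsedResults") or []
    return "\n".join(result.get("ParsedText", "") for result in results)


if __name__ == "__main__":
    request = build_ocr_space_request("https://example.com/scan.png", "YOUR_API_KEY")
    print(request.get_method(), request.full_url)
    # Sending would be: urllib.request.urlopen(request).read().decode("utf-8")
```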

Amazon Rekognition – Video and Image – AWS: Detecting text - Amazon Rekognition:

  • Supported languages: Chinese is not supported[6][7] [Last visited: 2022-04-20]


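Related pages

Resolution settings for common documents

Resolution settings for common uses

  • Text recognition: 75-150 dpi
  • Mixed text and images: 100-150 dpi
  • Images (viewed on screen): 150-250 dpi. Personal experience: for scanned presentation slides with small text, 300 dpi is enough for recognition, but 600 dpi is recommended.
  • Images (to be printed): 300 dpi or higher
  • Business cards: 150-200 dpi

Source: PCHome, 2005/8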
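
To see what these settings mean for file size, the pixel dimensions of a scan follow from the paper size and the dpi: pixels = millimetres / 25.4 × dpi. The sketch below applies that formula; the A4 page size (210 × 297 mm) is an example assumption and `pixel_size` is a hypothetical helper name:

```python
MM_PER_INCH = 25.4


def pixel_size(width_mm: float, height_mm: float, dpi: int) -> tuple[int, int]:
    """Return the (width, height) in pixels of a page scanned at the given dpi."""
    if dpi <= 0:
        raise ValueError("dpi must be positive")
    return (
        round(width_mm / MM_PER_INCH * dpi),
        round(height_mm / MM_PER_INCH * dpi),
    )


if __name__ == "__main__":
    # Assumed example page: A4, 210 mm x 297 mm.
    for dpi in (150, 300, 600):
        width, height = pixel_size(210, 297, dpi)
        print(f"{dpi} dpi -> {width} x {height} px")
```

Doubling the dpi from 300 to 600 roughly quadruples the pixel count, which is the trade-off behind the small-text recommendation above.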

References

Related articles