Text to speech: Difference between revisions
mNo edit summary |
|||
(31 intermediate revisions by one other user not shown) | |||
Line 1: | Line 1: | ||
Text to speech services | Text to speech services | ||
__TOC__ | |||
== Text to speech 工具 == | |||
[https://getpocket.com/ Pocket] on {{Android}} or iOS [https://help.getpocket.com/article/1081-listening-to-articles-in-pocket-with-text-to-speech Listening to Articles in Pocket with Text-to-Speech - Pocket Support] | [https://getpocket.com/ Pocket] on {{Android}} or iOS [https://help.getpocket.com/article/1081-listening-to-articles-in-pocket-with-text-to-speech Listening to Articles in Pocket with Text-to-Speech - Pocket Support] | ||
* Object: Saved web page {{exclaim}} the | * Object: Saved web page {{exclaim}} the content may not be saved completed but truncated. | ||
[https://translate.google.com/ Google 翻譯] | [https://translate.google.com/ Google 翻譯] | ||
* Object: Input text | * Object: Input text | ||
* Speed of speech: {{Gd}} Allow to slow down the speed of speech ({{kbd | key=<nowiki>ttsspeed=0.24</nowiki>}}) | |||
[https://play.google.com/store/apps/details?id=com.google.android.tts Google 文字轉語音 - Google Play Android 應用程式] on {{Android}} | [https://play.google.com/store/apps/details?id=com.google.android.tts Google 文字轉語音 - Google Play Android 應用程式] on {{Android}} | ||
* Object: Some books on Google play | * Object: Some books on Google play | ||
[https://viewer-ebook.books.com.tw/viewer/index.html?readlist=all 博客來電子書櫃] on {{Android}} | [https://viewer-ebook.books.com.tw/viewer/index.html?readlist=all 博客來電子書櫃] on {{Android}} 使用 Google 文字轉語音的服務 | ||
* Object: books | * Object: books | ||
[[Category:Tool]] | Kindle 部分版本支援 [https://www.amazon.com/gp/help/customer/display.html?nodeId=201286790 Amazon.com Help: Features Available in Kindle Books] | ||
* 物件: 書籍 | |||
[http://www.iq-t.com/PRODUCTS/textmp3_01.asp 文字MP3] on {{Win}} / [http://www.iq-t.com/SYSCOM/com01_01.asp TTS文字轉語音引擎] | |||
* 物件: 輸入文字 或 Excel | |||
[https://support.office.com/zh-tw/article/%E9%85%8D%E5%90%88%E5%A4%9A%E8%AA%9E%E7%B3%BB-tts-%E4%BD%BF%E7%94%A8%E8%AA%9E%E9%9F%B3%E5%8A%9F%E8%83%BD-e522a4f2-37cb-492b-be6a-8997d23dfe70 配合多語系 TTS 使用語音功能 - Office 支援] | |||
* 物件: 「OneNote、Outlook、PowerPoint 及 Word」 | |||
[http://tts.itri.org.tw/index.php 工研院文字轉語音Web服務] | |||
* 物件: 網頁 | |||
* 語言: 中、英文、台語 | |||
[https://azure.microsoft.com/zh-tw/services/cognitive-services/speech/ Bing Speech API - 語音辨識 | Microsoft Azure] | |||
* Object: Input text | |||
[https://cloud.google.com/text-to-speech/ Cloud Text-to-Speech - Speech Synthesis | Google Cloud] | |||
* Object: Input text | |||
[http://www.wizzardsoftware.com/text-to-speech-sdk.php Wizzard Speech I ATT Natural Voices SDK] {{access | date = 2018-09-04}} | |||
* Object: Input text | |||
* Language: English, Spanish | |||
[http://vozme.com/index.php?lang=en vozMe - From text to speech (speech synthesis)] {{access | date = 2018-09-04}} | |||
* Object: Input text | |||
* Language: English, Español, Italiano, Hindi, Português, Català | |||
[https://chrome.google.com/webstore/detail/read-aloud-a-text-to-spee/hdhinadidafjejdhmfkjgnolgimiaplp?hl=zh-TW Read Aloud: 文字語音朗讀助理 - Chrome 線上應用程式商店] | |||
* 物件: 網頁 | |||
* 語言: 中 | |||
[https://www.inside.com.tw/article/19966-howger-generator How 哥產生器!開發者整理素材,讓 HowHow 幫你講任何中文句子 - INSIDE] | |||
* 物件: 文字 | |||
* 語言: 中 | |||
== Speech to text 工具 == | |||
[https://cloud.google.com/speech/?hl=zh-tw Speech API - 語音辨識 | Google Cloud] 「語音轉文字採用機器學習技術」,免費版語音辨識的額度 60 分鐘,詳 [https://cloud.google.com/speech-to-text/pricing 定價 | Cloud Speech API Documentation | Google Cloud]。 {{access | date = 2018-09-04}} | |||
* Object: microphone & audio file (For audio file which longer than 1 minute, upload files to Google cloud storage. | |||
* Language: 120 languages <ref>[https://cloud.google.com/speech-to-text/docs/languages?hl=zh-tw Language Support | Cloud Speech-to-Text API | Google Cloud]</ref> | |||
* Sample code: | |||
* Related: [[Troubleshooting of Google cloud speech to text]]) | |||
[https://azure.microsoft.com/zh-tw/services/cognitive-services/speech/ Bing 語音 API - 語音辨識軟體 | Microsoft Azure] | |||
* Object: Audio file. Format: wav & ogg<ref>[https://docs.microsoft.com/zh-tw/azure/cognitive-services/speech-service/rest-speech-to-text 語音轉換文字 API 參考(REST)-語音服務 - Azure Cognitive Services | Microsoft Docs]</ref> | |||
* Language: Traditional Chinese, Simplified Chinese & English and more on the list<ref>[https://docs.microsoft.com/zh-tw/azure/cognitive-services/speech-service/language-support#speech-to-text 語言支援-語音服務 - Azure Cognitive Services | Microsoft Docs]</ref> | |||
* Sample code: [https://github.com/Azure-Samples/SpeechToText-REST Azure-Samples/SpeechToText-REST: REST Samples of Speech To Text API] | |||
* Related: | |||
[https://tw.olami.ai/open/website/apiandsolution/api_solution OLAMI 中文語音辨識 API|歐拉蜜人工智慧開放平台(威盛電子)] {{access | date = 2018-09-05}} | |||
* Object: Audio file. Format: wav & speex <ref>[https://tw.olami.ai/wiki/?mp=api_asr&content=api_asr1.html 文件中心 - OLAMI - 歐拉蜜人工智慧開放平台]</ref> | |||
* Language: Traditional Chinese & Simplified Chinese <ref>[https://github.com/olami-developers/olami-api-quickstart-curl-samples/tree/master/cloud-speech-recognition olami-api-quickstart-curl-samples/cloud-speech-recognition at master · olami-developers/olami-api-quickstart-curl-samples]</ref> | |||
* Sample code: [https://github.com/olami-developers/olami-api-quickstart-curl-samples/tree/master/cloud-speech-recognition olami-developers/olami-api-quickstart-curl-samples] | |||
* Related: [[Troubleshooting of Olami speech to text]] | |||
影片要產生文字,可利用 youtube 的 [https://support.google.com/youtube/answer/6373554?hl=en Use automatic captioning - YouTube Help],約需要半天時間 {{access | date = 2018-09-04}} 教學: [https://www.techbang.com/posts/2107 YouTube超佛心,自動幫你加入字幕! | T客邦] | |||
* Object: Video | |||
* Language: | |||
* Sample code: | |||
* Related: | |||
[https://www.xfyun.cn/doccenter/asr 语音识别 - 讯飞开放平台] {{access | date=2018-09-06}} | |||
* Object: speex audio file less than 1 minute <ref>[https://doc.xfyun.cn/rest_api/%E8%AF%AD%E9%9F%B3%E5%90%AC%E5%86%99.html 语音听写 · 科大讯飞REST_API开发指南]</ref> | |||
* Language: 中文(普通话)、英文、中文(粤语)、中文(四川话) | |||
* Sample code: | |||
* Related: | |||
[https://aws.amazon.com/tw/transcribe/ Amazon Transcribe – 自動語音辨識 – AWS] (API documentation: [https://docs.aws.amazon.com/transcribe/latest/dg/what-is-transcribe.html What Is Amazon Transcribe? - Amazon Transcribe]) {{access | date=2018-09-05}} | |||
* Object: Audio file (Stored in S3 bucket). "Valid formats for the audio are mp3, mp4, wav and flac. <ref>[https://docs.aws.amazon.com/transcribe/latest/dg/API_StartTranscriptionJob.html StartTranscriptionJob - Amazon Transcribe] For best results, use a lossless format, such as FLAC or WAV with PCM 16-bit encoding.Your audio input can be sampled at any rate between 8000 and 48000 Hz. We suggest that you use 8000 Hz for low-quality audio and 16000 Hz for high-quality audio.</ref>" | |||
* Language: English, Spanish | |||
* Sample code: | |||
* Related: | |||
[https://pulipulichen.github.io/HTML5-Speech-to-Text/ Web Speech to Text] 教學: [https://www.playpcesor.com/2019/12/Web-Speech-to-Text.html 免費!中文影片語音轉文字字幕,支援超大影片與長時間錄音] | |||
* 物件: 電腦影像、聲音、YouTube 網址 | |||
* 語言: 中文、英文、日文、韓文 | |||
== References == | |||
<References /> | |||
[[Category:Tool]] [[Category:NLP]] |
Revision as of 14:01, 5 June 2020
Text to speech services
Text to speech 工具
Pocket on Android or iOS Listening to Articles in Pocket with Text-to-Speech - Pocket Support
- Object: Saved web page the content may not be saved completed but truncated.
- Object: Input text
- Speed of speech: Allow to slow down the speed of speech (ttsspeed=0.24)
Google 文字轉語音 - Google Play Android 應用程式 on Android
- Object: Some books on Google play
博客來電子書櫃 on Android 使用 Google 文字轉語音的服務
- Object: books
Kindle 部分版本支援 Amazon.com Help: Features Available in Kindle Books
- 物件: 書籍
文字MP3 on Win / TTS文字轉語音引擎
- 物件: 輸入文字 或 Excel
- 物件: 「OneNote、Outlook、PowerPoint 及 Word」
- 物件: 網頁
- 語言: 中、英文、台語
Bing Speech API - 語音辨識 | Microsoft Azure
- Object: Input text
Cloud Text-to-Speech - Speech Synthesis | Google Cloud
- Object: Input text
Wizzard Speech I ATT Natural Voices SDK [Last visited: 2018-09-04]
- Object: Input text
- Language: English, Spanish
vozMe - From text to speech (speech synthesis) [Last visited: 2018-09-04]
- Object: Input text
- Language: English, Español, Italiano, Hindi, Português, Català
Read Aloud: 文字語音朗讀助理 - Chrome 線上應用程式商店
- 物件: 網頁
- 語言: 中
How 哥產生器!開發者整理素材,讓 HowHow 幫你講任何中文句子 - INSIDE
- 物件: 文字
- 語言: 中
Speech to text 工具
Speech API - 語音辨識 | Google Cloud 「語音轉文字採用機器學習技術」,免費版語音辨識的額度 60 分鐘,詳 定價 | Cloud Speech API Documentation | Google Cloud。 [Last visited: 2018-09-04]
- Object: microphone & audio file (For audio file which longer than 1 minute, upload files to Google cloud storage.
- Language: 120 languages [1]
- Sample code:
- Related: Troubleshooting of Google cloud speech to text)
Bing 語音 API - 語音辨識軟體 | Microsoft Azure
- Object: Audio file. Format: wav & ogg[2]
- Language: Traditional Chinese, Simplified Chinese & English and more on the list[3]
- Sample code: Azure-Samples/SpeechToText-REST: REST Samples of Speech To Text API
- Related:
OLAMI 中文語音辨識 API|歐拉蜜人工智慧開放平台(威盛電子) [Last visited: 2018-09-05]
- Object: Audio file. Format: wav & speex [4]
- Language: Traditional Chinese & Simplified Chinese [5]
- Sample code: olami-developers/olami-api-quickstart-curl-samples
- Related: Troubleshooting of Olami speech to text
影片要產生文字,可利用 youtube 的 Use automatic captioning - YouTube Help,約需要半天時間 [Last visited: 2018-09-04] 教學: YouTube超佛心,自動幫你加入字幕! | T客邦
- Object: Video
- Language:
- Sample code:
- Related:
语音识别 - 讯飞开放平台 [Last visited: 2018-09-06]
- Object: speex audio file less than 1 minute [6]
- Language: 中文(普通话)、英文、中文(粤语)、中文(四川话)
- Sample code:
- Related:
Amazon Transcribe – 自動語音辨識 – AWS (API documentation: What Is Amazon Transcribe? - Amazon Transcribe) [Last visited: 2018-09-05]
- Object: Audio file (Stored in S3 bucket). "Valid formats for the audio are mp3, mp4, wav and flac. [7]"
- Language: English, Spanish
- Sample code:
- Related:
Web Speech to Text 教學: 免費!中文影片語音轉文字字幕,支援超大影片與長時間錄音
- 物件: 電腦影像、聲音、YouTube 網址
- 語言: 中文、英文、日文、韓文
References
- ↑ Language Support | Cloud Speech-to-Text API | Google Cloud
- ↑ 語音轉換文字 API 參考(REST)-語音服務 - Azure Cognitive Services | Microsoft Docs
- ↑ 語言支援-語音服務 - Azure Cognitive Services | Microsoft Docs
- ↑ 文件中心 - OLAMI - 歐拉蜜人工智慧開放平台
- ↑ olami-api-quickstart-curl-samples/cloud-speech-recognition at master · olami-developers/olami-api-quickstart-curl-samples
- ↑ 语音听写 · 科大讯飞REST_API开发指南
- ↑ StartTranscriptionJob - Amazon Transcribe For best results, use a lossless format, such as FLAC or WAV with PCM 16-bit encoding.Your audio input can be sampled at any rate between 8000 and 48000 Hz. We suggest that you use 8000 Hz for low-quality audio and 16000 Hz for high-quality audio.