Text to speech: Difference between revisions

Revision as of 13:44, 6 September 2018

Text to speech services

This article "Text to speech" is still being written. If there are any incomplete parts, you are welcome to directly edit them. 這篇文章「Text to speech」內容還在撰寫中，如果有不完整的部分，歡迎你直接動手修改。

Text to speech 工具

Pocket on Android or iOS Listening to Articles in Pocket with Text-to-Speech - Pocket Support

Object: Saved web page the content may not be saved completed but truncated.

Google 翻譯

Object: Input text

Google 文字轉語音 - Google Play Android 應用程式 on Android

Object: Some books on Google play

博客來電子書櫃 on Android 使用 Google 文字轉語音的服務

Object: books

Kindle 部分版本支援 Amazon.com Help: Features Available in Kindle Books

物件: 書籍

文字MP3 on Win / TTS文字轉語音引擎

物件: 輸入文字或 Excel

配合多語系 TTS 使用語音功能 - Office 支援

物件: 「OneNote、Outlook、PowerPoint 及 Word」

工研院文字轉語音Web服務

物件: 網頁
語言: 中、英文、台語

Bing Speech API - 語音辨識 | Microsoft Azure

Object: Input text

Cloud Text-to-Speech - Speech Synthesis | Google Cloud

Object: Input text

Wizzard Speech I ATT Natural Voices SDK [Last visited: 2018-09-04]

Object: Input text
Language: English, Spanish

vozMe - From text to speech (speech synthesis) [Last visited: 2018-09-04]

Object: Input text
Language: English, Español, Italiano, Hindi, Português, Català

Speech to text 工具

Speech API - 語音辨識 | Google Cloud 「語音轉文字採用機器學習技術」，免費版語音辨識的額度 60 分鐘，詳定價 | Cloud Speech API Documentation | Google Cloud。 [Last visited: 2018-09-04]

Object: microphone & audio file (For audio file which longer than 1 minute, upload files to Google cloud storage. Related page: Troubleshooting of Google cloud speech to text)
Language: 120 languages ^[1]

OLAMI 中文語音辨識 API｜歐拉蜜人工智慧開放平台（威盛電子） [Last visited: 2018-09-05]

Object: Audio file. Format: wac & speex ^[2]
Language: Traditional Chinese & Simplified Chinese ^[3]

影片要產生文字，可利用 youtube 的 Use automatic captioning - YouTube Help，約需要半天時間 [Last visited: 2018-09-04] 教學: YouTube超佛心，自動幫你加入字幕！ | T客邦

Object: Video
Language:

语音识别 - 讯飞开放平台 [Last visited: 2018-09-06]

Object: speex audio file less than 1 minute ^[4]
Language: 中文（普通话）、英文、中文（粤语）、中文（四川话）

Amazon Transcribe – 自動語音辨識 – AWS (API documentation: What Is Amazon Transcribe? - Amazon Transcribe) [Last visited: 2018-09-05]

Object: Audio file (Stored in S3 bucket). "Valid formats for the audio are mp3, mp4, wav and flac. ^[5]"
Language: English, Spanish

References

↑ Language Support | Cloud Speech-to-Text API | Google Cloud
↑ 文件中心 - OLAMI - 歐拉蜜人工智慧開放平台
↑ olami-api-quickstart-curl-samples/cloud-speech-recognition at master · olami-developers/olami-api-quickstart-curl-samples
↑ 语音听写 · 科大讯飞REST_API开发指南
↑ StartTranscriptionJob - Amazon Transcribe For best results, use a lossless format, such as FLAC or WAV with PCM 16-bit encoding.Your audio input can be sampled at any rate between 8000 and 48000 Hz. We suggest that you use 8000 Hz for low-quality audio and 16000 Hz for high-quality audio.

[1] Language Support | Cloud Speech-to-Text API | Google Cloud

[2] 文件中心 - OLAMI - 歐拉蜜人工智慧開放平台

[3] -api-quickstart-curl-samples/cloud-speech-recognition at master · olami-developers/olami-api-quickstart-curl-samples

[4] 语音听写 · 科大讯飞REST_API开发指南

[5] StartTranscriptionJob - Amazon Transcribe For best results, use a lossless format, such as FLAC or WAV with PCM 16-bit encoding.Your audio input can be sampled at any rate between 8000 and 48000 Hz. We suggest that you use 8000 Hz for low-quality audio and 16000 Hz for high-quality audio.

[1]

[2]

[3]

[4]

[5]

@@ Line 59: / Line 59: @@
 * Language:
-[https://aws.amazon.com/tw/transcribe/ Amazon Transcribe – 自動語音辨識 – AWS] (API documentation: [https://docs.aws.amazon.com/transcribe/latest/dg/what-is-transcribe.html What Is Amazon Transcribe? - Amazon Transcribe])
+[https://www.xfyun.cn/doccenter/asr 语音识别 - 讯飞开放平台] {{access | date=2018-09-06}}
+* Object: speex audio file less than 1 minute <ref>[https://doc.xfyun.cn/rest_api/%E8%AF%AD%E9%9F%B3%E5%90%AC%E5%86%99.html 语音听写 · 科大讯飞REST_API开发指南]</ref>
+* Language: 中文（普通话）、英文、中文（粤语）、中文（四川话）
+[https://aws.amazon.com/tw/transcribe/ Amazon Transcribe – 自動語音辨識 – AWS] (API documentation: [https://docs.aws.amazon.com/transcribe/latest/dg/what-is-transcribe.html What Is Amazon Transcribe? - Amazon Transcribe]) {{access | date=2018-09-05}}
 * Object: Audio file (Stored in S3 bucket). "Valid formats for the audio are mp3, mp4, wav and flac. <ref>[https://docs.aws.amazon.com/transcribe/latest/dg/API_StartTranscriptionJob.html StartTranscriptionJob - Amazon Transcribe] For best results, use a lossless format, such as FLAC or WAV with PCM 16-bit encoding.Your audio input can be sampled at any rate between 8000 and 48000 Hz. We suggest that you use 8000 Hz for low-quality audio and 16000 Hz for high-quality audio.</ref>"
 * Language: English, Spanish

Text to speech: Difference between revisions

Revision as of 13:44, 6 September 2018

Text to speech 工具

Speech to text 工具

References

Navigation menu

Search