Text to speech: Difference between revisions

Text to speech (edit)

Revision as of 14:27, 10 September 2018

565 bytes added, 10 September 2018

→Speech to text tools

Planetoid (Bureaucrats, Administrators; 14,982 edits)

@@ Line 53: / Line 53: @@
 * Object: microphone & audio file (for audio files longer than 1 minute, upload the files to Google Cloud Storage)
 * Language: 120 languages <ref>[https://cloud.google.com/speech-to-text/docs/languages?hl=zh-tw Language Support  |  Cloud Speech-to-Text API  |  Google Cloud]</ref>
+* Sample code:
 * Related: [[Troubleshooting of Google cloud speech to text]]
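The one-minute rule noted for Google above can be checked locally before deciding how to submit a file. The following is a minimal sketch using only Python's standard library; it handles WAV input only, the function names are hypothetical, and it does not call any Google API.

```python
import wave

# Threshold taken from the note above: longer files go to cloud storage first.
ONE_MINUTE_SECONDS = 60.0


def wav_duration_seconds(path):
    """Return the duration of a WAV file in seconds."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def needs_cloud_storage(path, limit_seconds=ONE_MINUTE_SECONDS):
    """Return True if the WAV file is longer than the limit (hypothetical helper)."""
    return wav_duration_seconds(path) > limit_seconds
```

For example, `needs_cloud_storage("meeting.wav")` would return True for a recording longer than one minute, in which case the file should be uploaded before transcription. Other formats would need a different duration probe.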
+[https://azure.microsoft.com/zh-tw/services/cognitive-services/speech/ Bing Speech API - Speech Recognition Software | Microsoft Azure]
+* Object:
+* Language:
+* Sample code: [https://github.com/Azure-Samples/SpeechToText-REST Azure-Samples/SpeechToText-REST: REST Samples of Speech To Text API]
+* Related:
 [https://tw.olami.ai/open/website/apiandsolution/api_solution OLAMI Chinese Speech Recognition API | OLAMI Artificial Intelligence Open Platform (VIA Technologies)] {{access | date = 2018-09-05}}
 * Object: Audio file. Format: wav & speex <ref>[https://tw.olami.ai/wiki/?mp=api_asr&content=api_asr1.html Documentation Center - OLAMI - OLAMI Artificial Intelligence Open Platform]</ref>
 * Language: Traditional Chinese & Simplified Chinese <ref>[https://github.com/olami-developers/olami-api-quickstart-curl-samples/tree/master/cloud-speech-recognition olami-api-quickstart-curl-samples/cloud-speech-recognition at master · olami-developers/olami-api-quickstart-curl-samples]</ref>
+* Sample code: [https://github.com/olami-developers/olami-api-quickstart-curl-samples/tree/master/cloud-speech-recognition olami-developers/olami-api-quickstart-curl-samples]
 * Related: [[Troubleshooting of Olami speech to text]]
@@ Line 63: / Line 71: @@
 * Object: Video
 * Language:
+* Sample code:
+* Related:
 [https://www.xfyun.cn/doccenter/asr Speech Recognition - iFLYTEK Open Platform] {{access | date=2018-09-06}}
 * Object: Speex audio file shorter than 1 minute <ref>[https://doc.xfyun.cn/rest_api/%E8%AF%AD%E9%9F%B3%E5%90%AC%E5%86%99.html Speech Dictation · iFLYTEK REST_API Developer Guide]</ref>
 * Language: Chinese (Mandarin), English, Chinese (Cantonese), Chinese (Sichuanese)
+* Sample code:
+* Related:
 [https://aws.amazon.com/tw/transcribe/ Amazon Transcribe – Automatic Speech Recognition – AWS] (API documentation: [https://docs.aws.amazon.com/transcribe/latest/dg/what-is-transcribe.html What Is Amazon Transcribe? - Amazon Transcribe]) {{access | date=2018-09-05}}
 * Object: Audio file (stored in an S3 bucket). "Valid formats for the audio are mp3, mp4, wav and flac. For best results, use a lossless format, such as FLAC or WAV with PCM 16-bit encoding. Your audio input can be sampled at any rate between 8000 and 48000 Hz. We suggest that you use 8000 Hz for low-quality audio and 16000 Hz for high-quality audio." <ref>[https://docs.aws.amazon.com/transcribe/latest/dg/API_StartTranscriptionJob.html StartTranscriptionJob - Amazon Transcribe]</ref>
 * Language: English, Spanish
+* Sample code:
+* Related:
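The format and sample-rate limits quoted above for Amazon Transcribe can be pre-checked before uploading a file to S3. This is a rough sketch with Python's standard library only; `check_audio` is a hypothetical helper, the sample rate is inspected for WAV files only, and nothing here calls the AWS API.

```python
import os
import wave

# Limits copied from the Amazon Transcribe note above.
VALID_FORMATS = {"mp3", "mp4", "wav", "flac"}
MIN_RATE_HZ = 8000
MAX_RATE_HZ = 48000


def check_audio(path):
    """Return a list of problems; an empty list means these checks passed."""
    problems = []
    ext = os.path.splitext(path)[1].lower().lstrip(".")
    if ext not in VALID_FORMATS:
        problems.append("unsupported format: " + (ext or "(none)"))
    elif ext == "wav":
        # Only WAV headers are read here; other formats need an external probe.
        with wave.open(path, "rb") as wav:
            rate = wav.getframerate()
        if not MIN_RATE_HZ <= rate <= MAX_RATE_HZ:
            problems.append("sample rate out of range: %d Hz" % rate)
    return problems
```

A file that passes returns an empty list; a file with an unlisted extension is rejected by name alone, without being opened.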
 == References ==
