14,982
edits
mNo edit summary |
|||
| Line 53: | Line 53: | ||
* Object: microphone & audio file (For audio file which longer than 1 minute, upload files to Google cloud storage. | * Object: microphone & audio file (For audio file which longer than 1 minute, upload files to Google cloud storage. | ||
* Language: 120 languages <ref>[https://cloud.google.com/speech-to-text/docs/languages?hl=zh-tw Language Support | Cloud Speech-to-Text API | Google Cloud]</ref> | * Language: 120 languages <ref>[https://cloud.google.com/speech-to-text/docs/languages?hl=zh-tw Language Support | Cloud Speech-to-Text API | Google Cloud]</ref> | ||
* Sample code: | |||
* Related: [[Troubleshooting of Google cloud speech to text]]) | * Related: [[Troubleshooting of Google cloud speech to text]]) | ||
[https://azure.microsoft.com/zh-tw/services/cognitive-services/speech/ Bing 語音 API - 語音辨識軟體 | Microsoft Azure] | |||
* Object: | |||
* Language: | |||
* Sample code: [https://github.com/Azure-Samples/SpeechToText-REST Azure-Samples/SpeechToText-REST: REST Samples of Speech To Text API] | |||
* Related: | |||
[https://tw.olami.ai/open/website/apiandsolution/api_solution OLAMI 中文語音辨識 API|歐拉蜜人工智慧開放平台(威盛電子)] {{access | date = 2018-09-05}} | [https://tw.olami.ai/open/website/apiandsolution/api_solution OLAMI 中文語音辨識 API|歐拉蜜人工智慧開放平台(威盛電子)] {{access | date = 2018-09-05}} | ||
* Object: Audio file. Format: wac & speex <ref>[https://tw.olami.ai/wiki/?mp=api_asr&content=api_asr1.html 文件中心 - OLAMI - 歐拉蜜人工智慧開放平台]</ref> | * Object: Audio file. Format: wac & speex <ref>[https://tw.olami.ai/wiki/?mp=api_asr&content=api_asr1.html 文件中心 - OLAMI - 歐拉蜜人工智慧開放平台]</ref> | ||
* Language: Traditional Chinese & Simplified Chinese <ref>[https://github.com/olami-developers/olami-api-quickstart-curl-samples/tree/master/cloud-speech-recognition olami-api-quickstart-curl-samples/cloud-speech-recognition at master · olami-developers/olami-api-quickstart-curl-samples]</ref> | * Language: Traditional Chinese & Simplified Chinese <ref>[https://github.com/olami-developers/olami-api-quickstart-curl-samples/tree/master/cloud-speech-recognition olami-api-quickstart-curl-samples/cloud-speech-recognition at master · olami-developers/olami-api-quickstart-curl-samples]</ref> | ||
* Sample code: [https://github.com/olami-developers/olami-api-quickstart-curl-samples/tree/master/cloud-speech-recognition olami-developers/olami-api-quickstart-curl-samples] | |||
* Related: [[Troubleshooting of Olami speech to text]] | * Related: [[Troubleshooting of Olami speech to text]] | ||
| Line 63: | Line 71: | ||
* Object: Video | * Object: Video | ||
* Language: | * Language: | ||
* Sample code: | |||
* Related: | |||
[https://www.xfyun.cn/doccenter/asr 语音识别 - 讯飞开放平台] {{access | date=2018-09-06}} | [https://www.xfyun.cn/doccenter/asr 语音识别 - 讯飞开放平台] {{access | date=2018-09-06}} | ||
* Object: speex audio file less than 1 minute <ref>[https://doc.xfyun.cn/rest_api/%E8%AF%AD%E9%9F%B3%E5%90%AC%E5%86%99.html 语音听写 · 科大讯飞REST_API开发指南]</ref> | * Object: speex audio file less than 1 minute <ref>[https://doc.xfyun.cn/rest_api/%E8%AF%AD%E9%9F%B3%E5%90%AC%E5%86%99.html 语音听写 · 科大讯飞REST_API开发指南]</ref> | ||
* Language: 中文(普通话)、英文、中文(粤语)、中文(四川话) | * Language: 中文(普通话)、英文、中文(粤语)、中文(四川话) | ||
* Sample code: | |||
* Related: | |||
[https://aws.amazon.com/tw/transcribe/ Amazon Transcribe – 自動語音辨識 – AWS] (API documentation: [https://docs.aws.amazon.com/transcribe/latest/dg/what-is-transcribe.html What Is Amazon Transcribe? - Amazon Transcribe]) {{access | date=2018-09-05}} | [https://aws.amazon.com/tw/transcribe/ Amazon Transcribe – 自動語音辨識 – AWS] (API documentation: [https://docs.aws.amazon.com/transcribe/latest/dg/what-is-transcribe.html What Is Amazon Transcribe? - Amazon Transcribe]) {{access | date=2018-09-05}} | ||
* Object: Audio file (Stored in S3 bucket). "Valid formats for the audio are mp3, mp4, wav and flac. <ref>[https://docs.aws.amazon.com/transcribe/latest/dg/API_StartTranscriptionJob.html StartTranscriptionJob - Amazon Transcribe] For best results, use a lossless format, such as FLAC or WAV with PCM 16-bit encoding.Your audio input can be sampled at any rate between 8000 and 48000 Hz. We suggest that you use 8000 Hz for low-quality audio and 16000 Hz for high-quality audio.</ref>" | * Object: Audio file (Stored in S3 bucket). "Valid formats for the audio are mp3, mp4, wav and flac. <ref>[https://docs.aws.amazon.com/transcribe/latest/dg/API_StartTranscriptionJob.html StartTranscriptionJob - Amazon Transcribe] For best results, use a lossless format, such as FLAC or WAV with PCM 16-bit encoding.Your audio input can be sampled at any rate between 8000 and 48000 Hz. We suggest that you use 8000 Hz for low-quality audio and 16000 Hz for high-quality audio.</ref>" | ||
* Language: English, Spanish | * Language: English, Spanish | ||
* Sample code: | |||
* Related: | |||
== References == | == References == | ||