Text to speech: Difference between revisions
| (23 intermediate revisions by the same user not shown) | |||
| Line 1: | Line 1: | ||
Text to speech services | Text to speech services | ||
{{Template:Generative AI Tool}} | |||
__TOC__ | __TOC__ | ||
| Line 20: | Line 22: | ||
* 物件: 書籍 | * 物件: 書籍 | ||
[ | [https://www.iqt.ai/ 網際智慧]: [https://www.voai.ai/ VoAI] - "絕好聲創|台灣口音高擬真AI聲優|AI配音、拍照/文字生成Podcast" | ||
* 物件: 輸入文字 | * 物件: 輸入文字 | ||
* | * 授權: 商業授權 | ||
[https://support.office.com/zh-tw/article/%E9%85%8D%E5%90%88%E5%A4%9A%E8%AA%9E%E7%B3%BB-tts-%E4%BD%BF%E7%94%A8%E8%AA%9E%E9%9F%B3%E5%8A%9F%E8%83%BD-e522a4f2-37cb-492b-be6a-8997d23dfe70 配合多語系 TTS 使用語音功能 - Office 支援] | [https://support.office.com/zh-tw/article/%E9%85%8D%E5%90%88%E5%A4%9A%E8%AA%9E%E7%B3%BB-tts-%E4%BD%BF%E7%94%A8%E8%AA%9E%E9%9F%B3%E5%8A%9F%E8%83%BD-e522a4f2-37cb-492b-be6a-8997d23dfe70 配合多語系 TTS 使用語音功能 - Office 支援] | ||
| Line 54: | Line 56: | ||
* 語言: 輸入中文,會聽到老外腔講中文 | * 語言: 輸入中文,會聽到老外腔講中文 | ||
[https:// | ''$'' [https://voicenotes.com/ Voicenotes | AI Voice Notes App] | ||
* | * Input: Simply record your voice directly in the app. ({{exclaim}} Note: File uploads are not currently supported) | ||
* | * Language: Automatic language detection. For example, if you record in spoken Chinese, you'll receive a Chinese transcript. | ||
* | |||
[https://stability.ai/news/stable-audio-2-0 Introducing Stable Audio 2.0 — Stability AI] | |||
* 尚未提供線上服務 | |||
''停止服務'' [http://tts.itri.org.tw/index.php 工研院文字轉語音Web服務] | ''停止服務'' [http://tts.itri.org.tw/index.php 工研院文字轉語音Web服務] | ||
* 物件: 網頁 | * 物件: 網頁 | ||
* 語言: 中、英文、台語 | * 語言: 中、英文、台語 | ||
== Text to sound effect == | |||
[https://www.optimizerai.xyz/my/all OptimizerAI : Get Unlimited Sounds] | |||
* Free Usage limit: Free for use, downloading the file is not permitted. | |||
== Speech to text 工具 == | == Speech to text 工具 == | ||
{{Gd}} [https://github.com/openai/whisper openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision] | |||
* Support Language: 99 languages | |||
* Input file: Audio files | |||
* Speaker identification: Need to integrate with (1) [https://github.com/m-bain/whisperX m-bain/whisperX: WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)] or (2) [https://github.com/pyannote/pyannote-audio pyannote/pyannote-audio: Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding] | |||
* Real-Time Subtitles or Translation: Not Available | |||
* Related: | |||
** [https://huggingface.co/spaces/Xenova/whisper-webgpu Whisper WebGPU - a Hugging Face Space by Xenova] | |||
** [https://github.com/aaaddress1/Whisper.py?fbclid=IwAR1rwZH-USj2NIt8pLYRGhIqWvQWUj1FQTx83qpBncno3ANWDUBI_duWr9M aaaddress1/Whisper.py: 白癡喔還要下 pip install 誰會用啦—隨開即用 Windows 版 OpenAI Whisper 逐字稿產生器] on {{Win}} 介紹:[https://www.playpcesor.com/2023/04/whisperdesktop-ai.html WhisperDesktop 語音轉文字免費單機軟體,AI 影片字幕實測比較] | |||
** 🎙️ MacWhisper https://goodsnooze.gumroad.com/l/macwhisper on {{Mac}} | |||
[https://cloud.google.com/speech/?hl=zh-tw Speech API - 語音辨識 | Google Cloud] 「語音轉文字採用機器學習技術」,免費版語音辨識的額度 60 分鐘,詳 [https://cloud.google.com/speech-to-text/pricing 定價 | Cloud Speech API Documentation | Google Cloud]。 {{access | date = 2018-09-04}} | [https://cloud.google.com/speech/?hl=zh-tw Speech API - 語音辨識 | Google Cloud] 「語音轉文字採用機器學習技術」,免費版語音辨識的額度 60 分鐘,詳 [https://cloud.google.com/speech-to-text/pricing 定價 | Cloud Speech API Documentation | Google Cloud]。 {{access | date = 2018-09-04}} | ||
* | * Input: microphone & audio file (For audio file which longer than 1 minute, upload files to Google cloud storage. | ||
* Language: 120 languages <ref>[https://cloud.google.com/speech-to-text/docs/languages?hl=zh-tw Language Support | Cloud Speech-to-Text API | Google Cloud]</ref> | * Language: 120 languages <ref>[https://cloud.google.com/speech-to-text/docs/languages?hl=zh-tw Language Support | Cloud Speech-to-Text API | Google Cloud]</ref> | ||
* Sample code: | * Sample code: | ||
| Line 72: | Line 90: | ||
[https://azure.microsoft.com/zh-tw/services/cognitive-services/speech/ Bing 語音 API - 語音辨識軟體 | Microsoft Azure] | [https://azure.microsoft.com/zh-tw/services/cognitive-services/speech/ Bing 語音 API - 語音辨識軟體 | Microsoft Azure] | ||
* | * Input: Audio file. Format: wav & ogg<ref>[https://docs.microsoft.com/zh-tw/azure/cognitive-services/speech-service/rest-speech-to-text 語音轉換文字 API 參考(REST)-語音服務 - Azure Cognitive Services | Microsoft Docs]</ref> | ||
* Language: Traditional Chinese, Simplified Chinese & English and more on the list<ref>[https://docs.microsoft.com/zh-tw/azure/cognitive-services/speech-service/language-support#speech-to-text 語言支援-語音服務 - Azure Cognitive Services | Microsoft Docs]</ref> | * Language: Traditional Chinese, Simplified Chinese & English and more on the list<ref>[https://docs.microsoft.com/zh-tw/azure/cognitive-services/speech-service/language-support#speech-to-text 語言支援-語音服務 - Azure Cognitive Services | Microsoft Docs]</ref> | ||
* Sample code: [https://github.com/Azure-Samples/SpeechToText-REST Azure-Samples/SpeechToText-REST: REST Samples of Speech To Text API] | * Sample code: [https://github.com/Azure-Samples/SpeechToText-REST Azure-Samples/SpeechToText-REST: REST Samples of Speech To Text API] | ||
* Related: | * Related: | ||
[https://app.clipchamp.com/ Clipchamp] {{access | date = 2025-04-02}} | |||
* Input: audio or video file | |||
* Support Language: 80+ languages<ref>[https://support.microsoft.com/en-us/topic/how-to-use-autocaptions-in-clipchamp-ccb0520b-38f6-4fa9-aca8-872c2964946a How to use autocaptions in Clipchamp - Microsoft Support]</ref><ref>[https://learn.microsoft.com/zh-tw/azure/ai-services/speech-service/language-support?tabs=stt 語言支援 - 語音服務 - Azure AI services | Microsoft Learn]</ref> | |||
* Comments: The free version seems to have no limitation on video duration, and you can also use AI to convert videos or audio into transcripts for free. However, during testing, the subtitles displayed for each time code were not complete sentences. | |||
[https://ink.dwave.cc/en-US/pricing Meeting Ink - AI notetaker to transcribe and summarize your meetings and recordings.] | |||
* Support Language: | |||
* Input file: Audio files | |||
* Speaker identification: Available {{Gd}} | |||
* Real-Time Subtitles or Translation: Pro plan only ''$'' | |||
* Free limit: 30 minutes max | |||
[https://tw.olami.ai/open/website/apiandsolution/api_solution OLAMI 中文語音辨識 API|歐拉蜜人工智慧開放平台(威盛電子)] {{access | date = 2018-09-05}} | [https://tw.olami.ai/open/website/apiandsolution/api_solution OLAMI 中文語音辨識 API|歐拉蜜人工智慧開放平台(威盛電子)] {{access | date = 2018-09-05}} | ||
* | * Input: Audio file. Format: wav & speex <ref>[https://tw.olami.ai/wiki/?mp=api_asr&content=api_asr1.html 文件中心 - OLAMI - 歐拉蜜人工智慧開放平台]</ref> | ||
* Language: Traditional Chinese & Simplified Chinese <ref>[https://github.com/olami-developers/olami-api-quickstart-curl-samples/tree/master/cloud-speech-recognition olami-api-quickstart-curl-samples/cloud-speech-recognition at master · olami-developers/olami-api-quickstart-curl-samples]</ref> | * Language: Traditional Chinese & Simplified Chinese <ref>[https://github.com/olami-developers/olami-api-quickstart-curl-samples/tree/master/cloud-speech-recognition olami-api-quickstart-curl-samples/cloud-speech-recognition at master · olami-developers/olami-api-quickstart-curl-samples]</ref> | ||
* Sample code: [https://github.com/olami-developers/olami-api-quickstart-curl-samples/tree/master/cloud-speech-recognition olami-developers/olami-api-quickstart-curl-samples] | * Sample code: [https://github.com/olami-developers/olami-api-quickstart-curl-samples/tree/master/cloud-speech-recognition olami-developers/olami-api-quickstart-curl-samples] | ||
| Line 84: | Line 114: | ||
影片要產生文字,可利用 youtube 的 [https://support.google.com/youtube/answer/6373554?hl=en Use automatic captioning - YouTube Help],約需要半天時間 {{access | date = 2018-09-04}} 教學: [https://www.techbang.com/posts/2107 YouTube超佛心,自動幫你加入字幕! | T客邦] | 影片要產生文字,可利用 youtube 的 [https://support.google.com/youtube/answer/6373554?hl=en Use automatic captioning - YouTube Help],約需要半天時間 {{access | date = 2018-09-04}} 教學: [https://www.techbang.com/posts/2107 YouTube超佛心,自動幫你加入字幕! | T客邦] | ||
* | * Input: Video | ||
* Language: | * Language: | ||
* Sample code: | * Sample code: | ||
| Line 90: | Line 120: | ||
[https://www.xfyun.cn/doccenter/asr 语音识别 - 讯飞开放平台] {{access | date=2018-09-06}} | [https://www.xfyun.cn/doccenter/asr 语音识别 - 讯飞开放平台] {{access | date=2018-09-06}} | ||
* | * Input: speex audio file less than 1 minute <ref>[https://doc.xfyun.cn/rest_api/%E8%AF%AD%E9%9F%B3%E5%90%AC%E5%86%99.html 语音听写 · 科大讯飞REST_API开发指南]</ref> | ||
* Language: 中文(普通话)、英文、中文(粤语)、中文(四川话) | * Language: 中文(普通话)、英文、中文(粤语)、中文(四川话) | ||
* Sample code: | * Sample code: | ||
| Line 96: | Line 126: | ||
[https://aws.amazon.com/tw/transcribe/ Amazon Transcribe – 自動語音辨識 – AWS] (API documentation: [https://docs.aws.amazon.com/transcribe/latest/dg/what-is-transcribe.html What Is Amazon Transcribe? - Amazon Transcribe]) {{access | date=2018-09-05}} | [https://aws.amazon.com/tw/transcribe/ Amazon Transcribe – 自動語音辨識 – AWS] (API documentation: [https://docs.aws.amazon.com/transcribe/latest/dg/what-is-transcribe.html What Is Amazon Transcribe? - Amazon Transcribe]) {{access | date=2018-09-05}} | ||
* | * Input: Audio file (Stored in S3 bucket). "Valid formats for the audio are mp3, mp4, wav and flac. <ref>[https://docs.aws.amazon.com/transcribe/latest/dg/API_StartTranscriptionJob.html StartTranscriptionJob - Amazon Transcribe] For best results, use a lossless format, such as FLAC or WAV with PCM 16-bit encoding.Your audio input can be sampled at any rate between 8000 and 48000 Hz. We suggest that you use 8000 Hz for low-quality audio and 16000 Hz for high-quality audio.</ref>" | ||
* Language: English, Spanish | * Language: English, Spanish | ||
* Sample code: | * Sample code: | ||
| Line 111: | Line 141: | ||
* Free limit: 5 minutes | * Free limit: 5 minutes | ||
[https://github.com/ | [https://github.com/SYSTRAN/faster-whisper?tab=readme-ov-file SYSTRAN/faster-whisper: Faster Whisper transcription with CTranslate2] | ||
* Language: | * Language: Fork from OpenAI Whisper | ||
* Sample code: | * Sample code: [https://colab.research.google.com/drive/1TqmzTY5ZXcYBoBGbwSVBtwxlFajMIcRc?usp=sharing] | ||
* Related: | * Related: | ||
* Free limit: | * Free limit: | ||
* Instruction: [https://gsyan888.blogspot.com/2023/11/faster-whisper.html 雄::gsyan: 以 Faster Whisper 將影音辨識為文字檔案(字幕或逐字稿)] | |||
[https://www.mygoodtape.com/ Good Tape] | |||
* Support Language: | |||
* Input file: Audio files | |||
* Speaker identification: Available {{Gd}} | |||
* Real-Time Subtitles or Translation: Not Available | |||
* Free limit: 20 minutes max | |||
* Language: | [https://www.larksuite.com/ Lark | Business Chat & Collaboration Tool] ([https://zh.wikipedia.org/wiki/%E9%A3%9E%E4%B9%A6 飞书 - 維基百科,自由的百科全書]) | ||
* Language: | |||
* Sample code: | * Sample code: | ||
* Related: | * Related: | ||
* Free limit: | * Free limit: | ||
[https:// | [https://github.com/Const-me/Whisper Const-me/Whisper: High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model] on {{Win}} | ||
* Language: | * Language: | ||
* Sample code: | * Sample code: | ||
* Related: | * Related: | ||
* Free limit: | * Free limit: | ||
[https:// | [https://web.itranscribe.co/#/homepage iTranscribe: Transcribe Audio & Video to Text] | ||
* Language: | * Language: | ||
* Sample code: | * Sample code: | ||
| Line 135: | Line 174: | ||
* Free limit: | * Free limit: | ||
[https:// | [https://www.capcut.cn/ 剪映官網-全能易用的桌面端剪輯軟體-輕而易剪 上演大幕] 中國軟體 {{exclaim}} | ||
* Language: | * Language: | ||
* Sample code: | * Sample code: | ||
| Line 143: | Line 182: | ||
== Further reading == | == Further reading == | ||
* [https://www.playpcesor.com/2023/04/whisperdesktop-ai.html WhisperDesktop 語音轉文字免費單機軟體,AI 影片字幕實測比較] | * [https://www.playpcesor.com/2023/04/whisperdesktop-ai.html WhisperDesktop 語音轉文字免費單機軟體,AI 影片字幕實測比較] | ||
* [https://fc.bnext.com.tw/articles/view/3590?utm_source=fc_weekly&utm_medium=email&bx_heid=5084316299&utm_campaign=09-10-2024 一手評測|開箱 Good Tape、雅婷逐字稿、Vocol.ai,哪款 AI 逐字稿軟體最好用?|未來商務] | |||
* [[Troubleshooting of whisperX]] | |||
如果改善 TTS | 如果改善 TTS | ||
* [https://joehuang-pop.github.io/2020/07/02/Google-API-%E6%9C%89%E9%9D%88%E9%AD%82%E7%9A%84Google%E5%B0%8F%E5%A7%90%EF%BC%8C%E4%BD%BF%E7%94%A8-SSML%E6%8A%80%E8%A1%93%E5%BC%B7%E5%8C%96Text-to-Speech/ (Google API) 有靈魂的Google小姐,使用 SSML技術強化Text-to-Speech | 黃大仙的雲端修行室] | * [https://joehuang-pop.github.io/2020/07/02/Google-API-%E6%9C%89%E9%9D%88%E9%AD%82%E7%9A%84Google%E5%B0%8F%E5%A7%90%EF%BC%8C%E4%BD%BF%E7%94%A8-SSML%E6%8A%80%E8%A1%93%E5%BC%B7%E5%8C%96Text-to-Speech/ (Google API) 有靈魂的Google小姐,使用 SSML技術強化Text-to-Speech | 黃大仙的雲端修行室] | ||
== Related keywords == | |||
* [[Video to text | voice to text]] | |||
== References == | == References == | ||
Latest revision as of 14:23, 1 April 2026
Text to speech services
Text to speech 工具[edit]
Pocket on Android or iOS Listening to Articles in Pocket with Text-to-Speech - Pocket Support
- Object: Saved web page
the content may not be saved completed but truncated.
- Object: Input text
- Speed of speech:
Allow to slow down the speed of speech (ttsspeed=0.24)
Google 文字轉語音 - Google Play Android 應用程式 on Android
- Object: Some books on Google play
博客來電子書櫃 on Android 使用 Google 文字轉語音的服務
- Object: books
Kindle 部分版本支援 Amazon.com Help: Features Available in Kindle Books
- 物件: 書籍
網際智慧: VoAI - "絕好聲創|台灣口音高擬真AI聲優|AI配音、拍照/文字生成Podcast"
- 物件: 輸入文字
- 授權: 商業授權
- 物件: 「OneNote、Outlook、PowerPoint 及 Word」
Bing Speech API - 語音辨識 | Microsoft Azure
- Object: Input text
Cloud Text-to-Speech - Speech Synthesis | Google Cloud
- Object: Input text
Wizzard Speech I ATT Natural Voices SDK [Last visited: 2018-09-04]
- Object: Input text
- Language: English, Spanish
vozMe - From text to speech (speech synthesis) [Last visited: 2018-09-04]
- Object: Input text
- Language: English, Español, Italiano, Hindi, Português, Català
Read Aloud: 文字語音朗讀助理 - Chrome 線上應用程式商店
- 物件: 網頁
- 語言: 中
How 哥產生器!開發者整理素材,讓 HowHow 幫你講任何中文句子 - INSIDE
- 物件: 文字
- 語言: 中
ElevenLabs - Generative AI Text to Speech & Voice Cloning
- 物件: 文字
- 語言: 輸入中文,會聽到老外腔講中文
$ Voicenotes | AI Voice Notes App
- Input: Simply record your voice directly in the app. (
Note: File uploads are not currently supported) - Language: Automatic language detection. For example, if you record in spoken Chinese, you'll receive a Chinese transcript.
Introducing Stable Audio 2.0 — Stability AI
- 尚未提供線上服務
停止服務 工研院文字轉語音Web服務
- 物件: 網頁
- 語言: 中、英文、台語
Text to sound effect[edit]
OptimizerAI : Get Unlimited Sounds
- Free Usage limit: Free for use, downloading the file is not permitted.
Speech to text 工具[edit]
openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision
- Support Language: 99 languages
- Input file: Audio files
- Speaker identification: Need to integrate with (1) m-bain/whisperX: WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) or (2) pyannote/pyannote-audio: Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
- Real-Time Subtitles or Translation: Not Available
- Related:
Speech API - 語音辨識 | Google Cloud 「語音轉文字採用機器學習技術」,免費版語音辨識的額度 60 分鐘,詳 定價 | Cloud Speech API Documentation | Google Cloud。 [Last visited: 2018-09-04]
- Input: microphone & audio file (For audio file which longer than 1 minute, upload files to Google cloud storage.
- Language: 120 languages [1]
- Sample code:
- Related: Troubleshooting of Google cloud speech to text)
Bing 語音 API - 語音辨識軟體 | Microsoft Azure
- Input: Audio file. Format: wav & ogg[2]
- Language: Traditional Chinese, Simplified Chinese & English and more on the list[3]
- Sample code: Azure-Samples/SpeechToText-REST: REST Samples of Speech To Text API
- Related:
Clipchamp [Last visited: 2025-04-02]
- Input: audio or video file
- Support Language: 80+ languages[4][5]
- Comments: The free version seems to have no limitation on video duration, and you can also use AI to convert videos or audio into transcripts for free. However, during testing, the subtitles displayed for each time code were not complete sentences.
Meeting Ink - AI notetaker to transcribe and summarize your meetings and recordings.
- Support Language:
- Input file: Audio files
- Speaker identification: Available

- Real-Time Subtitles or Translation: Pro plan only $
- Free limit: 30 minutes max
OLAMI 中文語音辨識 API|歐拉蜜人工智慧開放平台(威盛電子) [Last visited: 2018-09-05]
- Input: Audio file. Format: wav & speex [6]
- Language: Traditional Chinese & Simplified Chinese [7]
- Sample code: olami-developers/olami-api-quickstart-curl-samples
- Related: Troubleshooting of Olami speech to text
影片要產生文字,可利用 youtube 的 Use automatic captioning - YouTube Help,約需要半天時間 [Last visited: 2018-09-04] 教學: YouTube超佛心,自動幫你加入字幕! | T客邦
- Input: Video
- Language:
- Sample code:
- Related:
语音识别 - 讯飞开放平台 [Last visited: 2018-09-06]
- Input: speex audio file less than 1 minute [8]
- Language: 中文(普通话)、英文、中文(粤语)、中文(四川话)
- Sample code:
- Related:
Amazon Transcribe – 自動語音辨識 – AWS (API documentation: What Is Amazon Transcribe? - Amazon Transcribe) [Last visited: 2018-09-05]
- Input: Audio file (Stored in S3 bucket). "Valid formats for the audio are mp3, mp4, wav and flac. [9]"
- Language: English, Spanish
- Sample code:
- Related:
Web Speech to Text 教學: 免費!中文影片語音轉文字字幕,支援超大影片與長時間錄音
- 物件: 電腦影像、聲音、YouTube 網址
- 語言: 中文、英文、日文、韓文
Voicetapp - AI Voice to Text Transcription
- Language: 中文、英文等多種語言
- Sample code:
- Related:
- Free limit: 5 minutes
SYSTRAN/faster-whisper: Faster Whisper transcription with CTranslate2
- Language: Fork from OpenAI Whisper
- Sample code: [1]
- Related:
- Free limit:
- Instruction: 雄::gsyan: 以 Faster Whisper 將影音辨識為文字檔案(字幕或逐字稿)
- Support Language:
- Input file: Audio files
- Speaker identification: Available

- Real-Time Subtitles or Translation: Not Available
- Free limit: 20 minutes max
Lark | Business Chat & Collaboration Tool (飞书 - 維基百科,自由的百科全書)
- Language:
- Sample code:
- Related:
- Free limit:
Const-me/Whisper: High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model on Win
- Language:
- Sample code:
- Related:
- Free limit:
iTranscribe: Transcribe Audio & Video to Text
- Language:
- Sample code:
- Related:
- Free limit:
剪映官網-全能易用的桌面端剪輯軟體-輕而易剪 上演大幕 中國軟體
- Language:
- Sample code:
- Related:
- Free limit:
Further reading[edit]
- WhisperDesktop 語音轉文字免費單機軟體,AI 影片字幕實測比較
- 一手評測|開箱 Good Tape、雅婷逐字稿、Vocol.ai,哪款 AI 逐字稿軟體最好用?|未來商務
- Troubleshooting of whisperX
如果改善 TTS
Related keywords[edit]
References[edit]
- ↑ Language Support | Cloud Speech-to-Text API | Google Cloud
- ↑ 語音轉換文字 API 參考(REST)-語音服務 - Azure Cognitive Services | Microsoft Docs
- ↑ 語言支援-語音服務 - Azure Cognitive Services | Microsoft Docs
- ↑ How to use autocaptions in Clipchamp - Microsoft Support
- ↑ 語言支援 - 語音服務 - Azure AI services | Microsoft Learn
- ↑ 文件中心 - OLAMI - 歐拉蜜人工智慧開放平台
- ↑ olami-api-quickstart-curl-samples/cloud-speech-recognition at master · olami-developers/olami-api-quickstart-curl-samples
- ↑ 语音听写 · 科大讯飞REST_API开发指南
- ↑ StartTranscriptionJob - Amazon Transcribe For best results, use a lossless format, such as FLAC or WAV with PCM 16-bit encoding.Your audio input can be sampled at any rate between 8000 and 48000 Hz. We suggest that you use 8000 Hz for low-quality audio and 16000 Hz for high-quality audio.