Text to speech: Difference between revisions

Input: Simply record your voice directly in the app. ( Note: File uploads are not currently supported)
Language: Automatic language detection. For example, if you record in spoken Chinese, you'll receive a Chinese transcript.

Introducing Stable Audio 2.0 — Stability AI

尚未提供線上服務

停止服務 工研院文字轉語音Web服務

物件: 網頁
語言: 中、英文、台語

Text to sound effect[edit]

OptimizerAI : Get Unlimited Sounds

Free Usage limit: Free for use, downloading the file is not permitted.

Speech to text 工具[edit]

Speech to text

Related keywords[edit]

voice to text

References[edit]

@@ Line 1: / Line 1: @@
 Text to speech services
+{{Template:Generative AI Tool}}
 __TOC__
@@ Line 20: / Line 22: @@
 * 物件: 書籍
-[http://www.iq-t.com/PRODUCTS/textmp3_01.asp 文字MP3] on {{Win}} / [http://www.iq-t.com/SYSCOM/com01_01.asp TTS文字轉語音引擎]
+[https://www.iqt.ai/ 網際智慧]: [https://www.voai.ai/ VoAI] - "絕好聲創｜台灣口音高擬真AI聲優｜AI配音、拍照/文字生成Podcast"
-* 物件: 輸入文字 或 Excel
+* 物件: 輸入文字
-* 授權：如果需要購買「公播商業授權」等其它語音相關應用需求需要與廠商聯絡！ {{exclaim}}
+* 授權: 商業授權
 [https://support.office.com/zh-tw/article/%E9%85%8D%E5%90%88%E5%A4%9A%E8%AA%9E%E7%B3%BB-tts-%E4%BD%BF%E7%94%A8%E8%AA%9E%E9%9F%B3%E5%8A%9F%E8%83%BD-e522a4f2-37cb-492b-be6a-8997d23dfe70 配合多語系 TTS 使用語音功能 - Office 支援]
@@ Line 54: / Line 56: @@
 * 語言: 輸入中文，會聽到老外腔講中文
-[https://www.heygen.com/ HeyGen - AI Video Generator]
+''$'' [https://voicenotes.com/ Voicenotes | AI Voice Notes App]
-* 物件: 文字
+* Input: Simply record your voice directly in the app. ({{exclaim}} Note: File uploads are not currently supported)
-* 語言:
+* Language: Automatic language detection. For example, if you record in spoken Chinese, you'll receive a Chinese transcript.
-* 輸出：影片
+[https://stability.ai/news/stable-audio-2-0 Introducing Stable Audio 2.0 — Stability AI]
+* 尚未提供線上服務
 ''停止服務'' [http://tts.itri.org.tw/index.php 工研院文字轉語音Web服務]
 * 物件: 網頁
 * 語言: 中、英文、台語
+== Text to sound effect ==
+[https://www.optimizerai.xyz/my/all OptimizerAI : Get Unlimited Sounds]
+* Free Usage limit: Free for use, downloading the file is not permitted.
 == Speech to text 工具 ==
+[[Speech to text]]
-[https://cloud.google.com/speech/?hl=zh-tw Speech API - 語音辨識  |  Google Cloud] 「語音轉文字採用機器學習技術」，免費版語音辨識的額度 60 分鐘，詳 [https://cloud.google.com/speech-to-text/pricing 定價  |  Cloud Speech API Documentation  |  Google Cloud]。 {{access | date = 2018-09-04}}
-* Object: microphone & audio file (For audio file which longer than 1 minute, upload files to Google cloud storage.
-* Language: 120 languages <ref>[https://cloud.google.com/speech-to-text/docs/languages?hl=zh-tw Language Support  |  Cloud Speech-to-Text API  |  Google Cloud]</ref>
-* Sample code:
-* Related: [[Troubleshooting of Google cloud speech to text]])
-[https://azure.microsoft.com/zh-tw/services/cognitive-services/speech/ Bing 語音 API - 語音辨識軟體 | Microsoft Azure]
-* Object: Audio file. Format: wav & ogg<ref>[https://docs.microsoft.com/zh-tw/azure/cognitive-services/speech-service/rest-speech-to-text 語音轉換文字 API 參考（REST）-語音服務 - Azure Cognitive Services | Microsoft Docs]</ref>
-* Language: Traditional Chinese, Simplified Chinese & English and more on the list<ref>[https://docs.microsoft.com/zh-tw/azure/cognitive-services/speech-service/language-support#speech-to-text 語言支援-語音服務 - Azure Cognitive Services | Microsoft Docs]</ref>
-* Sample code: [https://github.com/Azure-Samples/SpeechToText-REST Azure-Samples/SpeechToText-REST: REST Samples of Speech To Text API]
-* Related:
-[https://tw.olami.ai/open/website/apiandsolution/api_solution OLAMI 中文語音辨識 API｜歐拉蜜人工智慧開放平台（威盛電子）] {{access | date = 2018-09-05}}
-* Object: Audio file. Format: wav & speex <ref>[https://tw.olami.ai/wiki/?mp=api_asr&content=api_asr1.html 文件中心 - OLAMI - 歐拉蜜人工智慧開放平台]</ref>
-* Language: Traditional Chinese & Simplified Chinese <ref>[https://github.com/olami-developers/olami-api-quickstart-curl-samples/tree/master/cloud-speech-recognition olami-api-quickstart-curl-samples/cloud-speech-recognition at master · olami-developers/olami-api-quickstart-curl-samples]</ref>
-* Sample code: [https://github.com/olami-developers/olami-api-quickstart-curl-samples/tree/master/cloud-speech-recognition olami-developers/olami-api-quickstart-curl-samples]
-* Related: [[Troubleshooting of Olami speech to text]]
-影片要產生文字，可利用 youtube 的 [https://support.google.com/youtube/answer/6373554?hl=en Use automatic captioning - YouTube Help]，約需要半天時間 {{access | date = 2018-09-04}} 教學: [https://www.techbang.com/posts/2107 YouTube超佛心，自動幫你加入字幕！ | T客邦]
-* Object: Video
-* Language:
-* Sample code:
-* Related:
-[https://www.xfyun.cn/doccenter/asr 语音识别 - 讯飞开放平台] {{access | date=2018-09-06}}
-* Object: speex audio file less than 1 minute <ref>[https://doc.xfyun.cn/rest_api/%E8%AF%AD%E9%9F%B3%E5%90%AC%E5%86%99.html 语音听写 · 科大讯飞REST_API开发指南]</ref>
-* Language: 中文（普通话）、英文、中文（粤语）、中文（四川话）
-* Sample code:
-* Related:
-[https://aws.amazon.com/tw/transcribe/ Amazon Transcribe – 自動語音辨識 – AWS] (API documentation: [https://docs.aws.amazon.com/transcribe/latest/dg/what-is-transcribe.html What Is Amazon Transcribe? - Amazon Transcribe]) {{access | date=2018-09-05}}
-* Object: Audio file (Stored in S3 bucket). "Valid formats for the audio are mp3, mp4, wav and flac. <ref>[https://docs.aws.amazon.com/transcribe/latest/dg/API_StartTranscriptionJob.html StartTranscriptionJob - Amazon Transcribe] For best results, use a lossless format, such as FLAC or WAV with PCM 16-bit encoding.Your audio input can be sampled at any rate between 8000 and 48000 Hz. We suggest that you use 8000 Hz for low-quality audio and 16000 Hz for high-quality audio.</ref>"
-* Language: English, Spanish
-* Sample code:
-* Related:
-[https://pulipulichen.github.io/HTML5-Speech-to-Text/ Web Speech to Text] 教學: [https://www.playpcesor.com/2019/12/Web-Speech-to-Text.html 免費！中文影片語音轉文字字幕，支援超大影片與長時間錄音]
-* 物件: 電腦影像、聲音、YouTube 網址
-* 語言: 中文、英文、日文、韓文
-[https://app.voicetapp.com/ Voicetapp - AI Voice to Text Transcription]
-* Language: 中文、英文等多種語言
-* Sample code:
-* Related:
-* Free limit: 5 minutes
-[https://github.com/openai/whisper openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision]
-* Language: 99 languages
-* Sample code:
-* Related:
-* Free limit:
-🎙️ MacWhisper https://goodsnooze.gumroad.com/l/macwhisper
-* Language: 100 languages
-* Sample code:
-* Related:
-* Free limit:
-[https://www.mygoodtape.com/ Good Tape]
-* Language:
-* Sample code:
-* Related:
-* Free limit: 20 minutes max
-[https://www.larksuite.com/ Lark | Business Chat & Collaboration Tool] ([https://zh.wikipedia.org/wiki/%E9%A3%9E%E4%B9%A6 飞书 - 維基百科，自由的百科全書])
-* Language:
-* Sample code:
-* Related:
-* Free limit:
-[https://github.com/aaaddress1/Whisper.py?fbclid=IwAR1rwZH-USj2NIt8pLYRGhIqWvQWUj1FQTx83qpBncno3ANWDUBI_duWr9M aaaddress1/Whisper.py: 白癡喔還要下 pip install 誰會用啦—隨開即用 Windows 版 OpenAI Whisper 逐字稿產生器]
-* Language:
-* Sample code:
-* Related:
-* Free limit:
 == Further reading ==
 * [https://www.playpcesor.com/2023/04/whisperdesktop-ai.html WhisperDesktop 語音轉文字免費單機軟體，AI 影片字幕實測比較]
+* [https://fc.bnext.com.tw/articles/view/3590?utm_source=fc_weekly&utm_medium=email&bx_heid=5084316299&utm_campaign=09-10-2024 一手評測｜開箱 Good Tape、雅婷逐字稿、Vocol.ai，哪款 AI 逐字稿軟體最好用？｜未來商務]
+* [[Troubleshooting of whisperX]]
 如果改善 TTS
 * [https://joehuang-pop.github.io/2020/07/02/Google-API-%E6%9C%89%E9%9D%88%E9%AD%82%E7%9A%84Google%E5%B0%8F%E5%A7%90%EF%BC%8C%E4%BD%BF%E7%94%A8-SSML%E6%8A%80%E8%A1%93%E5%BC%B7%E5%8C%96Text-to-Speech/ (Google API) 有靈魂的Google小姐，使用 SSML技術強化Text-to-Speech | 黃大仙的雲端修行室]
+== Related keywords ==
+* [[Video to text | voice to text]]
 == References ==

Text to speech: Difference between revisions

Latest revision as of 10:48, 6 June 2026

Contents

Text to speech 工具[edit]

Text to sound effect[edit]

Speech to text 工具[edit]

Further reading[edit]

Related keywords[edit]

References[edit]

Navigation menu

Text to speech: Difference between revisions

Latest revision as of 10:48, 6 June 2026

Text to speech 工具[edit]

Text to sound effect[edit]

Speech to text 工具[edit]

Further reading[edit]

Related keywords[edit]

References[edit]

Navigation menu

Search