Text to speech: Difference between revisions

From LemonWiki共筆
 
(22 intermediate revisions by the same user not shown)
Line 22: Line 22:
* Object: books


[http://www.iq-t.com/PRODUCTS/textmp3_01.asp 文字MP3] on {{Win}} / [http://www.iq-t.com/SYSCOM/com01_01.asp TTS文字轉語音引擎]
[https://www.iqt.ai/ 網際智慧]: [https://www.voai.ai/ VoAI] - "絕好聲創|台灣口音高擬真AI聲優|AI配音、拍照/文字生成Podcast"
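* Object: input text or Excel
* Object: input text
* License: contact the vendor if you need to purchase a "public broadcast commercial license" or have other speech-related application needs {{exclaim}}
* License: commercial license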


[https://support.office.com/zh-tw/article/%E9%85%8D%E5%90%88%E5%A4%9A%E8%AA%9E%E7%B3%BB-tts-%E4%BD%BF%E7%94%A8%E8%AA%9E%E9%9F%B3%E5%8A%9F%E8%83%BD-e522a4f2-37cb-492b-be6a-8997d23dfe70 配合多語系 TTS 使用語音功能 - Office 支援]
Line 56: Line 56:
* Language: Chinese input is read aloud in Chinese with a foreign accent


''$'' [https://voicenotes.com/ Voicenotes | AI Voice Notes App]
* Input: Simply record your voice directly in the app. ({{exclaim}} Note: File uploads are not currently supported)
* Language: Automatic language detection. For example, if you record in spoken Chinese, you'll receive a Chinese transcript.
[https://stability.ai/news/stable-audio-2-0 Introducing Stable Audio 2.0 — Stability AI]
* No online service available yet


''Discontinued'' [http://tts.itri.org.tw/index.php 工研院文字轉語音Web服務]
* Object: web page
* Language: Chinese, English, Taiwanese
== Text to sound effect ==
[https://www.optimizerai.xyz/my/all OptimizerAI : Get Unlimited Sounds]
* Free usage limit: free to use, but downloading the generated file is not permitted.


== Speech to text tools ==
 
[[Speech to text]]
[https://cloud.google.com/speech/?hl=zh-tw Speech API - 語音辨識  |  Google Cloud]: "speech-to-text powered by machine learning technology". The free tier includes 60 minutes of speech recognition; see [https://cloud.google.com/speech-to-text/pricing 定價  |  Cloud Speech API Documentation  |  Google Cloud] for details. {{access | date = 2018-09-04}}
* Object: microphone & audio file (audio files longer than 1 minute must be uploaded to Google Cloud Storage)
* Language: 120 languages <ref>[https://cloud.google.com/speech-to-text/docs/languages?hl=zh-tw Language Support  |  Cloud Speech-to-Text API  |  Google Cloud]</ref>
* Sample code:
* Related: [[Troubleshooting of Google cloud speech to text]]
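
The 1-minute rule noted above can be sketched as follows. This is a minimal illustration, assuming the Speech-to-Text REST v1 methods `speech:recognize` (short audio sent inline as base64 `content`) and `speech:longrunningrecognize` (longer audio referenced by a `gs://` `uri`). The helper name `build_recognize_request`, the `SYNC_LIMIT_SECONDS` constant, the example bucket and the `zh-TW` default are hypothetical; the code only builds the request body and sends nothing.

```python
import base64
import json

# Assumed threshold, taken from the "longer than 1 minute" note above.
SYNC_LIMIT_SECONDS = 60


def build_recognize_request(duration_seconds, audio_bytes=None, gcs_uri=None,
                            language_code="zh-TW"):
    """Return (method, body) for a Cloud Speech-to-Text v1 REST call.

    Hypothetical helper: it only prepares the request, nothing is sent.
    """
    config = {"languageCode": language_code}
    if duration_seconds <= SYNC_LIMIT_SECONDS:
        if audio_bytes is None:
            raise ValueError("short audio needs inline audio_bytes")
        # Short audio: embed the bytes as base64 in the request itself.
        audio = {"content": base64.b64encode(audio_bytes).decode("ascii")}
        method = "speech:recognize"
    else:
        if gcs_uri is None or not gcs_uri.startswith("gs://"):
            raise ValueError("long audio needs a gs:// URI")
        # Long audio: point at the file already uploaded to Cloud Storage.
        audio = {"uri": gcs_uri}
        method = "speech:longrunningrecognize"
    return method, {"config": config, "audio": audio}


method, body = build_recognize_request(90, gcs_uri="gs://example-bucket/talk.flac")
print(method)
print(json.dumps(body, ensure_ascii=False))
```

Keeping the threshold in one constant makes it easy to adjust if the service's limit differs from the note above.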
 
[https://azure.microsoft.com/zh-tw/services/cognitive-services/speech/ Bing 語音 API - 語音辨識軟體 | Microsoft Azure]
* Object: Audio file. Format: wav & ogg<ref>[https://docs.microsoft.com/zh-tw/azure/cognitive-services/speech-service/rest-speech-to-text 語音轉換文字 API 參考(REST)-語音服務 - Azure Cognitive Services | Microsoft Docs]</ref>
* Language: Traditional Chinese, Simplified Chinese & English and more on the list<ref>[https://docs.microsoft.com/zh-tw/azure/cognitive-services/speech-service/language-support#speech-to-text 語言支援-語音服務 - Azure Cognitive Services | Microsoft Docs]</ref>
* Sample code: [https://github.com/Azure-Samples/SpeechToText-REST Azure-Samples/SpeechToText-REST: REST Samples of Speech To Text API]
* Related:
 
[https://tw.olami.ai/open/website/apiandsolution/api_solution OLAMI 中文語音辨識 API|歐拉蜜人工智慧開放平台(威盛電子)] {{access | date = 2018-09-05}}
* Object: Audio file. Format: wav & speex <ref>[https://tw.olami.ai/wiki/?mp=api_asr&content=api_asr1.html 文件中心 - OLAMI - 歐拉蜜人工智慧開放平台]</ref>
* Language: Traditional Chinese & Simplified Chinese <ref>[https://github.com/olami-developers/olami-api-quickstart-curl-samples/tree/master/cloud-speech-recognition olami-api-quickstart-curl-samples/cloud-speech-recognition at master · olami-developers/olami-api-quickstart-curl-samples]</ref>
* Sample code: [https://github.com/olami-developers/olami-api-quickstart-curl-samples/tree/master/cloud-speech-recognition olami-developers/olami-api-quickstart-curl-samples]
* Related: [[Troubleshooting of Olami speech to text]]
 
To generate text from a video, use YouTube's [https://support.google.com/youtube/answer/6373554?hl=en Use automatic captioning - YouTube Help]; it takes about half a day. {{access | date = 2018-09-04}} Tutorial: [https://www.techbang.com/posts/2107 YouTube超佛心,自動幫你加入字幕! | T客邦]
* Object: Video
* Language:
* Sample code:
* Related:
 
[https://www.xfyun.cn/doccenter/asr 语音识别 - 讯飞开放平台] {{access | date=2018-09-06}}
* Object: speex audio file less than 1 minute <ref>[https://doc.xfyun.cn/rest_api/%E8%AF%AD%E9%9F%B3%E5%90%AC%E5%86%99.html 语音听写 · 科大讯飞REST_API开发指南]</ref>
* Language: Chinese (Mandarin), English, Chinese (Cantonese), Chinese (Sichuanese)
* Sample code:
* Related:
 
[https://aws.amazon.com/tw/transcribe/ Amazon Transcribe – 自動語音辨識 – AWS] (API documentation: [https://docs.aws.amazon.com/transcribe/latest/dg/what-is-transcribe.html What Is Amazon Transcribe? - Amazon Transcribe]) {{access | date=2018-09-05}}
* Object: Audio file (stored in an S3 bucket). "Valid formats for the audio are mp3, mp4, wav and flac. <ref>[https://docs.aws.amazon.com/transcribe/latest/dg/API_StartTranscriptionJob.html StartTranscriptionJob - Amazon Transcribe] For best results, use a lossless format, such as FLAC or WAV with PCM 16-bit encoding. Your audio input can be sampled at any rate between 8000 and 48000 Hz. We suggest that you use 8000 Hz for low-quality audio and 16000 Hz for high-quality audio.</ref>"
* Language: English, Spanish
* Sample code:
* Related:
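
The format and sample-rate constraints quoted in the entry above can be checked before uploading a file. The following is a minimal sketch, assuming the limits are exactly as quoted (they may have changed since the access date); the function name `check_transcribe_input` is hypothetical and it does not call any AWS API.

```python
from pathlib import Path

# Values quoted in the entry above; treat them as assumptions, not current limits.
VALID_FORMATS = {"mp3", "mp4", "wav", "flac"}
MIN_RATE_HZ, MAX_RATE_HZ = 8000, 48000


def check_transcribe_input(filename, sample_rate_hz):
    """Return a list of problems; an empty list means the file looks acceptable."""
    problems = []
    ext = Path(filename).suffix.lower().lstrip(".")
    if ext not in VALID_FORMATS:
        problems.append(f"unsupported format: {ext or '(none)'}")
    if not MIN_RATE_HZ <= sample_rate_hz <= MAX_RATE_HZ:
        problems.append(
            f"sample rate {sample_rate_hz} Hz outside {MIN_RATE_HZ}-{MAX_RATE_HZ} Hz"
        )
    return problems


print(check_transcribe_input("interview.flac", 16000))
print(check_transcribe_input("interview.ogg", 4000))
```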
 
[https://pulipulichen.github.io/HTML5-Speech-to-Text/ Web Speech to Text] Tutorial: [https://www.playpcesor.com/2019/12/Web-Speech-to-Text.html 免費!中文影片語音轉文字字幕,支援超大影片與長時間錄音]
* Object: video and audio on the computer, YouTube URL
* Language: Chinese, English, Japanese, Korean
 
[https://app.voicetapp.com/ Voicetapp - AI Voice to Text Transcription]
* Language: Chinese, English, and many other languages
* Sample code:
* Related:
* Free limit: 5 minutes
 
[https://github.com/openai/whisper openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision]
* Language: 99 languages
* Sample code:
* Related:
* Free limit:
 
[https://goodsnooze.gumroad.com/l/macwhisper MacWhisper]
* Language: 100 languages
* Sample code:
* Related:
* Free limit:
 
[https://www.mygoodtape.com/ Good Tape]
* Language:
* Sample code:
* Related:
* Free limit: 20 minutes max
 
[https://www.larksuite.com/ Lark | Business Chat & Collaboration Tool] ([https://zh.wikipedia.org/wiki/%E9%A3%9E%E4%B9%A6 飞书 - 維基百科,自由的百科全書])
* Language:
* Sample code:
* Related:
* Free limit:
 
[https://github.com/aaaddress1/Whisper.py?fbclid=IwAR1rwZH-USj2NIt8pLYRGhIqWvQWUj1FQTx83qpBncno3ANWDUBI_duWr9M aaaddress1/Whisper.py: 白癡喔還要下 pip install 誰會用啦—隨開即用 Windows 版 OpenAI Whisper 逐字稿產生器]
* Language:
* Sample code:
* Related:
* Free limit:


== Further reading ==
* [https://www.playpcesor.com/2023/04/whisperdesktop-ai.html WhisperDesktop 語音轉文字免費單機軟體,AI 影片字幕實測比較]
* [https://fc.bnext.com.tw/articles/view/3590?utm_source=fc_weekly&utm_medium=email&bx_heid=5084316299&utm_campaign=09-10-2024 一手評測|開箱 Good Tape、雅婷逐字稿、Vocol.ai,哪款 AI 逐字稿軟體最好用?|未來商務]
* [[Troubleshooting of whisperX]]


How to improve TTS
* [https://joehuang-pop.github.io/2020/07/02/Google-API-%E6%9C%89%E9%9D%88%E9%AD%82%E7%9A%84Google%E5%B0%8F%E5%A7%90%EF%BC%8C%E4%BD%BF%E7%94%A8-SSML%E6%8A%80%E8%A1%93%E5%BC%B7%E5%8C%96Text-to-Speech/ (Google API) 有靈魂的Google小姐,使用 SSML技術強化Text-to-Speech | 黃大仙的雲端修行室]
== Related keywords ==
* [[Video to text | voice to text]]


== References ==

Latest revision as of 10:48, 6 June 2026

Text to speech services


Text to speech tools

Pocket on Android or iOS: Listening to Articles in Pocket with Text-to-Speech - Pocket Support

  • Object: Saved web page (note: the content may be truncated rather than saved in full)

Google Translate

  • Object: Input text
  • Speed of speech: can be slowed down (ttsspeed=0.24)

Google 文字轉語音 - Google Play Android 應用程式 on Android

  • Object: Some books on Google play

博客來電子書櫃 on Android (uses the Google text-to-speech service)

  • Object: books

Kindle (supported on some versions): Amazon.com Help: Features Available in Kindle Books

  • Object: books

網際智慧: VoAI - "絕好聲創|台灣口音高擬真AI聲優|AI配音、拍照/文字生成Podcast"

  • Object: input text
  • License: commercial license

配合多語系 TTS 使用語音功能 - Office 支援

  • Object: "OneNote, Outlook, PowerPoint, and Word"

Bing Speech API - 語音辨識 | Microsoft Azure

  • Object: Input text

Cloud Text-to-Speech - Speech Synthesis  |  Google Cloud

  • Object: Input text

Wizzard Speech I ATT Natural Voices SDK [Last visited: 2018-09-04]

  • Object: Input text
  • Language: English, Spanish

vozMe - From text to speech (speech synthesis) [Last visited: 2018-09-04]

  • Object: Input text
  • Language: English, Español, Italiano, Hindi, Português, Català

Read Aloud: 文字語音朗讀助理 - Chrome 線上應用程式商店

  • Object: web page
  • Language: Chinese

How 哥產生器!開發者整理素材,讓 HowHow 幫你講任何中文句子 - INSIDE

  • Object: text
  • Language: Chinese

ElevenLabs - Generative AI Text to Speech & Voice Cloning

  • Object: text
  • Language: Chinese input is read aloud in Chinese with a foreign accent

$ Voicenotes | AI Voice Notes App

  • Input: Simply record your voice directly in the app. (Note: File uploads are not currently supported)
  • Language: Automatic language detection. For example, if you record in spoken Chinese, you'll receive a Chinese transcript.


Introducing Stable Audio 2.0 — Stability AI

  • No online service available yet

Discontinued: 工研院文字轉語音Web服務

  • Object: web page
  • Language: Chinese, English, Taiwanese

Text to sound effect

OptimizerAI : Get Unlimited Sounds

  • Free usage limit: free to use, but downloading the generated file is not permitted.

Speech to text tools

Speech to text

Further reading

How to improve TTS

Related keywords

References