Text to speech: Difference between revisions

From LemonWiki共筆
Jump to navigation Jump to search
 
(49 intermediate revisions by the same user not shown)
Line 1: Line 1:
Text to speech services
Text to speech services


{{Template:Draft}}
{{Template:Generative AI Tool}}
 
__TOC__


== Text to speech 工具 ==
== Text to speech 工具 ==
Line 9: Line 11:
[https://translate.google.com/ Google 翻譯]
[https://translate.google.com/ Google 翻譯]
* Object: Input text
* Object: Input text
* Speed of speech: {{Gd}} Allow to slow down the speed of speech ({{kbd | key=<nowiki>ttsspeed=0.24</nowiki>}})


[https://play.google.com/store/apps/details?id=com.google.android.tts Google 文字轉語音 - Google Play Android 應用程式] on {{Android}}
[https://play.google.com/store/apps/details?id=com.google.android.tts Google 文字轉語音 - Google Play Android 應用程式] on {{Android}}
Line 19: Line 22:
* 物件: 書籍
* 物件: 書籍


[http://www.iq-t.com/PRODUCTS/textmp3_01.asp 文字MP3] on {{Win}} / [http://www.iq-t.com/SYSCOM/com01_01.asp TTS文字轉語音引擎]
[https://www.iqt.ai/ 網際智慧]: [https://www.voai.ai/ VoAI] - "絕好聲創|台灣口音高擬真AI聲優|AI配音、拍照/文字生成Podcast"
* 物件: 輸入文字 或 Excel
* 物件: 輸入文字
* 授權: 商業授權


[https://support.office.com/zh-tw/article/%E9%85%8D%E5%90%88%E5%A4%9A%E8%AA%9E%E7%B3%BB-tts-%E4%BD%BF%E7%94%A8%E8%AA%9E%E9%9F%B3%E5%8A%9F%E8%83%BD-e522a4f2-37cb-492b-be6a-8997d23dfe70 配合多語系 TTS 使用語音功能 - Office 支援]
[https://support.office.com/zh-tw/article/%E9%85%8D%E5%90%88%E5%A4%9A%E8%AA%9E%E7%B3%BB-tts-%E4%BD%BF%E7%94%A8%E8%AA%9E%E9%9F%B3%E5%8A%9F%E8%83%BD-e522a4f2-37cb-492b-be6a-8997d23dfe70 配合多語系 TTS 使用語音功能 - Office 支援]
* 物件: 「OneNote、Outlook、PowerPoint 及 Word」
* 物件: 「OneNote、Outlook、PowerPoint 及 Word」
[http://tts.itri.org.tw/index.php 工研院文字轉語音Web服務]
* 物件: 網頁
* 語言: 中、英文、台語


[https://azure.microsoft.com/zh-tw/services/cognitive-services/speech/ Bing Speech API - 語音辨識 | Microsoft Azure]
[https://azure.microsoft.com/zh-tw/services/cognitive-services/speech/ Bing Speech API - 語音辨識 | Microsoft Azure]
Line 44: Line 44:
* Language: English, Español, Italiano, Hindi, Português, Català
* Language: English, Español, Italiano, Hindi, Português, Català


[https://chrome.google.com/webstore/detail/read-aloud-a-text-to-spee/hdhinadidafjejdhmfkjgnolgimiaplp?hl=zh-TW Read Aloud: 文字語音朗讀助理 - Chrome 線上應用程式商店]
* 物件: 網頁
* 語言: 中
[https://www.inside.com.tw/article/19966-howger-generator How 哥產生器!開發者整理素材,讓 HowHow 幫你講任何中文句子 - INSIDE]
* 物件: 文字
* 語言: 中
[https://elevenlabs.io/ ElevenLabs - Generative AI Text to Speech & Voice Cloning]
* 物件: 文字
* 語言: 輸入中文,會聽到老外腔講中文
''$'' [https://voicenotes.com/ Voicenotes | AI Voice Notes App]
* Input: Simply record your voice directly in the app. ({{exclaim}} Note: File uploads are not currently supported)
* Language: Automatic language detection. For example, if you record in spoken Chinese, you'll receive a Chinese transcript.
[https://stability.ai/news/stable-audio-2-0 Introducing Stable Audio 2.0 — Stability AI]
* 尚未提供線上服務
''停止服務'' [http://tts.itri.org.tw/index.php 工研院文字轉語音Web服務]
* 物件: 網頁
* 語言: 中、英文、台語
== Text to sound effect ==
[https://www.optimizerai.xyz/my/all OptimizerAI : Get Unlimited Sounds]
* Free Usage limit: Free for use, downloading the file is not permitted.


== Speech to text 工具 ==
== Speech to text 工具 ==
[[Speech to text]]


[https://cloud.google.com/speech/?hl=zh-tw Speech API - 語音辨識  |  Google Cloud] 「語音轉文字採用機器學習技術」,免費版語音辨識的額度 60 分鐘,詳 [https://cloud.google.com/speech-to-text/pricing 定價  |  Cloud Speech API Documentation  |  Google Cloud]。 {{access | date = 2018-09-04}}
== Further reading ==
* Object: microphone & audio file (For audio file which longer than 1 minute, upload files to Google cloud storage. Related page: [[Troubleshooting of Google cloud speech to text]])
* [https://www.playpcesor.com/2023/04/whisperdesktop-ai.html WhisperDesktop 語音轉文字免費單機軟體,AI 影片字幕實測比較]
* Language: 120 languages <ref>[https://cloud.google.com/speech-to-text/docs/languages?hl=zh-tw Language Support  |  Cloud Speech-to-Text API  |  Google Cloud]</ref>
* [https://fc.bnext.com.tw/articles/view/3590?utm_source=fc_weekly&utm_medium=email&bx_heid=5084316299&utm_campaign=09-10-2024 一手評測|開箱 Good Tape、雅婷逐字稿、Vocol.ai,哪款 AI 逐字稿軟體最好用?|未來商務]
 
* [[Troubleshooting of whisperX]]
[https://tw.olami.ai/open/website/apiandsolution/api_solution OLAMI 中文語音辨識 API|歐拉蜜人工智慧開放平台(威盛電子)] {{access | date = 2018-09-05}}
* Object: Audio file. Format: wac & speex <ref>[https://tw.olami.ai/wiki/?mp=api_asr&content=api_asr1.html 文件中心 - OLAMI - 歐拉蜜人工智慧開放平台]</ref>
* Language: Traditional Chinese & Simplified Chinese <ref>[https://github.com/olami-developers/olami-api-quickstart-curl-samples/tree/master/cloud-speech-recognition olami-api-quickstart-curl-samples/cloud-speech-recognition at master · olami-developers/olami-api-quickstart-curl-samples]</ref>


影片要產生文字,可利用 youtube 的 [https://support.google.com/youtube/answer/6373554?hl=en Use automatic captioning - YouTube Help],約需要半天時間 {{access | date = 2018-09-04}} 教學: [https://www.techbang.com/posts/2107 YouTube超佛心,自動幫你加入字幕! | T客邦]
如果改善 TTS
* Object: Video
* [https://joehuang-pop.github.io/2020/07/02/Google-API-%E6%9C%89%E9%9D%88%E9%AD%82%E7%9A%84Google%E5%B0%8F%E5%A7%90%EF%BC%8C%E4%BD%BF%E7%94%A8-SSML%E6%8A%80%E8%A1%93%E5%BC%B7%E5%8C%96Text-to-Speech/ (Google API) 有靈魂的Google小姐,使用 SSML技術強化Text-to-Speech | 黃大仙的雲端修行室]
* Language:


[https://aws.amazon.com/tw/transcribe/ Amazon Transcribe – 自動語音辨識 – AWS] (API documentation: [https://docs.aws.amazon.com/transcribe/latest/dg/what-is-transcribe.html What Is Amazon Transcribe? - Amazon Transcribe])
== Related keywords ==
* Object: Audio file (Stored in S3 bucket). "Valid formats for the audio are mp3, mp4, wav and flac. <ref>[https://docs.aws.amazon.com/transcribe/latest/dg/API_StartTranscriptionJob.html StartTranscriptionJob - Amazon Transcribe] For best results, use a lossless format, such as FLAC or WAV with PCM 16-bit encoding.Your audio input can be sampled at any rate between 8000 and 48000 Hz. We suggest that you use 8000 Hz for low-quality audio and 16000 Hz for high-quality audio.</ref>"
* [[Video to text | voice to text]]
* Language: English, Spanish


== References ==
== References ==

Latest revision as of 10:48, 6 June 2026

Text to speech services


Text to speech 工具[edit]

Pocket on Android Os android.png or iOS Listening to Articles in Pocket with Text-to-Speech - Pocket Support

  • Object: Saved web page Icon_exclaim.gif the content may not be saved completed but truncated.

Google 翻譯

  • Object: Input text
  • Speed of speech: Good.gif Allow to slow down the speed of speech (ttsspeed=0.24)

Google 文字轉語音 - Google Play Android 應用程式 on Android Os android.png

  • Object: Some books on Google play

博客來電子書櫃 on Android Os android.png 使用 Google 文字轉語音的服務

  • Object: books

Kindle 部分版本支援 Amazon.com Help: Features Available in Kindle Books

  • 物件: 書籍

網際智慧: VoAI - "絕好聲創|台灣口音高擬真AI聲優|AI配音、拍照/文字生成Podcast"

  • 物件: 輸入文字
  • 授權: 商業授權

配合多語系 TTS 使用語音功能 - Office 支援

  • 物件: 「OneNote、Outlook、PowerPoint 及 Word」

Bing Speech API - 語音辨識 | Microsoft Azure

  • Object: Input text

Cloud Text-to-Speech - Speech Synthesis  |  Google Cloud

  • Object: Input text

Wizzard Speech I ATT Natural Voices SDK [Last visited: 2018-09-04]

  • Object: Input text
  • Language: English, Spanish

vozMe - From text to speech (speech synthesis) [Last visited: 2018-09-04]

  • Object: Input text
  • Language: English, Español, Italiano, Hindi, Português, Català

Read Aloud: 文字語音朗讀助理 - Chrome 線上應用程式商店

  • 物件: 網頁
  • 語言: 中

How 哥產生器!開發者整理素材,讓 HowHow 幫你講任何中文句子 - INSIDE

  • 物件: 文字
  • 語言: 中

ElevenLabs - Generative AI Text to Speech & Voice Cloning

  • 物件: 文字
  • 語言: 輸入中文,會聽到老外腔講中文

$ Voicenotes | AI Voice Notes App

  • Input: Simply record your voice directly in the app. (Icon_exclaim.gif Note: File uploads are not currently supported)
  • Language: Automatic language detection. For example, if you record in spoken Chinese, you'll receive a Chinese transcript.


Introducing Stable Audio 2.0 — Stability AI

  • 尚未提供線上服務

停止服務 工研院文字轉語音Web服務

  • 物件: 網頁
  • 語言: 中、英文、台語

Text to sound effect[edit]

OptimizerAI : Get Unlimited Sounds

  • Free Usage limit: Free for use, downloading the file is not permitted.

Speech to text 工具[edit]

Speech to text

Further reading[edit]

如果改善 TTS

Related keywords[edit]

References[edit]