Text to speech

From LemonWiki共筆
Revision as of 11:19, 10 September 2025 by Planetoid

Text to speech services


Text to speech tools

Pocket on Android or iOS (Listening to Articles in Pocket with Text-to-Speech - Pocket Support)

  • Object: Saved web page. (Caution) The content may be truncated rather than saved in full.

Google Translate

  • Object: Input text
  • Speed of speech: (Good) Speech can be slowed down (ttsspeed=0.24)
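
The ttsspeed=0.24 setting above looks like a URL query parameter. A minimal sketch of how such a request URL could be assembled follows. Only the parameter name ttsspeed and the value 0.24 come from this page; the base URL and the q and tl parameter names are hypothetical placeholders, not a documented Google API.

```python
from urllib.parse import urlencode


def build_tts_url(text, lang="en", speed=0.24,
                  base_url="https://example.invalid/translate_tts"):
    """Build a text-to-speech request URL with a slowed-down speech rate.

    Assumptions: base_url, "q" and "tl" are placeholders for illustration.
    Only "ttsspeed" (and the sample value 0.24) is taken from the page.
    """
    params = {
        "q": text,          # hypothetical: the text to be spoken
        "tl": lang,         # hypothetical: the target language code
        "ttsspeed": speed,  # from the page: lower values mean slower speech
    }
    return f"{base_url}?{urlencode(params)}"


if __name__ == "__main__":
    print(build_tts_url("hello world"))
```

The function only builds the URL and sends nothing, so it can be adapted to whichever endpoint actually accepts the parameter.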

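Google Text-to-Speech - Android app on Google Play (on Android)

  • Object: Some books on Google Play

博客來電子書櫃 (Books.com.tw e-book shelf) on Android, which uses the Google Text-to-Speech service

  • Object: Books

Kindle: supported on some versions (Amazon.com Help: Features Available in Kindle Books)

  • Object: Books

網際智慧: VoAI - "絕好聲創 | Highly realistic AI voice actors with Taiwanese accents | AI voice-over, photo/text-to-podcast generation"

  • Object: Input text
  • License: Commercial license

Using the Speak feature with Multilingual TTS - Office Support

  • Object: OneNote, Outlook, PowerPoint, and Word

Bing Speech API - Speech Recognition | Microsoft Azure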

  • Object: Input text

Cloud Text-to-Speech - Speech Synthesis  |  Google Cloud

  • Object: Input text

Wizzard Speech I ATT Natural Voices SDK [Last visited: 2018-09-04]

  • Object: Input text
  • Language: English, Spanish

vozMe - From text to speech (speech synthesis) [Last visited: 2018-09-04]

  • Object: Input text
  • Language: English, Español, Italiano, Hindi, Português, Català

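Read Aloud: A Text to Speech Voice Reader - Chrome Web Store

  • Object: Web page
  • Language: Chinese

HowHow generator! A developer compiled the source clips so that HowHow can say any Chinese sentence for you - INSIDE

  • Object: Text
  • Language: Chinese

ElevenLabs - Generative AI Text to Speech & Voice Cloning

  • Object: Text
  • Language: Chinese input is read aloud in Chinese with a foreign accent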

$ Voicenotes | AI Voice Notes App

  • Input: Simply record your voice directly in the app. (Note: File uploads are not currently supported)
  • Language: Automatic language detection. For example, if you record in spoken Chinese, you'll receive a Chinese transcript.


Introducing Stable Audio 2.0 — Stability AI

  • No online service is available yet

Discontinued: ITRI (工研院) text-to-speech web service

  • Object: Web page
  • Language: Chinese, English, Taiwanese

Text to sound effect

OptimizerAI : Get Unlimited Sounds

  • Free usage limit: Free to use, but downloading the generated file is not permitted.

Speech to text tools

(Good) openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision
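
As a rough illustration of how openai/whisper can be used from Python, the sketch below transcribes a file and formats the result as SRT subtitles. The calls whisper.load_model and model.transcribe, and the "start", "end" and "text" keys of each segment, belong to the openai-whisper package; the helper names format_timestamp, segments_to_srt and transcribe_to_srt, the file name, and the "base" model choice are assumptions made for this example.

```python
def format_timestamp(seconds: float) -> str:
    """Convert seconds to an SRT timestamp (HH:MM:SS,mmm)."""
    ms = round(seconds * 1000)
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments) -> str:
    """Render Whisper-style segments (dicts with start, end, text) as SRT."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_timestamp(seg["start"])
        end = format_timestamp(seg["end"])
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    return "\n".join(blocks)


def transcribe_to_srt(audio_path: str, model_name: str = "base") -> str:
    """Transcribe an audio file with openai-whisper and return SRT text."""
    # Imported lazily so the helpers above work without the package.
    import whisper  # third-party: pip install openai-whisper

    model = whisper.load_model(model_name)
    result = model.transcribe(audio_path)
    return segments_to_srt(result["segments"])


if __name__ == "__main__":
    # "meeting.mp3" is a hypothetical file name.
    print(transcribe_to_srt("meeting.mp3"))
```

Larger models are generally more accurate but slower, so the model name is left as a parameter.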

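Speech API - Speech Recognition | Google Cloud: "speech-to-text powered by machine learning". The free tier includes 60 minutes of speech recognition; see Pricing | Cloud Speech API Documentation | Google Cloud [Last visited: 2018-09-04]

Bing Speech API - Speech Recognition Software | Microsoft Azure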

Clipchamp [Last visited: 2025-04-02]

  • Input: audio or video file
  • Supported languages: 80+ languages[4][5]
  • Comments: The free version appears to have no limit on video duration, and it can convert video or audio into transcripts with AI at no cost. In testing, however, the subtitle shown for each timecode was not a complete sentence.
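
When subtitle cues are split mid-sentence, as observed above, one possible post-processing step is to merge consecutive cues until the text ends with sentence-final punctuation. The sketch below is an illustration only and is not part of Clipchamp; the Cue class, the merge_into_sentences function and the punctuation list are assumptions made for this example.

```python
from dataclasses import dataclass

# Sentence-final punctuation for English and Chinese (an assumed, partial list).
SENTENCE_END = (".", "!", "?", "。", "!", "?")


@dataclass
class Cue:
    start: float  # start time in seconds
    end: float    # end time in seconds
    text: str


def merge_into_sentences(cues):
    """Merge consecutive cues until the text ends with sentence punctuation."""
    merged = []
    pending = None
    for cue in cues:
        text = cue.text.strip()
        if not text:
            continue  # skip empty cues
        if pending is None:
            pending = Cue(cue.start, cue.end, text)
        else:
            pending.end = cue.end
            # Joined with a space, which suits English; Chinese text
            # would usually be joined without one.
            pending.text = f"{pending.text} {text}"
        if pending.text.endswith(SENTENCE_END):
            merged.append(pending)
            pending = None
    if pending is not None:
        merged.append(pending)  # keep a trailing fragment as is
    return merged


if __name__ == "__main__":
    cues = [Cue(0.0, 1.5, "The free version"), Cue(1.5, 3.0, "has no limit.")]
    for cue in merge_into_sentences(cues):
        print(cue.start, cue.end, cue.text)
```

The approach relies on the transcript containing punctuation. Without it, the cues collapse into one long entry, so a maximum duration per merged cue may be worth adding.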

Meeting Ink - AI notetaker to transcribe and summarize your meetings and recordings.

  • Support Language:
  • Input file: Audio files
  • Speaker identification: Available (Good)
  • Real-Time Subtitles or Translation: Pro plan only (paid)
  • Free limit: 30 minutes max

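OLAMI Chinese Speech Recognition API | OLAMI (歐拉蜜) AI Open Platform (VIA Technologies) [Last visited: 2018-09-05]

To generate text from a video, use YouTube's automatic captioning (Use automatic captioning - YouTube Help); it takes about half a day [Last visited: 2018-09-04]. Tutorial: "YouTube is super generous: it adds captions for you automatically!" | T客邦 (TechBang)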

  • Input: Video
  • Language:
  • Sample code:
  • Related:

Speech Recognition - iFLYTEK Open Platform (讯飞开放平台) [Last visited: 2018-09-06]

  • Input: Speex audio file shorter than 1 minute [8]
  • Language: Chinese (Mandarin), English, Chinese (Cantonese), Chinese (Sichuanese)
  • Sample code:
  • Related:

Amazon Transcribe – Automatic Speech Recognition – AWS (API documentation: What Is Amazon Transcribe? - Amazon Transcribe) [Last visited: 2018-09-05]

  • Input: Audio file (stored in an S3 bucket). "Valid formats for the audio are mp3, mp4, wav and flac." [9]
  • Language: English, Spanish
  • Sample code:
  • Related:
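
Because the quoted documentation accepts only mp3, mp4, wav and flac, it can help to check a file's extension before uploading it to S3. The sketch below covers only that local check. The function name media_format is an assumption, and the format list reflects the quote above as of the visit date, so it may have changed since.

```python
from pathlib import Path

# Formats quoted on this page from the Amazon Transcribe documentation (2018).
VALID_FORMATS = {"mp3", "mp4", "wav", "flac"}


def media_format(filename: str) -> str:
    """Return the lowercase media format, or raise if it is not in the list."""
    suffix = Path(filename).suffix.lower().lstrip(".")
    if suffix not in VALID_FORMATS:
        raise ValueError(
            f"Unsupported format {suffix!r}; expected one of {sorted(VALID_FORMATS)}"
        )
    return suffix


if __name__ == "__main__":
    print(media_format("interview.WAV"))
```

This checks the file name only, not the actual encoding of the file.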

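Web Speech to Text. Tutorial: "Free! Speech-to-text subtitles for Chinese videos, with support for very large videos and long recordings"

  • Object: Video and audio from the computer, YouTube URLs
  • Language: Chinese, English, Japanese, Korean

Voicetapp - AI Voice to Text Transcription

  • Language: Chinese, English, and many other languages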
  • Sample code:
  • Related:
  • Free limit: 5 minutes

SYSTRAN/faster-whisper: Faster Whisper transcription with CTranslate2

Good Tape

  • Support Language:
  • Input file: Audio files
  • Speaker identification: Available (Good)
  • Real-Time Subtitles or Translation: Not Available
  • Free limit: 20 minutes max


Lark | Business Chat & Collaboration Tool (see 飞书 (Feishu) - Wikipedia, the free encyclopedia)

  • Language:
  • Sample code:
  • Related:
  • Free limit:

Const-me/Whisper: High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model on Win  

  • Language:
  • Sample code:
  • Related:
  • Free limit:

iTranscribe: Transcribe Audio & Video to Text

  • Language:
  • Sample code:
  • Related:
  • Free limit:

剪映 (JianYing) official site - a versatile, easy-to-use desktop video editing application (Caution: software from China)

  • Language:
  • Sample code:
  • Related:
  • Free limit:

Further reading

How to improve TTS

Related keywords

References