Editing Speech to text
Revision as of 10:48, 6 June 2026 by Planetoid (talk | contribs) (Created page with "== Speech to text 工具 == {{Gd}} [https://github.com/openai/whisper openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision] * Support Language: 99 languages * Input file: Audio files * Speaker identification: Need to integrate with (1) [https://github.com/m-bain/whisperX m-bain/whisperX: WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)] or (2) [https://github.com/pyannote/pyannote-audio pyannote/pyannote-audio: Neura...")
Warning: You are editing an out-of-date revision of this page. If you publish it, any changes made since this revision will be lost.
Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.
Retrieved from "https://wiki.planetoid.info/index.php/Speech_to_text"