14,672
edits
Line 75: | Line 75: | ||
{{Gd}} [https://github.com/openai/whisper openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision] | {{Gd}} [https://github.com/openai/whisper openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision] | ||
* Support Language: 99 languages | * Support Language: 99 languages | ||
* | * Input file: Audio files | ||
* | * Speaker identification: Need to integrate with [https://github.com/m-bain/whisperX m-bain/whisperX: WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)] | ||
* | * Real-Time Subtitles or Translation: Not Available | ||
* Related: | * Related: | ||
** [https://huggingface.co/spaces/Xenova/whisper-webgpu Whisper WebGPU - a Hugging Face Space by Xenova] | ** [https://huggingface.co/spaces/Xenova/whisper-webgpu Whisper WebGPU - a Hugging Face Space by Xenova] | ||
Line 100: | Line 100: | ||
* Comments: The free version seems to have no limitation on video duration, and you can also use AI to convert videos or audio into transcripts for free. However, during testing, the subtitles displayed for each time code were not complete sentences. | * Comments: The free version seems to have no limitation on video duration, and you can also use AI to convert videos or audio into transcripts for free. However, during testing, the subtitles displayed for each time code were not complete sentences. | ||
[https://ink.dwave.cc/en-US/pricing Meeting Ink - AI notetaker to transcribe and summarize your meetings and recordings.] | |||
* Support Language: | |||
* Input file: Audio files | |||
* Speaker identification: Available {{Gd}} | |||
* Real-Time Subtitles or Translation: Pro plan only ''$'' | |||
* Free limit: 30 minutes max | |||
[https://tw.olami.ai/open/website/apiandsolution/api_solution OLAMI 中文語音辨識 API|歐拉蜜人工智慧開放平台(威盛電子)] {{access | date = 2018-09-05}} | [https://tw.olami.ai/open/website/apiandsolution/api_solution OLAMI 中文語音辨識 API|歐拉蜜人工智慧開放平台(威盛電子)] {{access | date = 2018-09-05}} | ||
Line 143: | Line 149: | ||
[https://www.mygoodtape.com/ Good Tape] | [https://www.mygoodtape.com/ Good Tape] | ||
* Language: | * Support Language: | ||
* | * Input file: Audio files | ||
* | * Speaker identification: Available {{Gd}} | ||
* Real-Time Subtitles or Translation: Not Available | |||
* Free limit: 20 minutes max | * Free limit: 20 minutes max | ||
[https://www.larksuite.com/ Lark | Business Chat & Collaboration Tool] ([https://zh.wikipedia.org/wiki/%E9%A3%9E%E4%B9%A6 飞书 - 維基百科,自由的百科全書]) | [https://www.larksuite.com/ Lark | Business Chat & Collaboration Tool] ([https://zh.wikipedia.org/wiki/%E9%A3%9E%E4%B9%A6 飞书 - 維基百科,自由的百科全書]) |