Text to speech: Difference between revisions

Jump to navigation Jump to search
m
 
Line 76: Line 76:
* Support Language: 99 languages
* Support Language: 99 languages
* Input file: Audio files
* Input file: Audio files
* Speaker identification: Need to integrate with [https://github.com/m-bain/whisperX m-bain/whisperX: WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)]
* Speaker identification: Need to integrate with (1) [https://github.com/m-bain/whisperX m-bain/whisperX: WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)] or (2) [https://github.com/pyannote/pyannote-audio pyannote/pyannote-audio: Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding]
* Real-Time Subtitles or Translation: Not Available
* Real-Time Subtitles or Translation: Not Available
* Related:
* Related:

Navigation menu