Troubleshooting of whisperX: Difference between revisions

5,687 bytes added, 4 December 2024

Created page with "[https://github.com/m-bain/whisperX whisperX] is an enhanced version of OpenAI's Whisper, offering fast automatic speech recognition with word-level timestamps and speaker diarization. It uses the faster-whisper backend and can run the large-v2 model on less than 8GB of GPU memory. whisperX also includes voice activity detection (VAD) preprocessing, reducing hallucinations and supporting batch processing. == whisperX Troubleshooting Guide == === Error: HF_TOKEN environ..."

Planetoid

Revision as of 19:02, 4 December 2024
