Troubleshooting of whisperX: Difference between revisions

* '''Compatibility:''' Ensure that the versions of CUDA, cuDNN, and PyTorch are compatible with each other. Refer to the [https://pytorch.org/get-started/previous-versions/ PyTorch documentation] for version compatibility details.
* '''Virtual Environments:''' If you’re using a virtual environment, make sure it has access to the system’s CUDA and cuDNN installations. You might need to install CUDA and cuDNN within the virtual environment, or ensure that environment variables such as <code>PATH</code> and (on Linux) <code>LD_LIBRARY_PATH</code> point to the system installation.
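
The compatibility and virtual-environment checks above can be partly automated from inside the environment in question. The following is a minimal sketch, not an official whisperX tool: the function name <code>cuda_environment_report</code> is hypothetical, and the choice of <code>CUDA_HOME</code> and <code>LD_LIBRARY_PATH</code> as the variables to inspect is an assumption that fits typical Linux setups.

```python
import os


def cuda_environment_report():
    """Collect details relevant to CUDA/cuDNN/PyTorch compatibility.

    Hypothetical helper: run it with the interpreter of the virtual
    environment you want to inspect.
    """
    report = {
        # Assumption: these variables commonly point at a system CUDA install.
        "CUDA_HOME": os.environ.get("CUDA_HOME"),
        "LD_LIBRARY_PATH": os.environ.get("LD_LIBRARY_PATH"),
    }
    try:
        import torch
    except (ImportError, OSError) as exc:
        # PyTorch is missing or its shared libraries failed to load.
        report["torch_installed"] = False
        report["torch_error"] = str(exc)
        return report

    report["torch_installed"] = True
    report["torch_version"] = torch.__version__
    # CUDA version PyTorch was built against (None for CPU-only builds).
    report["built_for_cuda"] = torch.version.cuda
    # Whether PyTorch can actually reach a GPU at runtime.
    report["cuda_available"] = torch.cuda.is_available()
    # cuDNN version PyTorch sees (None if cuDNN is unavailable).
    report["cudnn_version"] = torch.backends.cudnn.version()
    return report


if __name__ == "__main__":
    for key, value in cuda_environment_report().items():
        print(f"{key}: {value}")
```

If <code>built_for_cuda</code> is set but <code>cuda_available</code> is false, the environment most likely cannot see the driver or the CUDA libraries, which is the situation the virtual-environment note describes.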
=== WhisperX Audio Transcription Commands for Multiple Languages ===
Standard Command: English transcription output
<pre>
whisperx /path/to/audio/file.wav \
        --model large-v3 \
        --language en \
        --diarize \
        --batch_size 24 \
        --no_align \
        --chunk_size 10 \
        --hf_token your_huggingface_token \
        --output_dir /path/to/output/directory \
        --output_format all
</pre>
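To get a Thai transcript, the audio itself must be in Thai: the {{kbd | key=<nowiki>--language</nowiki>}} parameter tells whisperX which language is spoken and does not translate. Use one of the following methods: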
Method 1: Set language to Thai {{kbd | key=<nowiki>--language th</nowiki>}}
<pre>
whisperx /path/to/audio/file.wav \
        --model large-v3 \
        --language th \
        --diarize \
        --batch_size 24 \
        --no_align \
        --chunk_size 10 \
        --hf_token your_huggingface_token \
        --output_dir /path/to/output/directory \
        --output_format all
</pre>
Method 2: Auto-detect language (remove the {{kbd | key=<nowiki>--language</nowiki>}} parameter)
<pre>
whisperx /path/to/audio/file.wav \
        --model large-v3 \
        --diarize \
        --batch_size 24 \
        --no_align \
        --chunk_size 10 \
        --hf_token your_huggingface_token \
        --output_dir /path/to/output/directory \
        --output_format all
</pre>
Common Language Code Reference:
# th = Thai
# zh = Chinese
# en = English
# ja = Japanese
# ko = Korean
# es = Spanish
# fr = French
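
When the same command is run for several languages, the argument list can be assembled programmatically. The sketch below only reuses the flags shown in the commands above; the function name <code>build_whisperx_command</code> and the <code>LANGUAGE_CODES</code> table are hypothetical, and restricting input to the seven listed codes is a simplification for this example (whisperX accepts more codes than these).

```python
import shlex

# Codes copied from the reference list above; not the full set whisperX accepts.
LANGUAGE_CODES = {
    "th": "Thai",
    "zh": "Chinese",
    "en": "English",
    "ja": "Japanese",
    "ko": "Korean",
    "es": "Spanish",
    "fr": "French",
}


def build_whisperx_command(audio_path, output_dir, hf_token, language=None):
    """Return the whisperx argument list; language=None means auto-detect."""
    if language is not None and language not in LANGUAGE_CODES:
        raise ValueError(f"Unknown language code for this sketch: {language!r}")

    command = ["whisperx", audio_path, "--model", "large-v3"]
    if language is not None:
        # Method 1: state the spoken language explicitly.
        command += ["--language", language]
    # Method 2 (auto-detect) simply omits --language.
    command += [
        "--diarize",
        "--batch_size", "24",
        "--no_align",
        "--chunk_size", "10",
        "--hf_token", hf_token,
        "--output_dir", output_dir,
        "--output_format", "all",
    ]
    return command


if __name__ == "__main__":
    cmd = build_whisperx_command(
        "/path/to/audio/file.wav",
        "/path/to/output/directory",
        "your_huggingface_token",
        language="th",
    )
    # Print a shell-safe version of the command for inspection.
    print(shlex.join(cmd))
```

The returned list can be passed directly to <code>subprocess.run</code>, which avoids shell-quoting problems with paths that contain spaces.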


== whisperX Transcript File Format Guide ==
