Troubleshooting of whisperX: Difference between revisions


Revision as of 18:26, 1 September 2025

1,401 bytes added, 1 September 2025

→‎whisperX Troubleshooting Guide

Planetoid

@@ Line 106: / Line 106: @@
 * '''Compatibility:''' Ensure that the versions of CUDA, cuDNN, and PyTorch are compatible with each other. Refer to the [https://pytorch.org/get-started/previous-versions/ PyTorch documentation] for version compatibility details.
 * '''Virtual Environments:''' If you’re using a virtual environment, make sure it has access to the system’s CUDA and cuDNN installations. You might need to install CUDA and cuDNN within the virtual environment or ensure that the environment variables are correctly set.
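One way to check the compatibility point above is to ask PyTorch which CUDA and cuDNN builds it sees from inside the active virtual environment. The following is a minimal sketch and not part of the original guide: the helper name `cuda_report` is hypothetical, and the two environment variables it prints (`CUDA_HOME`, `LD_LIBRARY_PATH`) are only the ones commonly used on Linux, which is an assumption about your setup.

```python
import os


def cuda_report():
    """Collect the CUDA-related versions PyTorch reports in this environment.

    Hypothetical helper for illustration; every value stays None/False
    when PyTorch is not installed in the active environment.
    """
    report = {"torch": None, "cuda_build": None, "cudnn": None, "gpu_available": False}
    try:
        import torch
    except ImportError:
        return report
    report["torch"] = torch.__version__
    # CUDA version this PyTorch build was compiled against (None on CPU-only builds).
    report["cuda_build"] = torch.version.cuda
    # cuDNN version PyTorch can load (None when cuDNN is unavailable).
    report["cudnn"] = torch.backends.cudnn.version()
    report["gpu_available"] = torch.cuda.is_available()
    return report


for name, value in cuda_report().items():
    print(f"{name}: {value}")

# Assumed Linux-style variables; adjust for your platform.
for var in ("CUDA_HOME", "LD_LIBRARY_PATH"):
    print(f"{var}: {os.environ.get(var, '<not set>')}")
```

Run it with the virtual environment activated. If `torch` prints a version but `gpu_available` is `False`, the environment probably cannot see the system CUDA libraries, or a CPU-only PyTorch build is installed.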
+=== WhisperX Audio Transcription Commands for Multiple Languages ===
+Standard command (English transcription output):
+<pre>
+whisperx /path/to/audio/file.wav \
+        --model large-v3 \
+        --language en \
+        --diarize \
+        --batch_size 24 \
+        --no_align \
+        --chunk_size 10 \
+        --hf_token your_huggingface_token \
+        --output_dir /path/to/output/directory \
+        --output_format all
+</pre>
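+To transcribe Thai audio instead, use one of the following methods. The {{kbd | key=<nowiki>--language</nowiki>}} parameter names the language spoken in the recording, and the transcript is written in that language.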
+Method 1: Set language to Thai {{kbd | key=<nowiki>--language th</nowiki>}}
+<pre>
+whisperx /path/to/audio/file.wav \
+        --model large-v3 \
+        --language th \
+        --diarize \
+        --batch_size 24 \
+        --no_align \
+        --chunk_size 10 \
+        --hf_token your_huggingface_token \
+        --output_dir /path/to/output/directory \
+        --output_format all
+</pre>
+Method 2: Auto-detect language (remove the {{kbd | key=<nowiki>--language</nowiki>}} parameter)
+<pre>
+whisperx /path/to/audio/file.wav \
+        --model large-v3 \
+        --diarize \
+        --batch_size 24 \
+        --no_align \
+        --chunk_size 10 \
+        --hf_token your_huggingface_token \
+        --output_dir /path/to/output/directory \
+        --output_format all
+</pre>
+Common Language Code Reference:
+# th = Thai
+# zh = Chinese
+# en = English
+# ja = Japanese
+# ko = Korean
+# es = Spanish
+# fr = French
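The three commands above differ only in the `--language` flag, so they can be generated from one helper. This is a minimal sketch under stated assumptions: `build_whisperx_command` and `LANGUAGE_CODES` are hypothetical names introduced here, the accepted codes are limited to the ones listed in this guide, and all other flags are copied unchanged from the commands above.

```python
import shlex

# Language codes copied from the reference list in this guide; extend as needed.
LANGUAGE_CODES = {
    "th": "Thai",
    "zh": "Chinese",
    "en": "English",
    "ja": "Japanese",
    "ko": "Korean",
    "es": "Spanish",
    "fr": "French",
}


def build_whisperx_command(audio_path, output_dir, hf_token, language=None):
    """Return the whisperx argument list used in this guide.

    Hypothetical helper: language=None leaves out --language so that
    whisperX auto-detects the language (Method 2).
    """
    if language is not None and language not in LANGUAGE_CODES:
        raise ValueError(f"language code not in this guide's list: {language!r}")
    cmd = ["whisperx", audio_path, "--model", "large-v3"]
    if language is not None:
        cmd += ["--language", language]
    cmd += [
        "--diarize",
        "--batch_size", "24",
        "--no_align",
        "--chunk_size", "10",
        "--hf_token", hf_token,
        "--output_dir", output_dir,
        "--output_format", "all",
    ]
    return cmd


# Print the Thai command (Method 1) as a copy-pasteable shell line.
print(shlex.join(build_whisperx_command(
    "/path/to/audio/file.wav",
    "/path/to/output/directory",
    "your_huggingface_token",
    language="th",
)))
```

To execute the command rather than print it, the returned list can be passed to `subprocess.run(cmd, check=True)`. Passing a list avoids shell quoting problems with paths that contain spaces.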
 == whisperX Transcript File Format Guide ==
