This page contains content sourced from the release page of my open-source project. For the latest information, please visit GitHub: https://github.com/jianchang512/stt/releases/0.0
faster-whisper Model Download (for stt project and "pyvideotrans Video Translation Dubbing" project's faster-whisper mode). Scroll down for openai-whisper models.
small 415MBsmall Baidu Netdisksmall.en 415MB
large-v1 Baidu Netdisklarge-v1 huggingface
large-v2 huggingfacelarge-v2 Baidu Netdisk
large-v3 huggingfacelarge-v3 Baidu Netdisk
distil-whisper-medium.en 671MBdistil-medium Baidu Netdisk
distil-whisper-large-v2 1.27Gdistil-large-v2 Baidu Netdisk
distil-whisper-large-v3 1.3Gdistil-whisper-large-v3 Baidu Netdisk
After downloading and extracting, copy the "models--Systran--faster-xx" folder from the archive into the
models
directory. After extracting and copying, the folder list under themodels
directory should look like this:Archive Content
Folder List under
models
directory after Correct Placement
openai-whisper Model Download (Only for "pyvideotrans Video Translation Dubbing Software" using the openai-whisper mode)
After downloading, place the .pt
file in the models
folder under the software directory.
FunASR Chinese Model Download
Baidu Netdisk Download (including speech recognition, punctuation restoration, and noise reduction models): https://pan.baidu.com/s/1v5wagiid6-K7GX9Pif4reA?pwd=y2ef
Huggingface (Download address outside the Great Firewall): https://huggingface.co/spaces/mortimerme/s4/resolve/main/FunASR-Chinese-models.7z?download=true
After downloading and extracting, you will see 3 folders:
iic
,damo
, and.__temp
. Copy these folders to themodels/hub
folder of the video translation software and overwrite.
cuBLASxx.dll and cuDNN Download
If you encounter "cublasxxx.dll does not exist" or a crash after enabling CUDA acceleration, please download this file. Copy the DLL files inside to C:/Windows/System32
or the software's root directory (where the .exe file is located).
Enter cmd
in the address bar of any folder to open a command prompt, then enter the command nvcc -V
to check your current CUDA version.
CUDA 11.x version download here: https://github.com/jianchang512/stt/releases/download/0.0/cuBLAS.and.cuDNN_CUDA11_win_v4.7z
CUDA 12.x version download here: https://github.com/jianchang512/stt/releases/download/0.0/cuBLAS.and.cuDNN_CUDA12_win_v1.7z
uvr5 Model Download
After downloading and extracting, you will get a uvr5_weights
folder. Copy this folder to the root directory of the video translation and dubbing software.
ffmpeg.exe Download
If you are using Windows and see an error indicating that the ffmpeg command cannot be found, you can download the following two files and place them in the root directory of the software or in the
ffmpeg
folder under the root directory.
https://github.com/jianchang512/stt/releases/download/0.0/ffmpeg.exe
https://github.com/jianchang512/stt/releases/download/0.0/ffprobe.exe