Skip to content

This page contains content sourced from the release page of my open-source project. For the latest information, please visit GitHub: https://github.com/jianchang512/stt/releases/0.0

faster-whisper Model Download (for stt project and "pyvideotrans Video Translation Dubbing" project's faster-whisper mode). Scroll down for openai-whisper models.

image

tiny 64MBtiny.en 64MB

base 124MBbase.en 124MB

small 415MBsmall Baidu Netdisksmall.en 415MB

medium 1.27Gmedium.en 1.27G

large-v1 Baidu Netdisklarge-v1 huggingface

large-v2 huggingfacelarge-v2 Baidu Netdisk

large-v3 huggingfacelarge-v3 Baidu Netdisk

large-v3-turbo 1.3G

distil-whisper-small.en 282MB

distil-whisper-medium.en 671MBdistil-medium Baidu Netdisk

distil-whisper-large-v2 1.27Gdistil-large-v2 Baidu Netdisk

distil-whisper-large-v3 1.3Gdistil-whisper-large-v3 Baidu Netdisk

After downloading and extracting, copy the "models--Systran--faster-xx" folder from the archive into the models directory. After extracting and copying, the folder list under the models directory should look like this:

Archive Content

image

Folder List under models directory after Correct Placement

image




openai-whisper Model Download (Only for "pyvideotrans Video Translation Dubbing Software" using the openai-whisper mode)

image

After downloading, place the .pt file in the models folder under the software directory. image

tiny.pt modeltiny.en.pt model

base.pt modelbase.en.pt model

small.pt model

small.en.pt model

medium.pt model

medium.en.pt model

large-v1.pt model

large-v2.pt model

large-v3.pt model

large-v3-turbo.pt model

image



FunASR Chinese Model Download

Baidu Netdisk Download (including speech recognition, punctuation restoration, and noise reduction models): https://pan.baidu.com/s/1v5wagiid6-K7GX9Pif4reA?pwd=y2ef

Huggingface (Download address outside the Great Firewall): https://huggingface.co/spaces/mortimerme/s4/resolve/main/FunASR-Chinese-models.7z?download=true

After downloading and extracting, you will see 3 folders: iic, damo, and .__temp. Copy these folders to the models/hub folder of the video translation software and overwrite.

image



cuBLASxx.dll and cuDNN Download

If you encounter "cublasxxx.dll does not exist" or a crash after enabling CUDA acceleration, please download this file. Copy the DLL files inside to C:/Windows/System32 or the software's root directory (where the .exe file is located).

Enter cmd in the address bar of any folder to open a command prompt, then enter the command nvcc -V to check your current CUDA version.

CUDA 11.x version download here: https://github.com/jianchang512/stt/releases/download/0.0/cuBLAS.and.cuDNN_CUDA11_win_v4.7z

CUDA 12.x version download here: https://github.com/jianchang512/stt/releases/download/0.0/cuBLAS.and.cuDNN_CUDA12_win_v1.7z



uvr5 Model Download

Click to download uvr5 model

After downloading and extracting, you will get a uvr5_weights folder. Copy this folder to the root directory of the video translation and dubbing software.



ffmpeg.exe Download

If you are using Windows and see an error indicating that the ffmpeg command cannot be found, you can download the following two files and place them in the root directory of the software or in the ffmpeg folder under the root directory.

https://github.com/jianchang512/stt/releases/download/0.0/ffmpeg.exe

https://github.com/jianchang512/stt/releases/download/0.0/ffprobe.exe