The content on this page is sourced from the release page of my open-source project. For the latest information, please visit GitHub: https://github.com/jianchang512/stt/releases/0.0

faster-whisper Model Downloads, for the stt project and the faster-whisper mode of the "pyvideotrans Video Translation Dubbing" project. For openai-whisper models, scroll down.

tiny 64MB | tiny.en 64MB

base 124MB | base.en 124MB

small 415MB | small (Baidu Netdisk) | small.en 415MB

medium 1.27GB | medium.en 1.27GB

large-v1 (Baidu Netdisk) | large-v1 (Hugging Face)

large-v2 (Hugging Face) | large-v2 (Baidu Netdisk)

large-v3 (Hugging Face) | large-v3 (Baidu Netdisk)

large-v3-turbo 1.3GB

distil-whisper-small.en 282MB

distil-whisper-medium.en 671MB | distil-medium (Baidu Netdisk)

distil-whisper-large-v2 1.27GB | distil-large-v2 (Baidu Netdisk)

distil-whisper-large-v3 1.3GB | distil-whisper-large-v3 (Baidu Netdisk)

After downloading, extract the archive and copy the "models--Systran--faster-xx" folder it contains into the models directory. When you are done, the models directory should contain one such folder for each model you downloaded.

[Image: archive content]

[Image: correctly placed folder list under the models directory]
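To confirm the folders landed in the right place, a short script can list what is in the models directory. This is a minimal sketch, not part of the software: the function name `find_faster_whisper_models` is hypothetical, and the folder prefix is taken from the "models--Systran--faster-xx" name given above. Models published under a different name would need a different pattern.

```python
from pathlib import Path


def find_faster_whisper_models(models_dir, pattern="models--Systran--faster-*"):
    """Return the sorted names of model folders found directly in models_dir.

    The default pattern is an assumption based on the folder name quoted on
    this page; pass another glob pattern if your folders are named differently.
    """
    root = Path(models_dir)
    if not root.is_dir():
        # A missing models directory simply means no models were found.
        return []
    # Only directories count; stray files with a matching name are ignored.
    return sorted(p.name for p in root.glob(pattern) if p.is_dir())
```

Calling `find_faster_whisper_models("models")` from the software's root directory returns an empty list if nothing was copied, which usually means the archive was extracted one level too deep or into the wrong folder.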




openai-whisper Model Downloads, only for the openai-whisper mode of the "pyvideotrans Video Translation Dubbing" software

After downloading, place the .pt file in the models folder under the software directory.

tiny.pt Model

tiny.en.pt Model

base.pt Model

base.en.pt Model

small.pt Model

small.en.pt Model

medium.pt Model

medium.en.pt Model

large-v1.pt Model

large-v2.pt Model

large-v3.pt Model

large-v3-turbo.pt Model



FunASR Chinese Model Download

Baidu Netdisk Download (including speech recognition, punctuation restoration, and noise reduction models): https://pan.baidu.com/s/1v5wagiid6-K7GX9Pif4reA?pwd=y2ef

Hugging Face (download address for users outside mainland China): https://huggingface.co/spaces/mortimerme/s4/resolve/main/FunASR-Chinese-models.7z?download=true

After downloading and extracting, you will see three folders: iic, damo, and .__temp. Copy them into the models/hub folder of the video translation software, overwriting any existing files.
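The copy-and-overwrite step can also be scripted. The sketch below assumes Python 3.8 or later (for `dirs_exist_ok`). The function name `copy_funasr_models` and its two path arguments are hypothetical. The three folder names and the models/hub destination come from the instructions above.

```python
import shutil
from pathlib import Path

# Folder names as listed in the instructions above.
FUNASR_FOLDERS = ("iic", "damo", ".__temp")


def copy_funasr_models(extracted_dir, software_root):
    """Copy the extracted FunASR folders into <software_root>/models/hub.

    Existing files with the same name are overwritten. Returns the names of
    the folders that were actually found and copied.
    """
    hub = Path(software_root) / "models" / "hub"
    hub.mkdir(parents=True, exist_ok=True)
    copied = []
    for name in FUNASR_FOLDERS:
        src = Path(extracted_dir) / name
        if src.is_dir():
            # dirs_exist_ok=True merges into an existing folder and
            # replaces files that are already there.
            shutil.copytree(src, hub / name, dirs_exist_ok=True)
            copied.append(name)
    return copied
```

If the returned list is shorter than three names, the archive was probably extracted into a nested folder, so `extracted_dir` should point one level deeper.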



cuBLASxx.dll and cuDNN Download

If you see "cublasxxx.dll not found" or the software crashes after you enable CUDA acceleration, download the archive below that matches your CUDA version and copy the DLL files inside it to C:/Windows/System32 or to the software's root directory (where the EXE is located).

To check your CUDA version, type cmd in the address bar of any folder window to open a command prompt, then run the command nvcc -V.

CUDA11.x version download here: https://github.com/jianchang512/stt/releases/download/0.0/cuBLAS.and.cuDNN_CUDA11_win_v4.7z

CUDA12.x version download here: https://github.com/jianchang512/stt/releases/download/0.0/cuBLAS.and.cuDNN_CUDA12_win_v1.7z
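Choosing between the two archives comes down to the major version that nvcc reports. The sketch below assumes that the `nvcc -V` output contains a phrase of the form "release 11.8", which is how current CUDA toolkits print it. The function names are hypothetical. The two URLs are the ones listed above.

```python
import re
import subprocess

# Download links copied from this page, keyed by CUDA major version.
CUDA_ARCHIVES = {
    11: "https://github.com/jianchang512/stt/releases/download/0.0/cuBLAS.and.cuDNN_CUDA11_win_v4.7z",
    12: "https://github.com/jianchang512/stt/releases/download/0.0/cuBLAS.and.cuDNN_CUDA12_win_v1.7z",
}


def parse_cuda_major(nvcc_output):
    """Extract the CUDA major version from `nvcc -V` text, or None if absent."""
    match = re.search(r"release\s+(\d+)\.(\d+)", nvcc_output)
    return int(match.group(1)) if match else None


def archive_for(nvcc_output):
    """Return the matching download URL, or None for an unlisted version."""
    return CUDA_ARCHIVES.get(parse_cuda_major(nvcc_output))


def read_nvcc_output():
    """Run `nvcc -V` and return its output, or "" if nvcc is not on PATH."""
    try:
        result = subprocess.run(
            ["nvcc", "-V"], capture_output=True, text=True, check=False
        )
    except FileNotFoundError:
        return ""
    return result.stdout
```

`archive_for(read_nvcc_output())` returns `None` when nvcc is not installed or reports a version other than 11.x or 12.x. In that case this page offers no matching archive.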



uvr5 Model Download

Click to download uvr5 model

After downloading, extract the archive to obtain a uvr5_weights folder. Copy this folder to the root directory of the video translation and dubbing software.



ffmpeg.exe Download

If you are using Windows and receive an "ffmpeg command not found" message, download the two files below and place them either in the software's root directory or in the ffmpeg folder under the software's root directory.

https://github.com/jianchang512/stt/releases/download/0.0/ffmpeg.exe

https://github.com/jianchang512/stt/releases/download/0.0/ffprobe.exe
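To check whether the two executables are where they need to be, the locations mentioned above can be probed in order. This is a sketch under assumptions: `locate_tool` is a hypothetical helper, and the final PATH lookup is an addition for convenience. How the software itself searches for ffmpeg is not stated on this page.

```python
import shutil
from pathlib import Path


def locate_tool(name, software_root):
    """Return the path to <name>.exe if it is in one of the two locations
    named above, else whatever the system PATH provides (or None)."""
    root = Path(software_root)
    exe = name + ".exe"
    # Check the software's root directory first, then its ffmpeg subfolder.
    for candidate in (root / exe, root / "ffmpeg" / exe):
        if candidate.is_file():
            return str(candidate)
    # Fall back to the system PATH; shutil.which returns None if not found.
    return shutil.which(name)
```

Running it for both `"ffmpeg"` and `"ffprobe"` shows which of the two files, if any, still needs to be downloaded.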