本软件涉及所有模型和下载地址

为减小软件体积，不内置任何模型，将在第一次使用时自动在线下载，主要从国外模型仓库huggingface.co、国内镜像站hf-mirror.com和阿里魔塔modelscope.cn下载

模型体积普遍较大、国外模型仓库国内无法直接访问，需科学上网，无科学上网时软件内部会自动切换国内镜像站
即便已科学上网，但若是科学工具网络不够稳定仍可能下载失败；
而国内镜像站有下载频率限制，下载速度也不够快、不够稳定。
由于以上种种原因，下载失败非常常见，很多其他错误也多由模型下载失败后间接引发的。

本页面列出所有用到的模型下载地址、下载后本地存放位置，若自动下载总是失败，可先删掉已下载的模型文件，然后尝试手动下载。

国外模型仓库地址：https://huggingface.co
部分阿里模型和onnx模型将从阿里魔塔下载： https://modelscope.cn

下载注意事项

一般每个模型都有多个文件组成，而不同模型可能存在相同名字的文件，例如大多都存在model.safetensors，如果你下载目录下已存在同名文件，浏览器可能为自动为你重命名，例如你下载的是model.safetensors,可能被自动命名为model(2).safetensors，你必须重新改回原名，然后放到要求的存放位置，软件才能识别到。

在 huggingface.co 网站，点击下载图标，即可下载该模型文件

语音识别渠道所用模型

Qwen-ASR(本地内置)

0.6B模型：

下载地址：https://huggingface.co/Qwen/Qwen3-ASR-0.6B/tree/main
存放位置：软件目录/models/models--Qwen--Qwen3-ASR-0.6B

1.7B模型：

下载地址：https://huggingface.co/Qwen/Qwen3-ASR-1.7B/tree/main
存放位置：软件目录/models/models--Qwen--Qwen3-ASR-1.7B

Firered中文(本地内置)

下载地址：https://modelscope.cn/models/himyworld/videotrans/resolve/master/fireredasr2aed.zip
存放位置：软件目录/models/复制压缩包内的 fireredasr 文件夹到此

Dolphin(本地内置)

下载地址：https://modelscope.cn/models/himyworld/videotrans/resolve/master/dolphin.zip
存放位置：软件目录/models/复制压缩包内的 dolphin 文件夹到此

Omnilingual亚洲语言(本地内置)

下载地址：https://modelscope.cn/models/himyworld/videotrans/resolve/master/omnilingual.zip
存放位置：软件目录/models/复制压缩包内的 omnilingual 文件夹到此

parakeet日语(本地内置)

下载地址：https://modelscope.cn/models/himyworld/videotrans/resolve/master/parakeet-ja.zip
存放位置：软件目录/models/复制压缩包内的 parakeet 文件夹到此

faster-whisper(本地内置)

模型名称	本地存放位置	下载地址
tiny	`软件目录/modles/models--Systran--faster-whisper-tiny`	https://huggingface.co/Systran/faster-whisper-tiny/tree/main
base	`软件目录/modles/models--Systran--faster-whisper-base`	https://huggingface.co/Systran/faster-whisper-base/tree/main
small	`软件目录/modles/models--Systran--faster-whisper-small`	https://huggingface.co/Systran/faster-whisper-small/tree/main
medium	`软件目录/modles/models--Systran--faster-whisper-medium`	https://huggingface.co/Systran/faster-whisper-medium/tree/main
large-v1	`软件目录/modles/models--Systran--faster-whisper-large-v1`	https://huggingface.co/Systran/faster-whisper-large-v1/tree/main
large-v2	`软件目录/modles/models--Systran--faster-whisper-large-v2`	https://huggingface.co/Systran/faster-whisper-large-v2/tree/main
large-v3	`软件目录/modles/models--Systran--faster-whisper-large-v3`	https://huggingface.co/Systran/faster-whisper-large-v3/tree/main
large-v3-turbo	`软件目录/modles/models--mobiuslabsgmbh--faster-whisper-large-v3-turbo`	https://huggingface.co/mobiuslabsgmbh/faster-whisper-large-v3-turbo/tree/main
----------	-----------	---------------------
tiny.en	`软件目录/modles/models--Systran--faster-whisper-tiny.en`	https://huggingface.co/Systran/faster-whisper-tiny.en/tree/main
base.en	`软件目录/modles/models--Systran--faster-whisper-base.en`	https://huggingface.co/Systran/faster-whisper-base.en/tree/main
small.en	`软件目录/modles/models--Systran--faster-whisper-small.en`	https://huggingface.co/Systran/faster-whisper-small.en/tree/main
medium.en	`软件目录/modles/models--Systran--faster-whisper-medium.en`	https://huggingface.co/Systran/faster-whisper-medium.en/tree/main
----------	-----------	---------------------
distil-large-v2	`软件目录/modles/models--Systran--faster-distil-whisper-large-v2`	https://huggingface.co/Systran/faster-distil-whisper-large-v2/tree/main
distil-large-v3	`软件目录/modles/models--Systran--faster-distil-whisper-large-v3`	https://huggingface.co/Systran/faster-distil-whisper-large-v3/tree/main
distil-large-v3.5	`软件目录/modles/models--distil-whisper--distil-large-v3.5-ct2`	https://huggingface.co/distil-whisper/distil-large-v3.5-ct2/tree/main
distil-small.en	`软件目录/modles/models--Systran--faster-distil-whisper-small.en`	https://huggingface.co/Systran/faster-distil-whisper-small.en/tree/main
distil-medium.en	`软件目录/modles/models--Systran--faster-distil-whisper-medium.en`	https://huggingface.co/Systran/faster-distil-whisper-medium.en/tree/main

openai-whisper(本地内置)

下载后的.pt模型文件直接放在 软件目录/models 文件夹内即可

HuggingFace_ASR(本地内置)

zai-org/GLM-ASR-Nano-2512
- 下载地址：https://huggingface.co/zai-org/GLM-ASR-Nano-2512/tree/main
- 存放位置：软件目录/models/models--zai-org--GLM-ASR-Nano-2512
anke01/whisper-small-uyghur
- 下载地址：https://huggingface.co/anke01/whisper-small-uyghur/tree/main
- 存放地址：软件目录/models/models--anke01--whisper-small-uyghur
nvidia/parakeet-ctc-1.1b（英语）
- 下载地址：https://huggingface.co/nvidia/parakeet-ctc-1.1b/tree/main
- 存放地址：软件目录/models/models--nvidia--parakeet-ctc-1.1b
reazon-research/japanese-wav2vec2-large-rs35kh（日语）
- 下载地址：https://huggingface.co/reazon-research/japanese-wav2vec2-large-rs35kh/tree/main
- 存放地址：软件目录/models/models--reazon-research--japanese-wav2vec2-large-rs35kh
kotoba-tech/kotoba-whisper-v2.0（日语）
- 下载地址：https://huggingface.co/kotoba-tech/kotoba-whisper-v2.0/tree/main
- 存放地址：软件目录/models/models--kotoba-tech--kotoba-whisper-v2.0
biodatlab/whisper-th-large-v3（泰语）
- 下载地址：https://huggingface.co/biodatlab/whisper-th-large-v3/tree/main
- 存放地址：软件目录/models/models--biodatlab--whisper-th-large-v3
vinai/Phowhisper-large（越南语）
- 下载地址：https://huggingface.co/vinai/Phowhisper-large/tree/main
- 存放地址：软件目录/models/models--vinai--Phowhisper-large
openai/whisper-large-v3
- 下载地址：https://huggingface.co/openai/whisper-large-v3/tree/main
- 存放地址：软件目录/models/models--openai--whisper-large-v3

翻译渠道(字幕翻译)所用模型

Hy-MT2-1.8B(本地内置)

下载地址： https://huggingface.co/tencent/Hy-MT2-1.8B/tree/main
存放地址：软件目录/models/models--tencent--Hy-MT2-1.8B

M2M100(本地内置)

下载地址：https://modelscope.cn/models/himyworld/videotrans/resolve/master/m2m100_12b_model.zip
存放地址：软件目录/models/复制压缩包内的 m2m100_12b 文件夹到此

配音渠道所用模型

Piper(本地内置)

下载地址：https://huggingface.co/rhasspy/piper-voices/tree/main
存放位置：软件目录/models/piper

VITS(本地内置)

下载地址：https://modelscope.cn/models/himyworld/videotrans/resolve/master/vits-tts.zip
存放位置：软件目录/models/复制压缩包内的 vits 文件夹到此

ZipVoice(本地内置)

下载地址：https://modelscope.cn/models/himyworld/videotrans/resolve/master/zipvoice-tts.zip
存放位置：软件目录/models/复制压缩包内的 zipvoice 文件夹到此

OmniVoice(本地内置)

下载地址：https://huggingface.co/k2-fsa/OmniVoice/tree/main
存放位置：软件目录/models/models--k2-fsa--OmniVoice

MOSS-TTS-Nano(本地内置)

下载地址：https://huggingface.co/OpenMOSS-Team/MOSS-TTS-Nano-100M/tree/main
存放位置：软件目录/models/MOSS-TTS-Nano-100M
下载地址：https://huggingface.co/OpenMOSS-Team/MOSS-Audio-Tokenizer-Nano-ONNX/tree/main
存放位置：软件目录/models/MOSS-Audio-Tokenizer-Nano-ONNX

ChatterBox(本地内置)

下载地址：https://huggingface.co/ResembleAI/chatterbox/tree/main
存放位置：软件目录/models/models--ResembleAI--chatterbox

Supertonic(本地内置)

下载地址：https://huggingface.co/Supertone/supertonic-3/tree/main
存放位置：软件目录/models/models--Supertone--supertonic-3

Qwen3-TTS(本地内置)

下载地址：https://huggingface.co/Qwen/Qwen3-TTS-12Hz-0.6B-Base/tree/main
存放位置：软件目录/models/models--Qwen--Qwen3-TTS-12Hz-0.6B-Base
下载地址：https://huggingface.co/Qwen/Qwen3-TTS-12Hz-0.6B-CustomVoice/tree/main
存放位置：软件目录/models/models--Qwen--Qwen3-TTS-12Hz-0.6B-CustomVoice

Confucius-TTS(本地内置)

下载地址：https://huggingface.co/netease-youdao/Confucius4-TTS/tree/main
存放位置：软件目录/models/models--netease-youdao--Confucius4-TTS
下载地址：https://huggingface.co/facebook/w2v-bert-2.0/tree/main
存放位置：软件目录/models/models--models--facebook--w2v-bert-2.0
下载地址：https://huggingface.co/nvidia/bigvgan_v2_22khz_80band_256x/tree/main
存放位置：软件目录/models/models--nvidia--bigvgan_v2_22khz_80band_256x
下载地址：https://huggingface.co/funasr/campplus/tree/main
存放位置：软件目录/models/models--funasr--campplus

F5-TTS(本地内置)

官方中英模型(1.35G)：https://huggingface.co/SWivid/F5-TTS/tree/main/F5TTS_v1_Base

存放位置: 软件目录/models/models--SWivid--F5-TTS/F5TTS_v1_Base

日语模型(5.6G)：https://huggingface.co/Jmica/F5TTS/tree/main/JA_21999120

存放位置: 软件目录/models/models--Jmica--F5TTS/JA_21999120

法语模型(5.4G)：https://huggingface.co/RASPIAUDIO/F5-French-MixedSpeakers-reduced/tree/main

存放位置: 软件目录/models/models--RASPIAUDIO--F5-French-MixedSpeakers-reduced

德语模型(1.35G)：https://huggingface.co/hvoss-techfak/F5-TTS-German/tree/main

存放位置: 软件目录/models/models--hvoss-techfak--F5-TTS-German

俄语模型(3.4G)：https://huggingface.co/hotstone228/F5-TTS-Russian/tree/main

存放位置: 软件目录/models/models--hotstone228--F5-TTS-Russian

意大利语模型(1.35G)：https://huggingface.co/alien79/F5-TTS-italian/tree/main

存放位置: 软件目录/models/models--alien79--F5-TTS-italian

西班牙语模型(5.4G)：https://huggingface.co/jpgallegoar/F5-Spanish/tree/main

存放位置: 软件目录/models/models--jpgallegoar--F5-Spanish

印地语模型(2.5G)：https://huggingface.co/SPRINGLab/F5-Hindi-24KHz/tree/main

存放位置: 软件目录/models/models--SPRINGLab/F5-Hindi-24KHz

阿拉伯语模型(2.6G)：https://huggingface.co/silma-ai/silma-tts/tree/main

存放位置: 软件目录/models/models--silma-ai--silma-tts

说话人分离模型

内置模型:

下载地址：https://modelscope.cn/models/himyworld/videotrans/tree/master/onnx

在该地址页下载3dspeaker_speech_eres2net_large_sv_zh-cn_3dspeaker_16k.onnx 、nemo_en_titanet_small.onnx、seg_model.onnx 这3个文件

存放位置：软件目录/models/onnx/将下载的3个文件放到此处

reverb:

-下载地址：https://huggingface.co/Revai/reverb-diarization-v1

存放位置: 软件目录/models/models--Revai--reverb-diarization-v1

pyannote

下载地址：https://huggingface.co/pyannote/speaker-diarization-3.1
存放位置: 软件目录/models/models--pyannote--speaker-diarization-3.1

Ali camp++:

下载地址： https://modelscope.cn/models/iic/speech_campplus_speaker-diarization_common
存放位置：软件目录/models/speech_campplus_speaker-diarization_common

分离人声背景模型、降噪模型、标点恢复模型

下载地址：https://modelscope.cn/models/himyworld/videotrans/tree/master/onnx 存放位置：下载该页面所有 .onnx 文件后存放到 软件目录/models/onnx/ 文件夹内

实时语音识别模型

下载地址: https://modelscope.cn/models/himyworld/videotrans/resolve/master/realtimestt.zip
存放位置: 下载后解压回看到一个 onnx 文件夹，将其内所有文件复制到 软件目录/models/onnx/

本软件涉及所有模型和下载地址 ​

语音识别渠道所用模型 ​

Qwen-ASR(本地内置) ​

Firered中文(本地内置) ​

Dolphin(本地内置) ​

Omnilingual亚洲语言(本地内置) ​

parakeet日语(本地内置) ​

faster-whisper(本地内置) ​

openai-whisper(本地内置) ​

HuggingFace_ASR(本地内置) ​

翻译渠道(字幕翻译)所用模型 ​

Hy-MT2-1.8B(本地内置) ​

M2M100(本地内置) ​

配音渠道所用模型 ​

Piper(本地内置) ​

VITS(本地内置) ​

ZipVoice(本地内置) ​

OmniVoice(本地内置) ​

MOSS-TTS-Nano(本地内置) ​

ChatterBox(本地内置) ​

Supertonic(本地内置) ​

Qwen3-TTS(本地内置) ​

Confucius-TTS(本地内置) ​

F5-TTS(本地内置) ​

说话人分离模型 ​

内置模型: ​

reverb: ​

pyannote ​

Ali camp++: ​

分离人声背景模型、降噪模型、标点恢复模型 ​

实时语音识别模型 ​