FunASR Chinese Recognition

FunASR is a set of open-source speech recognition models from Alibaba that outperforms the Whisper series on Chinese speech. The video translation software has already supported it over HTTP through the zh_recogn and SenseVoice projects: you deploy the corresponding zh_recogn or SenseVoice integration package, start it, and enter its API address in the video translation settings.

However, many users found this setup confusing, so starting with v2.97 the feature is built into the video translation software itself. You no longer need to deploy and start the zh_recogn and SenseVoice projects separately; you can select FunASR Chinese Recognition directly in the software.

Select FunASR Chinese in Speech Recognition

After selecting FunASR Chinese Recognition under Speech Recognition, you can choose either the paraformer-zh model or the SenseVoiceSmall model. paraformer-zh is recommended, as it offers better performance and speed than SenseVoiceSmall.
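
For readers curious about what the two choices correspond to outside the software, here is a minimal sketch using the open-source funasr Python package directly. It is not the software's internal code. The helper names `funasr_model_kwargs` and `transcribe` are hypothetical, and the model identifiers (`paraformer-zh`, `iic/SenseVoiceSmall`, `fsmn-vad`, `ct-punc`) follow the FunASR project's published examples, so treat them as assumptions that may change between releases.

```python
def funasr_model_kwargs(choice):
    """Map a model choice to keyword arguments for funasr.AutoModel.

    The identifiers below are taken from FunASR's public examples and are
    assumptions here, not values confirmed by the video translation software.
    """
    if choice == "paraformer-zh":
        # Paraformer with voice activity detection and punctuation restoration.
        return {"model": "paraformer-zh", "vad_model": "fsmn-vad", "punc_model": "ct-punc"}
    if choice == "SenseVoiceSmall":
        # SenseVoice emits punctuation itself, so only VAD is added.
        return {"model": "iic/SenseVoiceSmall", "vad_model": "fsmn-vad"}
    raise ValueError(f"unknown model choice: {choice!r}")


def transcribe(audio_path, choice="paraformer-zh"):
    """Transcribe one audio file; requires `pip install funasr`."""
    # Imported lazily so the helper above works without funasr installed.
    from funasr import AutoModel

    model = AutoModel(**funasr_model_kwargs(choice))
    result = model.generate(input=audio_path)
    # generate() returns a list of dicts, each with a "text" field.
    # SenseVoiceSmall output may still contain language/emotion tags.
    return "".join(item["text"] for item in result)
```

Calling `transcribe("speech.wav")` with a hypothetical audio file would load paraformer-zh, downloading it first if it is not cached.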

Model Download on First Use of FunASR Chinese Recognition

To keep the software package small, the FunASR models are not bundled with it. The first time you use FunASR Chinese Recognition, the model is downloaded automatically from modelscope.cn and saved to the hub folder under the models folder in the software directory. Depending on your network conditions, this may take anywhere from a few minutes to tens of minutes or longer. As long as no red error messages appear, wait patiently for the download to complete.
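
If the download seems stalled, one way to check progress is to watch the size of the hub folder grow. The following is a minimal sketch using only the Python standard library. It assumes the layout described above (`models/hub` inside the software directory); the function names and the example path are hypothetical.

```python
from pathlib import Path


def hub_dir(software_dir):
    """Return the assumed model cache path: <software_dir>/models/hub."""
    return Path(software_dir) / "models" / "hub"


def downloaded_bytes(software_dir):
    """Total size in bytes of all files under the hub folder (0 if absent)."""
    root = hub_dir(software_dir)
    if not root.is_dir():
        return 0
    return sum(p.stat().st_size for p in root.rglob("*") if p.is_file())


if __name__ == "__main__":
    # Hypothetical install location; replace with your own software directory.
    size_mb = downloaded_bytes(".") / (1024 * 1024)
    print(f"{size_mb:.1f} MB downloaded so far")
```

Running it a few times during the download should show the number increasing; if it stays at zero, the download has probably not started.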
