Skip to content

With the rapid development of artificial intelligence technology, the barrier to video translation has been greatly lowered, making it feasible to achieve a completely local, offline, and zero-cost solution.

However, the biggest challenges of local deployment solutions are complex deployment and limitations in hardware performance. Models are often smaller, and translation quality is difficult to optimize. The full version of pyvideotrans provides both local and online API solutions. Although powerful, even downloading it can be a challenge for novices - the installation package without models is as large as 1.9GB, and the volume soars to over 5GB with models.

To solve these problems, following the 302.ai Lite version, we have launched the Alibaba Bailian Lite version. This version does not require downloading models and has no special hardware requirements. You only need to activate the service in Alibaba Cloud Bailian and obtain an API KEY to quickly experience the convenience of video translation.

The Lite version includes video translation, speech recognition, subtitle dubbing, and subtitle translation, meeting basic daily needs.

Unlike the full version, the functions of the Lite version rely on the platform's API service. After the free quota provided by the platform is used up, you need to pay to continue using it. However, considering its convenient deployment and higher translation quality, as well as the decreasing price of API services, this is undoubtedly worthwhile for users seeking efficiency.

Of course, if you do not consider paid options at all, you can still continue to use the full-featured pyvideotrans full version.

Bailian Lite Version Download Address

Baidu Netdisk: https://pan.baidu.com/s/1XsAt8Vt1_IccOKt0QAvC_g?pwd=6rgd

Github: https://github.com/jianchang512/pyvideotrans/releases/download/v3.36/pyvideotrans-ali-bailian-3.88.7z

Comparison Table: Full Version vs. Bailian Lite Version

Featurepyvideotrans Full Versionpyvideotrans Bailian Lite Version
Software Size1.9GB without models, 5GB+ with models130MB
Ease of UseComplex configuration, high customizabilitySimple to use, just fill in the API KEY
VPN Required?Required for Gemini, ChatGPT, and Google channelsNot required
Usage CostCan be completely free, fully local and offline useRequires activating Alibaba Cloud Bailian service, payment required after free quota is exhausted
FunctionalityPowerful, supports all features of the Lite version plus moreOnly supports video translation, speech recognition, speech synthesis, and subtitle translation
Dubbing VoicesSupports more, can support more third-party TTS services via APIAlibaba Bailian model only supports Chinese, English, German, Italian, and Thai. Built-in edge-tts supports more languages

How to Choose a Version:

  • pyvideotrans Full Version is suitable for:

    • Users who want to use it completely for free.
    • Users who have a certain degree of hands-on ability and are willing to tinker.
    • Users who can use a VPN.
    • Users who want to deeply understand and master more detailed features.
  • pyvideotrans Bailian Lite Version is suitable for:

    • Users who do not want to spend too much effort on deployment and configuration and just want to use it simply.
    • Users who are willing to pay for API services.
    • Users who are not familiar with or do not want to use a VPN.

The following are instructions on how to activate Alibaba Cloud Bailian and Alibaba Cloud OSS, as well as instructions on how to fill them in the software.

I: Create an Alibaba Bailian API KEY

  1. First, you need to have an Alibaba Cloud account and be authenticated with your real name.

Register, log in, and authenticate here: https://www.aliyun.com

  1. Get the API KEY for Alibaba Bailian

After logging in, directly open this address until the API KEY acquisition page: https://bailian.console.aliyun.com/?apiKey=1#/api-key

Create directly as shown in the figure.

image.png

View and copy after creation.

image.png

Most models have a free quota.

II: Create an Alibaba Cloud OSS Bucket

Why is this needed? Because Alibaba Cloud's speech recognition API does not support directly uploading audio and video files. You must pass the network URL address of the audio and video to it, and then it downloads the audio and video through the URL on the server for recognition.

It's not worth setting up a server specifically for this. The easiest way is to directly use Alibaba Cloud OSS, upload to OSS, and pass an intranet address to the API, which can also avoid generating download traffic.

1. After logging in to Alibaba Cloud, open the website to activate the OSS service.

Directly open this address: https://oss.console.aliyun.com/overview If it is not activated, you will be prompted to activate it.

2. After activation, the interface is as follows. Start creating a Bucket.

Click Create Bucket as shown below.

image.png

Be sure to select the North China 2 (Beijing) region for internal network use.

image.png

Keep other settings as default.

3. Enable Public Read permission.

This must be enabled, otherwise it cannot be accessed.

After successful creation, click Bucket List in the upper left corner, find the name you just created, and click to enter the management interface of the Bucket.

image.png

After entering, as shown in the figure below, click Block Public Access.

image.png

After clicking, as shown in the figure, the default is enabled. Turn it off.

image.png

image.png

After confirming that it is closed, continue to click "Read and Write Permissions", then click "Settings", and then select "Public Read". Note that you need to click "Settings" first before you can select "Public Read".

image.png

After selecting "Public Read", a prompt will pop up. Click "Continue Modification".

image.png

Then save it.

image.png

Don't worry about it reminding you of the extra traffic costs, because the North China 2 (Beijing) node is accessed through the internal network, and the uploaded files are only used by the internal network during the speech recognition stage. After you have finished the video translation work, you can delete all the uploaded files at any time.

III: Get the AccessKey

To upload files to OSS, you need an AccessKey.

After creating the OSS, directly open this address: https://ram.console.aliyun.com/profile/access-keys

Select according to the figure below, and ignore its suggestions.

image.png

After entering the page, click "Create AccessKey" on the left.

image.png

Then you may need to verify your mobile phone number. After the verification is passed, the automatically created AccessKey ID and AccessKey Secret will be displayed.

image.png

image.png

Remember these two pieces of information.

IV: Fill in the Alibaba Bailian information into the software.

Fill in the OSS Bucket name, Bailian's API KEY, AccessKey ID, and AccessKey Secret created above into the software, as shown in the figure below.

image.png

Alibaba Bailian models used in the software

  1. In the speech recognition stage, that is, the stage of converting speech in audio and video into subtitles, the SenseVoiceSmall model is used, which supports more than 20 languages and has a certain free quota.
  2. In the speech synthesis stage, that is, the dubbing stage according to subtitles, a combination of CosyVoice, Sambert, and edge-tts is used. Among them, edge-tts is Microsoft's free speech synthesis service, and CosyVoice and Sambert are Alibaba Bailian's speech synthesis models, which have a certain free quota.
  3. In the subtitle translation stage, the Tongyi Qianwen large model is used qwen-plus-1125,qwen-plus-1127,qwen-turbo-1101,qwen-max,qwen-max-latest,qwen-plus,qwen2.5-72b-instruct. Models ending in numbers have a free quota, others do not.

Precautions

  1. If you use the video translation or audio and video to subtitles function, you must activate OSS and fill in the Bucket name and AccessKey, otherwise you will not be able to use it.
  2. If other functions are normal, but the audio and video to subtitles, that is, the speech recognition function, is wrong, then it is likely that you did not create OSS, or did not enable the public read permission of the Bucket.
  3. The video translation software itself is free to download and use. The costs incurred by third-party APIs are not related to the software.