Skip to content

Starting from version v3.74-0720, the pyVideoTrans video translation software has integrated Alibaba's Qwen-TTS text-to-speech service!

What is Qwen-TTS?

Simply put, Qwen-TTS is an advanced text-to-speech technology that converts text into highly realistic and natural-sounding human voices. A key highlight is its ability to automatically adjust the rhythm and emotion based on the text content. It also supports three dialects: Beijing, Shanghainese (Wu), and Sichuanese.

It currently supports the following voices for you to choose from. All characters support both Chinese and English:

  • Bilingual (Chinese/English): Chelsie, Cherry, Ethan, Serena
  • Beijing Dialect: Dylan
  • Shanghainese: Jada
  • Sichuanese: Sunny

How to Use the Qwen-TTS Feature?

No complex setup is required. With just a few simple steps, you can start using the powerful Qwen-TTS in pyVideoTrans.

Step 1: Obtain and Configure Your API KEY

Alibaba provides a free quota for every user.

  1. Please click this link to visit the Alibaba Cloud Bailian platform: https://bailian.console.aliyun.com/?tab=model#/api-key

  1. Log in to your Alibaba Cloud account (if you don't have one, follow the prompts to register).

  2. On the API-KEY management page, click "Create API-KEY". The system will automatically generate a string starting with "sk-". This is your API KEY. Please copy this string.

  3. Return to the pyVideoTrans software, find TTS Settings in the top menu bar, click it, and select Qwen TTS from the dropdown menu.

  4. In the pop-up Qwen TTS configuration window, paste the API KEY you just copied into the "API KEY" input field. You can click the "Test" button to listen to a sample. If you hear the voice, the configuration is successful. Finally, click Save.

Step 2: Using Qwen-TTS in Video Translation

Once the configuration is complete, you can enable Qwen-TTS when processing individual videos.

  • On the main interface of pyVideoTrans, find the "Dubbing Channel" dropdown menu, click it, and select "Qwen TTS".
  • In the adjacent "Voice Character" menu, you can choose your preferred voice, for example, select "Cherry" for a standard female voice, or "Sunny" for a fun Sichuanese voiceover.

Step 3: Using in Batch Dubbing and Multi-Character Dubbing

The powerful features of Qwen-TTS are also applicable to batch processing tasks, significantly improving your work efficiency.

  • Batch Dubbing for Subtitles: If you have multiple SRT subtitle files that need dubbing, you can switch to the "Batch Dubbing for Subtitles" interface. Simply select "Qwen TTS" and your desired character in the "Dubbing Channel" section below.

  • Multi-Character Dubbing for Subtitles: This feature is also applicable when handling dialogues with multiple characters. You can assign different Qwen-TTS voices to different characters in the "Multi-Character Dubbing for Subtitles" section.