
The IMS Toucan TTS project claims to support speech synthesis in over 7,000 languages. I downloaded and tested it: it does work, but the quality is only passable. If your expectations are modest, it is usable.

Unlike edge-tts, this project does not offer a set of named voices to choose from. Each language gets a single default voice, which you can then adjust (the randomized voice, its seed, and the gender) through parameters such as prosody_creativity, duration_scaling_factor, voice_seed, and emb1.
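To illustrate what a seed parameter such as voice_seed does in general, here is a minimal sketch. It is not IMS Toucan's code: the function name and the vector size are hypothetical. It only shows that the same seed reproduces the same pseudo-random values, while a different seed produces different ones.

```python
import random


def pseudo_voice_vector(voice_seed: int, size: int = 4) -> list[float]:
    """Hypothetical stand-in for a randomized voice; NOT IMS Toucan's implementation."""
    rng = random.Random(voice_seed)  # private generator, seeded for reproducibility
    return [rng.uniform(-1.0, 1.0) for _ in range(size)]


same_a = pseudo_voice_vector(42)
same_b = pseudo_voice_vector(42)
other = pseudo_voice_vector(43)

print("same seed, same values:", same_a == same_b)
print("different seed, same values:", same_a == other)
```

In practice this means that once you find a seed whose voice you like, noting it down should let you get that voice again.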

Project address: https://github.com/DigitalPhonetics/IMS-Toucan

Local Deployment Method

You can deploy directly from source by following the instructions in the project's repository: https://github.com/DigitalPhonetics/IMS-Toucan

I've also created a Windows integration package for those who don't want to go through the hassle.

Download the integration package from Baidu Netdisk and extract it to a directory, for example, D:/python/IMS-Toucan.

Integration package download address: https://pan.baidu.com/s/1om62tz-fmq4o5sijmHmnMQ?pwd=dck6

After extracting, you'll find an espeak-ng-X64.msi file. Installing it is optional, but it improves the sound quality. To install it, double-click the file and accept the default steps.

image.png

You'll see 3 bat files in the directory. Double-click the one you want to run. Each is described below.

image.png

启动api加简单网页.bat (Start API and Simple Webpage.bat):

Double-clicking this starts an API service and opens a simple webpage. The API can be connected to the custom TTS interface of the video translation software. It supports only the 24 most common languages.

image.png

The API address is http://127.0.0.1:5020/api. Enter it in the custom TTS interface of the video translation software.
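Before entering the address anywhere, it can help to confirm that something is actually listening on it. The following is a minimal sketch that only tests whether a TCP connection to the host and port succeeds. It does not call the API itself, and the function name is an illustrative choice.

```python
import socket
from urllib.parse import urlparse

API_URL = "http://127.0.0.1:5020/api"  # address given in this article


def api_port_is_open(url: str = API_URL, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to the URL's host and port succeeds."""
    parsed = urlparse(url)
    host = parsed.hostname or "127.0.0.1"
    port = parsed.port or 80
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:  # refused connections and timeouts both land here
        return False


if __name__ == "__main__":
    print("API port reachable:", api_port_is_open())
```

If this reports that the port is not reachable, the bat file is probably still starting up or has exited with an error.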

启动完整网页ui.bat (Start Complete Webpage UI.bat):

Double-clicking this will start the official IMS Toucan web interface, which supports synthesis and voiceovers in all languages. You can try exploring it yourself.

image.png

If the browser doesn't open the page automatically, wait until the terminal displays the following, then copy the address shown and open it in the browser manually: image.png

启动高级QT-ui.bat (Start Advanced QT-UI.bat):

Double-clicking this starts the project's built-in desktop interface. This interface has not been localized. Explore it yourself if you are interested.

image.png

Important Notes

  1. On startup, the terminal window may print a lot of information, as shown in the figure below. This is not an error and can be ignored.

    image.png

  2. The API and the complete webpage UI automatically open the corresponding page in the browser after starting. The advanced QT-UI opens the desktop software directly.

  3. Sometimes a series of errors appears, including a link to the Microsoft website https://docs.microsoft.com. In that case, close the window, then right-click the bat file and run it as administrator.

  4. The integration package comes with a model, but on startup it may check whether a model update is available. This requires a connection to https://huggingface.co, which cannot be accessed from within mainland China without a proxy, so you need to prepare your own. If HTTPSConnect appears in the error output, you need a global or system-level proxy.
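For the proxy issue in note 4, a quick reachability check can tell you whether your proxy is working before you start the package. This is a minimal sketch under stated assumptions: the proxy address http://127.0.0.1:7890 is a placeholder for whatever your own proxy software listens on, and the helper names are illustrative. It uses the conventional HTTP_PROXY and HTTPS_PROXY environment variables, which Python's urllib reads.

```python
import os
import urllib.request


def proxy_env(proxy_url: str) -> dict[str, str]:
    """Build the conventional proxy environment variables for one proxy URL."""
    return {"HTTP_PROXY": proxy_url, "HTTPS_PROXY": proxy_url}


def can_reach(url: str = "https://huggingface.co", timeout: float = 10.0) -> bool:
    """Return True if the URL answers successfully.

    urllib picks up proxy settings from the environment variables.
    """
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except OSError:  # URLError, HTTPError and timeouts are all OSError subclasses
        return False


if __name__ == "__main__":
    # ASSUMPTION: replace with the address your own proxy software listens on.
    os.environ.update(proxy_env("http://127.0.0.1:7890"))
    print("huggingface.co reachable:", can_reach())
```

Note that this only affects the Python process running the check. Whether the integration package honors the same variables when they are set in the terminal that launches the bat file is not confirmed here, which is why a global or system-level proxy is the safer choice.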

Using in Video Translation Software

First, update the video translation software with the latest patch package. Download address: https://pyvideotrans.com

After starting the software, click Menu - TTS Settings - Custom TTS Interface and enter http://127.0.0.1:5020/api as the API address. The role list can contain any letters, such as a,b,c.
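If you want to experiment with the endpoint outside the video translation software, the sketch below shows how a request could be assembled with the standard library. The request format is not documented in this article, so the field names "text", "language", and "voice" and the form encoding are purely hypothetical. The code builds the request but does not send it.

```python
import urllib.parse
import urllib.request

API_URL = "http://127.0.0.1:5020/api"  # address given in this article


def build_tts_request(text: str, language: str, role: str = "a") -> urllib.request.Request:
    """Build (but do not send) a POST request for the local API.

    ASSUMPTION: the field names "text", "language" and "voice" and the
    form encoding are hypothetical; check the real interface before use.
    """
    body = urllib.parse.urlencode(
        {"text": text, "language": language, "voice": role}
    ).encode("utf-8")
    return urllib.request.Request(API_URL, data=body, method="POST")


req = build_tts_request("Hello, world.", "en")
print(req.get_method(), req.full_url)
print(req.data.decode("utf-8"))
```

Once the real field names are known, the request could be sent with urllib.request.urlopen(req).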

image.png

image.png

Once a test confirms that everything works, it is ready to use.

image.png