Since version 2.04, the video translation software can use the Fish TTS API for dubbing.

Deploy Fish TTS and Start the API Service

Note: From v3.63 onwards, pyvideotrans requires fish-speech version 1.5.0 or later; earlier versions will not work.

First, deploy fish-speech (>=1.5.0) and start its API service. The API service itself must be running; starting only the webUI (that is, only opening the page in a browser) is not enough. The default API address is http://127.0.0.1:8080/v1/tts.
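Before opening the translation software, it can help to confirm that something is actually listening at the API address. The following is a minimal sketch using only the Python standard library; the function names are hypothetical and are not part of pyvideotrans or fish-speech. It only checks that the TCP port accepts connections, so it cannot tell the API service apart from some other program bound to the same port.

```python
import socket
from urllib.parse import urlparse

# Default address quoted in this guide; change it if you started the API elsewhere.
DEFAULT_API_URL = "http://127.0.0.1:8080/v1/tts"


def split_host_port(api_url):
    """Return (host, port) from an API URL, falling back to the scheme's default port."""
    parsed = urlparse(api_url)
    port = parsed.port or (443 if parsed.scheme == "https" else 80)
    return parsed.hostname, port


def api_port_is_open(api_url=DEFAULT_API_URL, timeout=2.0):
    """Return True if a TCP connection to the API host and port succeeds."""
    host, port = split_host_port(api_url)
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Connection refused, timed out, or host unreachable.
        return False
```

Calling `api_port_is_open()` with no arguments checks the default address. A `False` result usually means the API service has not been started or is bound to a different port.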

For deployment, API startup, and other operating details, refer to the fish-speech documentation.

fish-speech source repository: https://github.com/fishaudio/fish-speech

Documentation: https://speech.fish.audio

Fill in the API in the Video Translation Software

Once fish-speech is deployed and you have confirmed that the API is running, open the video translation software and select the Fish TTS entry in the Settings menu in the upper left corner. Enter the Fish TTS API address in the API address field. If you have not changed it, the default is http://127.0.0.1:8080/v1/tts.

Next, fill in the reference audio and reference text.

The reference audio is a WAV file of the voice you want to use. A clip of 10 to 20 seconds works best, and it should be clear and free of background noise. Place the audio file in the root directory of the translation software, alongside sp.exe.
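As a quick way to check the clip length, here is a small sketch based on Python's standard `wave` module. The helper names are hypothetical, and the `wave` module reads only uncompressed PCM WAV files, so other encodings will raise an error.

```python
import wave


def wav_duration_seconds(path):
    """Return the duration of a PCM WAV file in seconds."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getnframes() / wav_file.getframerate()


def is_recommended_length(path, shortest=10.0, longest=20.0):
    """Return True if the clip is within the 10 to 20 second range suggested in this guide."""
    return shortest <= wav_duration_seconds(path) <= longest
```

For example, `is_recommended_length("1.wav")` reports whether 1.wav falls inside the suggested range. It says nothing about clarity or background noise, which still need to be judged by ear.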

Suppose you want to use an audio file named 1.wav. Put it in the folder that contains sp.exe or sp.py. If the text spoken in 1.wav is "Hello, my dear friends", fill in:

1.wav#Hello, my dear friends

To also use 2.wav but keep it in a separate folder, create a new directory inside the folder that contains sp.exe or sp.py. Assuming the new directory is named wavs and the text spoken in 2.wav is "The weather is nice today, let's go swimming", fill in:

1.wav#Hello, my dear friends
wavs/2.wav#The weather is nice today, let's go swimming
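The entry format above (audio path, a "#" separator, then the reference text, one entry per line) can be illustrated with a short parser. This is only a sketch for checking your own entries before pasting them in. It is not the parser pyvideotrans uses, and splitting at the first "#" is an assumption.

```python
def parse_reference_entries(text):
    """Parse lines of the form 'audio path#reference text' into (path, text) pairs."""
    entries = []
    for line_number, line in enumerate(text.splitlines(), start=1):
        line = line.strip()
        if not line:
            continue  # ignore blank lines
        # Assumption: the first '#' separates the path from the reference text.
        path, separator, reference_text = line.partition("#")
        if not separator or not path.strip() or not reference_text.strip():
            raise ValueError(
                f"line {line_number}: expected 'audio path#reference text'"
            )
        entries.append((path.strip(), reference_text.strip()))
    return entries
```

Passing the two example lines to `parse_reference_entries` yields one pair for 1.wav and one for wavs/2.wav. A line without "#" raises a `ValueError` that names the offending line.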

If the reference text is very long, let it wrap naturally in the input box; do not insert line breaks manually.

After saving, select FishTTS under "Dubbing Channel" on the main interface. The reference audio you added will then appear under "Dubbing Role".

Precautions

  1. Name reference audio files using only plain English letters or digits, with no spaces. If you want to keep all reference audio in one folder (which must be created in the directory where sp.exe is located), name the folder the same way: plain English letters or digits, no spaces.
  2. Fish-speech supports a limited number of languages, so make sure the text is in a supported language.
  3. If you have added reference audio in the settings but the corresponding role does not appear on the main interface, switch the dubbing channel to edgeTTS and then back to FishTTS.
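The naming rule in the first item can be checked mechanically. The sketch below is a hypothetical helper, not part of pyvideotrans. It accepts ASCII letters and digits, plus underscores, hyphens, and dots; allowing those three extra characters is an assumption, made mainly so that file extensions such as .wav pass.

```python
import re

# Assumption: ASCII letters, digits, '_', '-' and '.' count as "simple" characters.
SAFE_PART = re.compile(r"[A-Za-z0-9_.-]+")


def is_safe_reference_path(relative_path):
    """Return True if every part of a '/'-separated path uses only simple characters."""
    parts = relative_path.split("/")
    return all(
        part not in {".", ".."} and SAFE_PART.fullmatch(part) is not None
        for part in parts
    )
```

With this check, `1.wav` and `wavs/2.wav` pass, while a name containing a space, such as `my voice.wav`, is rejected.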