Skip to content

Video translation software version 2.04 and later support using the Fish tts API for voiceovers.

Deploying Fish TTS and Starting the API Service

Note: pyvideotrans version v3.63 and later require fish-speech version to be at least 1.5.0, otherwise it cannot be used.

First, you need to deploy fish-speech(>=1.5.0) and start the API service. Note that the API service must be started. Starting only the web UI, which means only opening the browser webpage, is not enough. The default API address is http://127.0.0.1:8080/v1/tts

Please refer to the relevant documentation for specific deployment and API startup methods.

fish-speech tts project open source address https://github.com/fishaudio/fish-speech

Documentation address https://speech.fish.audio

Filling in the API in the Video Translation Software

After deploying and starting, confirm that the API is running, then open the video translation software, go to the settings menu in the upper left corner - Fish TTS, and fill in the corresponding Fish TTS API in the API address. If you have not modified it, the default is http://127.0.0.1:8080/v1/tts

Next, fill in the reference audio and reference text.

The reference audio is the wav audio file of the voice you want to use. The optimal audio duration is 10s-20s, clear and without background noise. Please place the audio file in the root directory of this translation software, together with sp.exe.

If you want to use the audio named 1.wav, place it in the same folder as sp.exe or sp.py. Assume the corresponding text content of 1.wav is "Hello, my dear friends"

Then you need to fill in

1.wav#Hello, my dear friends

If you also want to use 2.wav and want to put it in a dedicated folder, you can create a new directory in the folder where sp.exe or sp.py is located. Suppose the new directory name is wavs, and its corresponding text is "The weather is nice today, let's go swimming", then fill in

1.wav#Hello, my dear friends
wavs/2.wav#The weather is nice today, let's go swimming

If the reference text is very long, just keep the natural line breaks, do not manually add carriage returns or line breaks.

After saving, go to the main interface and select FishTTS in "Voiceover Channel", then you can see the added reference audio in "Voiceover Role"

Precautions

  1. Please use simple English or numbers to name the parameter audio, do not include spaces. If you want to put all the reference audio in a folder (the folder needs to be created in the directory where sp.exe is located), use simple English or numbers for the folder name as well, do not include spaces.
  2. Fish-speech supports a limited number of languages, please ensure that the text is within the supported range
  3. If you have added reference audio in the settings, but the corresponding role is not displayed in the main interface, please switch the voiceover channel to edgeTTS first, and then switch back to FishTTS