The core principle of video translation software is: to recognize the text from the speaking sounds in the video, then translate the text into the target language text, then dub the translated text, and finally embed the dubbing and text into the video.
It can be seen that the first step is to recognize the text from the speaking sounds in the video, and the recognition accuracy directly affects the subsequent translation and dubbing.
Things to note when using Google recognition
- You need to fill in the network proxy, otherwise you will not be able to connect
- Google's speech recognition function is not strong and cannot distinguish and return punctuation marks.
- Suitable for audio recognition with clean background sound and clear and accurate human voice
How to use
Select GoogleSpeech from the mode drop-down box in the software interface. When this item is selected, there is no need to select the model and segmentation method.
Advantages and disadvantages
Advantages: No need to download the model, save system resources, simple to use
Disadvantages: Proxy required, slightly worse effect
Proxy issue
When using Google Gemini and other services, you must use a proxy for well-known reasons. The general format is http://127.0.0.1:number port number
. If you have confirmed that you are using a system proxy and can access it in your browser, but you don't know how to fill in the network proxy address, then execute the following command to confirm whether the system proxy is enabled correctly.
Press and hold the Windows key + R key
, enter ms-settings:network-proxy
in the pop-up Run window, and then click OK
If the pop-up settings panel is similar to the following, it means that the system proxy has been set correctly, and you do not need to fill in the "Network Proxy Address" in the software.