
Using the Deepgram.com Speech Recognition API

Starting from v2.92, the software supports the Deepgram.com speech recognition API. Deepgram is an international AI service that offers $200 in free credits upon registration, which is enough to use the service for some time before paying.

  1. Go to https://deepgram.com/, register and log in, then open the console at https://console.deepgram.com/

  2. In the console, click the large green "Create API Key" button.

A pop-up will appear.

Enter a name made up of English letters in the first text box, then click the "Create" button at the bottom. The secret key (SK) is then displayed. Copy it right away and keep it somewhere safe.

  3. Open Menu → Speech Recognition Settings → Deepgram Window and fill in the following fields:

  • API Key: Paste the key copied in the previous step.

  • Silence Duration: The default value of 200 (milliseconds) works in most cases. If the speech in the video is fast, reduce it to 150. If the speech is slow or contains many silent pauses, increase it to 500 or 800.
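To make the two settings above more concrete, here is a minimal sketch of how an API key and a silence duration could be turned into a request against Deepgram's pre-recorded transcription endpoint. The endpoint `https://api.deepgram.com/v1/listen`, the `Authorization: Token …` header, and the `punctuate`, `utterances`, and `utt_split` query parameters belong to Deepgram's public API. The function name `build_deepgram_request`, the WAV content type, and the idea that the software passes Silence Duration as `utt_split` are assumptions made for illustration only, not a description of the software's internals.

```python
from urllib.parse import urlencode
from urllib.request import Request

DEEPGRAM_LISTEN_URL = "https://api.deepgram.com/v1/listen"


def build_deepgram_request(api_key, audio_bytes, silence_ms=200, language="en"):
    """Build (but do not send) a transcription request.

    Hypothetical helper: assumes the audio is WAV and that the
    Silence Duration setting (milliseconds) maps onto Deepgram's
    utt_split parameter, which is expressed in seconds.
    """
    params = {
        "language": language,
        "punctuate": "true",    # ask Deepgram to add punctuation
        "utterances": "true",   # return utterance-level segments
        "utt_split": silence_ms / 1000,  # ms -> seconds
    }
    url = f"{DEEPGRAM_LISTEN_URL}?{urlencode(params)}"
    headers = {
        "Authorization": f"Token {api_key}",
        "Content-Type": "audio/wav",
    }
    return Request(url, data=audio_bytes, headers=headers, method="POST")


# Example: a 150 ms silence threshold for fast speech.
req = build_deepgram_request("YOUR_API_KEY", b"", silence_ms=150)
print(req.full_url)
# To actually send it: urllib.request.urlopen(req)
```

Building the request separately from sending it makes it easy to check the resulting URL and headers before any credits are spent.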

  4. Note: The Deepgram platform has poor support for Chinese. Whether you use the subtitles returned directly by Deepgram or re-segment the text based on word-level timestamps, punctuation marks are missing, which leads to suboptimal subtitle segmentation. To improve this, the software uses the Ali Chinese punctuation restoration model to re-segment the text. When recognizing Chinese, select "Chinese Re-segmentation" in the software interface.
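As an illustration of what re-segmenting based on word-level timestamps can look like, the sketch below starts a new subtitle whenever the pause between two consecutive words exceeds a silence threshold. It assumes each word is a dict with `word`, `start`, and `end` keys (times in seconds), which is how Deepgram reports word timings. The function name `segment_by_silence` is hypothetical, and this is not the software's actual algorithm: in particular it does no punctuation restoration, which is exactly the gap the Chinese Re-segmentation option is meant to fill.

```python
def segment_by_silence(words, silence_ms=200):
    """Group timestamped words into subtitle segments.

    A new segment starts when the gap between the end of one word
    and the start of the next is longer than silence_ms.
    """
    max_gap = silence_ms / 1000.0  # ms -> seconds
    groups = []
    current = []
    for w in words:
        if current and w["start"] - current[-1]["end"] > max_gap:
            groups.append(current)
            current = []
        current.append(w)
    if current:
        groups.append(current)

    # Words are joined with spaces, which suits space-delimited
    # languages; Chinese text would be joined without separators.
    return [
        {
            "start": g[0]["start"],
            "end": g[-1]["end"],
            "text": " ".join(x["word"] for x in g),
        }
        for g in groups
    ]


words = [
    {"word": "hello", "start": 0.0, "end": 0.4},
    {"word": "world", "start": 0.5, "end": 0.9},
    {"word": "again", "start": 1.5, "end": 1.9},
]
for seg in segment_by_silence(words, silence_ms=200):
    print(seg["start"], seg["end"], seg["text"])
```

With a 200 ms threshold the 0.6 s pause before "again" starts a new subtitle, while raising the threshold to 800 ms keeps all three words in one segment. This is the same trade-off that the Silence Duration setting controls.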