DeepSpeech MultiLanguage Server

Good morning,

I have a basic question about using DeepSpeech.

My project is the following: I am setting up a headless Ubuntu 20.04 LTS server and want to install DeepSpeech on it.
A program should send audio files (presumably ogg, opus) over the network to the server, which then processes them. I do not know in advance which language the audio files contain. In return, I want the recognized text, which the program then continues to work with.

How can I implement this? The program I use to deliver the data has a Python interface.

Do I have to download different language models, and if so, where can I find them?

Greetings from Berlin
Bruno

DeepSpeech does not detect the language (de/en/…) for you. You could have several models transcribe your audio and see which one produces good confidence values, roughly like the sketch below.
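A minimal sketch of that idea with the 0.9.x Python API, assuming two downloaded acoustic models (the English file name matches the 0.9.3 release; the German one is a placeholder for whatever you download). Note that the confidence is an unnormalized log-score, so comparing it across models on the same audio is only a heuristic:

```python
import numpy as np
from deepspeech import Model

# Assumed model files: adjust the paths to whatever you downloaded.
MODELS = {
    "en": Model("deepspeech-0.9.3-models.pbmm"),
    "de": Model("output_graph_de.pbmm"),  # placeholder German model
}

def guess_language(audio: np.ndarray):
    """Transcribe the same 16 kHz mono int16 buffer with every model
    and return (language, text) of the highest-confidence transcript."""
    best = None
    for lang, model in MODELS.items():
        meta = model.sttWithMetadata(audio, 1)  # request 1 candidate
        cand = meta.transcripts[0]
        text = "".join(tok.text for tok in cand.tokens)
        if best is None or cand.confidence > best[0]:
            best = (cand.confidence, lang, text)
    return best[1], best[2]
```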

Please search before you ask: have a look at this Discourse and the release page, and you’ll quickly find all available models.

DeepSpeech has Python bindings, so you can easily load several models. Look at the docs and the examples.
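For instance, here is a minimal end-to-end sketch that loads the 0.9.3 English release model and transcribes one of your ogg/opus files. It shells out to ffmpeg (assumed to be installed) to decode the file into the 16 kHz 16-bit mono PCM the bindings expect; the file names are placeholders:

```python
import subprocess
import numpy as np
from deepspeech import Model

model = Model("deepspeech-0.9.3-models.pbmm")  # release acoustic model
model.enableExternalScorer("deepspeech-0.9.3-models.scorer")

def decode_to_pcm(path: str, rate: int) -> np.ndarray:
    """Decode any file ffmpeg understands (ogg, opus, ...) into
    mono 16-bit PCM at the model's expected sample rate."""
    raw = subprocess.run(
        ["ffmpeg", "-i", path, "-f", "s16le", "-ac", "1",
         "-ar", str(rate), "-loglevel", "quiet", "-"],
        check=True, capture_output=True,
    ).stdout
    return np.frombuffer(raw, dtype=np.int16)

audio = decode_to_pcm("recording.ogg", model.sampleRate())
print(model.stt(audio))
```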

Hello @othiele!
Thanks for your input.

I will look at the links.