Plans to add Greek language detection/dictionary?

James_T · February 20, 2021, 12:05pm

Hi all, please bear with me since im completely new to this, doing baby steps to understand how this works.
So, an audio(wma,wav) to text application would REALLY help in my job and a friendly user from Reddit pointed me to this. He tested it and told me for English at least, the results were pretty awesome!
My problem? I need Greek audio and i guess a big(?) dictionary since its a medical job, no idea if medical terms are in regular dictionaries(no latin words, since most medical words have a Greek root).
Are there any plans to add Greek? Obviously im talking about the voice recognition and text and not the software itself.
Is the software compatible with external dictionaries? For example, another software that i was looking into(does the opposite, text to audio) had a 5.5gb Greek dictionary(its a .tgz file).
Please forgive me if i asked a stupid question, but ive been searching for an offline tool like this for ages. Since its a medical job, i cant do anything online for securities reasons.

othiele · February 20, 2021, 12:30pm

Welcome, unfortunately there are currently no public Greek models.

Yes, you need both, annotated audio and textual data (language model).

AFAIK, not at the moment.

Build a language model from textual data.

Check the guidelines, they’ll guide you to docs and the playbook. And help you ask the right questions.

James_T · February 20, 2021, 2:19pm

thanks for the reply! so if i understand this correctly, it seems i can build at least a compatible greek dictionary using the one i mentioned? so i would only be missing the audio recognition?

othiele · February 20, 2021, 3:02pm

Start small, then build from there.

Search the forum as suggested, there were some people working on it. Connect

Topic		Replies	Views
Mozilla Voice STT in the Wild! DeepSpeech	31	11879	August 25, 2020
Speech to text from audio file DeepSpeech learning	5	840	February 10, 2021
Speech-to-text json result with time per word DeepSpeech	3	1128	October 19, 2018
Transcription Results very bad in english DeepSpeech	16	1179	October 7, 2020
Force alignment (synchronize audio with text) DeepSpeech	9	4226	October 28, 2019

Plans to add Greek language detection/dictionary?

Related topics