I have seen your help for metadata on the ticket link below.
This does not help me yet.
Can you please give me the command line showing how to incorporate client.cc while running the STT model with deepspeech, so that I get the proper JSON format and speaker info? I tried several ways. I am confused and still unsuccessful.
I have tried to incorporate client.cc like this:
lissyx
((slow to reply) [NOT PROVIDING SUPPORT])
There are multiple questions here. Can you state explicitly what you want to achieve?
We don’t have any “speaker info” anyhow.
This command line makes no sense at all. What is this --extended …/DeepSpeech-0.5.1/native_client/client.cc --json --extended? What are you trying to achieve?
First of all, thank you so much for your kind help.
I have a simple .wav file. I want to transcribe it to text, and later align it with mozilla align.
But I realized that after alignment, the speaker is missing from the JSON output. The text I get from deepspeech is also a plain text file with nothing in addition.
Can you guide me on how and where to get that metadata, such as speaker info?
Or where am I making a mistake?
lissyx
This is expected.
We don’t have such a thing, so I don’t know what you are talking about.
lissyx
Please, can you articulate a complete and descriptive sentence? I absolutely don’t understand what you want. I wish to help you, but honestly, I am losing my time right now trying to do divination out of five words.
Dear Sir,
Suppose I have an audio recording of a conference or a professor's lecture. I am transcribing it into text. Then, from the text, I need only what the key speaker has said, or what the professor has explained. That text can only be extracted if I have metadata with info like speaker 1, speaker 2 and so on. I think this is explained in DSAlign, but I don’t know how to get it out.
lissyx
Ok, then the problem is that you did not read what I said earlier: we don’t have that information.
Ok. Is there any open source tool or library which helps in getting this? Could you guide me, as you are much more of an expert in this domain? Or do you have any advice on what steps I need to follow?
lissyx
This is not something we have any use for, so I have no advice to share.
Thank you so much. You are all really great people.
One more question.
In alignment, is there any flag or option so that, instead of fragments, I get JSON for each word?
lissyx
You can implement that yourself, from the Metadata structure.
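For anyone landing here later, a minimal sketch of what "implement that yourself" could look like. It assumes each Metadata entry exposes a character and a start time in seconds; the `Item` class and the `words_from_items` helper below are hypothetical stand-ins written for illustration, not part of DeepSpeech or DSAlign. A word's duration is approximated as the time between its first character and the space (or last character) that ends it. It produces word timings only; it does not provide any speaker info.

```python
import json
from dataclasses import dataclass
from typing import Dict, Iterable, List


@dataclass
class Item:
    """Hypothetical stand-in for one per-character Metadata entry."""
    character: str
    start_time: float  # seconds from the start of the audio (assumed)


def _make_word(chars: List[str], start: float, end: float) -> Dict:
    """Build one JSON-ready record for a finished word."""
    return {
        "word": "".join(chars),
        "start_time": round(start, 2),
        "duration": round(end - start, 2),
    }


def words_from_items(items: Iterable[Item]) -> List[Dict]:
    """Group per-character items into words, splitting on spaces."""
    words: List[Dict] = []
    chars: List[str] = []
    word_start = 0.0
    prev_time = 0.0
    for item in items:
        if item.character == " ":
            # A space closes the current word, if there is one.
            if chars:
                words.append(_make_word(chars, word_start, item.start_time))
                chars = []
        else:
            if not chars:
                word_start = item.start_time
            chars.append(item.character)
        prev_time = item.start_time
    # Flush the last word when the transcript does not end with a space.
    if chars:
        words.append(_make_word(chars, word_start, prev_time))
    return words


if __name__ == "__main__":
    demo = [Item("h", 0.10), Item("i", 0.20), Item(" ", 0.30),
            Item("y", 0.50), Item("o", 0.60)]
    print(json.dumps(words_from_items(demo), indent=2))
```

With real output, the list of entries from the Metadata structure would be passed in place of the `demo` list, provided the entries carry the two fields assumed above.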