DSAlign - unable to download native_client.tar.xz for 0.6.0

I am trying to set up DSAlign as per the instructions here.

The bin/lm-dependencies.sh step is failing because native_client.tar.xz for version 0.6.0 is not available. I also tried the download command from the 0.6.0 release notes, and it fails on the same missing file:

python3 util/taskcluster.py --target .
Downloading https://community-tc.services.mozilla.com/api/index/v1/task/project.deepspeech.deepspeech.native_client.v0.6.0.cpu/artifacts/public/native_client.tar.xz
Traceback (most recent call last):
  File "util/taskcluster.py", line 153, in <module>
    main()
  File "util/taskcluster.py", line 147, in main
    maybe_download_tc(target_dir=args.target, tc_url=get_tc_url(args.arch, args.artifact, args.branch))
  File "util/taskcluster.py", line 57, in maybe_download_tc
    urllib.request.urlretrieve(tc_url, target_file, reporthook=(report_progress if progress else None))
  File "/usr/lib/python3.6/urllib/request.py", line 188, in urlretrieve
    with contextlib.closing(urlopen(url, data)) as fp:
  File "/usr/lib/python3.6/urllib/request.py", line 163, in urlopen
    return opener.open(url, data, timeout)
  File "/usr/lib/python3.6/urllib/request.py", line 472, in open
    response = meth(req, response)
  File "/usr/lib/python3.6/urllib/request.py", line 582, in http_response
    'http', request, response, code, msg, hdrs)
  File "/usr/lib/python3.6/urllib/request.py", line 510, in error
    return self._call_chain(*args)
  File "/usr/lib/python3.6/urllib/request.py", line 444, in _call_chain
    result = func(*args)
  File "/usr/lib/python3.6/urllib/request.py", line 590, in http_error_default
    raise HTTPError(req.full_url, code, msg, hdrs, fp)
urllib.error.HTTPError: HTTP Error 404: Not Found

The link that the script refers to is not available for download.
Can you please point me in the right direction? I want to use DSAlign to check my training dataset for version 0.7.0 (CUDA 10.0, cuDNN 7.6.5, TensorFlow 1.15.2, Ubuntu 16.04 with Python 3.6.7) on a machine with two RTX 2080 Ti 11 GB GPUs.
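For anyone who wants to confirm that the artifact itself is gone (rather than a local network problem), here is a minimal sketch that rebuilds the URL from the log above and probes it with a HEAD request. The helper names `build_tc_url` and `artifact_exists` are made up for illustration and are not part of util/taskcluster.py; the URL pattern is simply copied from the download line in the log.

```python
import urllib.error
import urllib.request

# URL pattern copied from the "Downloading ..." line in the log above.
# The helper names below are illustrative, not part of util/taskcluster.py.
TC_URL = (
    "https://community-tc.services.mozilla.com/api/index/v1/task/"
    "project.deepspeech.deepspeech.native_client.v{version}.{arch}/"
    "artifacts/public/{artifact}"
)


def build_tc_url(version, arch="cpu", artifact="native_client.tar.xz"):
    """Fill the version, architecture and artifact name into the URL pattern."""
    return TC_URL.format(version=version, arch=arch, artifact=artifact)


def artifact_exists(url, timeout=10):
    """Return True if a HEAD request succeeds, False on an HTTP error such as 404.

    Other failures (DNS errors, timeouts) are not caught here, so a network
    problem is not mistaken for a missing file.
    """
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        return False


# Example usage (performs a network request):
#   url = build_tc_url("0.6.0")
#   print(url, "available" if artifact_exists(url) else "missing")
```

Probing with HEAD avoids downloading the archive just to find out whether it is still hosted.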

Some old taskcluster stuff is gone, but @Tilman_Kamp should know whether the files are still somewhere or whether you must switch to 0.7.

Thanks @othiele

I did try changing the version to 0.7.0 in bin/lm-dependencies.sh. The script had no issues installing KenLM and DeepSpeech 0.7.0, but I get an error when I run the align script:

Traceback (most recent call last):
  File "DSAlign-master/align/align.py", line 691, in <module>
    main()
  File "DSAlign-master/align/align.py", line 469, in main
    trie_path
  File "/usr/lib/python3.6/subprocess.py", line 576, in check_call
    retcode = call(*popenargs, **kwargs)
  File "/usr/lib/python3.6/subprocess.py", line 557, in call
    with Popen(*popenargs, **kwargs) as p:
  File "/usr/lib/python3.6/subprocess.py", line 947, in __init__
    restore_signals, start_new_session)
  File "/usr/lib/python3.6/subprocess.py", line 1551, in _execute_child
    raise child_exception_type(errno_num, err_msg)
FileNotFoundError: [Errno 2] No such file or directory: 'dependencies/deepspeech/generate_trie'

It looks like this is caused by the changes between 0.6.0 and 0.7.0. I am working with DeepSpeech 0.7.0, so I would not mind getting that version to work with DSAlign.
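A quick pre-flight check can show which of the binaries the align script expects are actually present before it is run. This is only a sketch: `missing_tools` is a made-up helper, and the only tool name taken from the traceback above is `generate_trie` under dependencies/deepspeech.

```python
import os


def missing_tools(tool_dir, tool_names):
    """Return the names from tool_names that are not executable files in tool_dir."""
    missing = []
    for name in tool_names:
        path = os.path.join(tool_dir, name)
        if not (os.path.isfile(path) and os.access(path, os.X_OK)):
            missing.append(name)
    return missing


# Example usage, with the path and tool name taken from the traceback:
#   print(missing_tools("dependencies/deepspeech", ["generate_trie"]))
```

An empty list means every listed tool was found; anything else names the binaries that the installed native client package does not provide.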

@Tilman_Kamp, please let me know if there is a way to switch to 0.7.0 or to get 0.6.0 working with DSAlign.
Thanks

Ah, ok, versions are usually incompatible with each other, so 0.6 probably won’t work with 0.7 code, but maybe Tilman can say more.

Please download 0.6.1 from the GitHub release page.
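A rough sketch of doing that by hand, in case it helps. The asset URL below is an assumption (the Linux CPU build as I would expect it to be named on the release page), so please check the page for the file that matches your platform. The target directory dependencies/deepspeech is where the align script looked for generate_trie in the traceback above; the function names are made up.

```python
import os
import tarfile
import urllib.request

# ASSUMPTION: asset name for the Linux CPU build of the 0.6.1 native client.
# Verify it on the GitHub release page before relying on it.
RELEASE_URL = (
    "https://github.com/mozilla/DeepSpeech/releases/download/"
    "v0.6.1/native_client.amd64.cpu.linux.tar.xz"
)


def extract_native_client(archive_path, target_dir="dependencies/deepspeech"):
    """Unpack a .tar.xz archive into target_dir and return the sorted directory listing."""
    os.makedirs(target_dir, exist_ok=True)
    with tarfile.open(archive_path, "r:xz") as archive:
        archive.extractall(target_dir)
    return sorted(os.listdir(target_dir))


def fetch_native_client(url=RELEASE_URL, target_dir="dependencies/deepspeech"):
    """Download the archive into target_dir, then unpack it there."""
    os.makedirs(target_dir, exist_ok=True)
    archive_path = os.path.join(target_dir, "native_client.tar.xz")
    urllib.request.urlretrieve(url, archive_path)
    return extract_native_client(archive_path, target_dir)


# Example usage (performs a network download):
#   print(fetch_native_client())
```

After unpacking, it is worth checking that generate_trie shows up in the returned listing before running the align script again.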
