I am contributing to Common Voice by adding sentences to the corpus. The problem is it is difficult to find Public Domain content according to the US laws. I am not entirely clear about US laws regarding works of deceased authors going public domain.
In India, once an author is deceased, their works go public domain 60 years after their day of passing. This is established under the Copyright Act, 1957.
That makes it difficult to find public domain text which are written the way we Bengalis speak. Bengali used to spoken in one way (Chalit or common-tounge) and written in one way (Sadhu or Sage). This divide started to vanish only some decades ago. So, the works of authors who wrote in the way we speak are mostly still under copyright.
Very few works of literature exist in public domain which are in common-tongue.
I have found a novel which is written in common-tongue and is public domain under Indian Law. It was published in 1934. Does that mean it is not in public domain under US Law?
Two directions sought-
- Can we add text which are in public domain in the country of origin, but might be under copyright in the US?
- More clarification about US Copyright Law. Is it true that work published after 1925 are still not in public domain?