Scraping news sites/subtitles -- license question

Yeah, given that it’s an international project, we need to be really careful about copyright laws that only apply in one country, because there’s no way of knowing whether a Thai news article is actually a syndication or translation of an item from a British/American/whatever newswire service that does have copyright and terms of service attached. As a rule, unless the source itself explicitly states that the text is public domain / CC0, we’re unlikely to accept it.