I prepare Polish dataset based on Europarl parallel corpus, I wrote details in new topic Polish dataset from Europarl - help needed since I need a help with QA. I think this dataset is sometimes specific, but most sentences are useful, I hope. Besides some “political” language there is many common sentences in modern Polish and common geographical names.
1 Like
Just a little reminder that this is still an option for 20 European languages.