How are the dev/test/train datasets split?

They will be re-generated using the CorporaCreator which will not maintain the previous splits.

1 Like