Privacy concerns about dataset metadata

Similar question here. The proposal states that:

That broadly makes sense to me (I'm not a speech recognition expert): I understand that, in general, the more data, the better. But, again as a non-expert, I find it difficult to imagine how location would be used when people build tools with this data. What are some examples of applications where this information is useful? I think including this kind of context in the proposal would help people give better-informed feedback.