I just realized (after some empirical tests) that applying the limit function on a Dataset produces a new Dataset with only 1 partition. How come ?
I can't find any related questions. And reading the source code on LocalLimit and GlobalLimit didn't provide any insight as I'm not familiar with the internals.
This can be problematic as one might want to use something like limit(1000000) for whatever reason.