I have a problem with a large object (400mb pickled) I need to use in a UDF.
The object is pickled and on every worker but I don't know how to have it load on a worker outside the UDF which causes it to be reloaded for every row.
Broadcast hasn't really helped the overhead of loading this up for every task crashes everything in my dev environment.