changes:
- increased max message size from `2**16` to `2**64`
- check every second for whether all blocks have finished in the server event loop
- use context manager within run_blockwise to ensure process terminates
- remove deep copy in the calls to `shrink`. This was significantly slowing down the scheduler with excessive copying