503 error when using xarray.concat #108

Closed
MatthewJA opened this issue Aug 24, 2020 · 4 comments
@MatthewJA (Contributor)

I'm getting a 503 error with the following server-side error log:

```
Traceback (most recent call last):
  File "/env/lib/python3.6/site-packages/tornado/tcpclient.py", line 143, in on_connect_done
    stream = future.result()
tornado.iostream.StreamClosedError: Stream is closed

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/env/lib/python3.6/site-packages/tornado/web.py", line 1703, in _execute
    result = await result
  File "/env/lib/python3.6/site-packages/jupyter_server_proxy/websocket.py", line 97, in get
    return await self.http_get(*args, **kwargs)
  File "/env/lib/python3.6/site-packages/jupyter_server_proxy/handlers.py", line 359, in http_get
    return await self.proxy(port, proxied_path)
  File "/env/lib/python3.6/site-packages/jupyter_server_proxy/handlers.py", line 225, in proxy
    response = await client.fetch(req, raise_error=False)
  File "/env/lib/python3.6/site-packages/tornado/iostream.py", line 1226, in connect
    self.socket.connect(address)
OSError: [Errno 99] Cannot assign requested address

[I 2020-08-24 07:09:09.191 SingleUserNotebookApp kernelmanager:179] Kernel started: 33c4e177-ff22-4b83-915c-6af0374e1a54, name: python3
```

It occurs only when I use xarray.concat on a reasonably large dataset. I'm using the XL server and expect to have sufficient memory to store the whole thing.
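For reference, a minimal sketch of the failing pattern (sizes scaled down, and the dataset contents here are hypothetical stand-ins for the real inputs, which were tens of GiB):

```python
import numpy as np
import xarray as xr

# Hypothetical stand-ins for the real datasets: several large
# float64 rasters to be stacked along a new "time" dimension.
parts = [
    xr.Dataset({"red": (("y", "x"), np.random.rand(5_000, 5_000))})
    for _ in range(4)
]

# xr.concat materialises the combined result in memory. With large
# enough inputs, the kernel exceeds its memory limit and is killed,
# which jupyter-server-proxy then surfaces as a 503.
combined = xr.concat(parts, dim="time")
```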

@whatnick (Contributor)

Possibly related to pod status not arriving in the Hub cleanly: jupyterhub/kubespawner#233. Needs further work upstream in JupyterHub, outside of DEA scope.

@MatthewJA (Contributor, Author)

[screenshot: memory usage plot from the sandbox showing the spike during concat]
concat produced a ~30-60 GiB array but spiked to 131 GB during the operation. On the XXL sandbox this was fine and no 503 error was observed.
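A quick way to estimate that footprint before paying for the concat itself; the `parts` list here is a hypothetical stand-in for the real inputs:

```python
import numpy as np
import xarray as xr

# Hypothetical stand-ins for the real inputs.
parts = [
    xr.Dataset({"red": (("y", "x"), np.random.rand(2_000, 2_000))})
    for _ in range(4)
]

# .nbytes reports the in-memory footprint, so the size of the concat
# result can be estimated up front; peak usage during concat was
# observed above to run to roughly double that.
total_gib = sum(ds.nbytes for ds in parts) / 2**30
print(f"combined result ~{total_gib:.2f} GiB; expect a peak near ~{2 * total_gib:.2f} GiB")
```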

@whatnick (Contributor)

@MatthewJA can we resolve this?

@robbibt (Member) commented Aug 28, 2020

@MatthewJA Just a heads-up: xr.concat is an extremely memory-hungry function that typically causes memory spikes of roughly twice the total memory of its inputs as it combines them. There's often no real alternative to using it (until recently it was one of the main problems with the load_ard function), but if you're using it on large datasets, make sure all your data types are as efficient as possible (e.g. float32 instead of float64, or ideally int16).
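A minimal sketch of that suggestion, assuming hypothetical float64 inputs; in practice the datasets would come from load_ard or dc.load:

```python
import numpy as np
import xarray as xr

# Hypothetical float64 inputs standing in for the real datasets.
parts = [
    xr.Dataset({"red": (("y", "x"), np.random.rand(1_000, 1_000))})
    for _ in range(3)
]

# Downcasting float64 -> float32 halves the footprint of the inputs,
# and therefore of the ~2x spike during concat. int16 is smaller
# still, but requires an integer nodata convention.
parts = [ds.astype(np.float32) for ds in parts]

combined = xr.concat(parts, dim="time")
```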
