Remove 2011 Remove Informatics Remove Machine Learning Remove Visualization
article thumbnail

Build efficient, cross-Regional, I/O-intensive workloads with Dask on AWS

AWS Big Data

Dataset Variables Disk Size Xarray Dataset Size Region ERA5 2011–2020 (120 netcdf files) 53.5GB 364.1 ERA5 ( historic_temp_regridded ) us-east-1 1512 711 427 202 Difference ( propogated pool ) us-west-2 and us-east-1 1527 906 469 251 The following graph visualizes the performance and scale.