Build efficient, cross-Regional, I/O-intensive workloads with Dask on AWS

AWS Big Data

Amazon’s Open Data Sponsorship Program allows organizations to host datasets free of charge on AWS. These datasets are hosted in AWS Regions across the world and are available for public use. Data scientists work in a Jupyter notebook hosted on Amazon SageMaker, which connects to the Dask scheduler and runs workloads on it.
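
The connection from the notebook to the scheduler might look roughly like the following. This is a minimal sketch, not the article's own code: the hostname `dask-scheduler.example.internal` and the helper names `scheduler_address` and `run_on_scheduler` are hypothetical, and it assumes the scheduler listens on Dask's default port 8786 and is reachable from the notebook's network.

```python
def scheduler_address(host: str, port: int = 8786) -> str:
    """Build a TCP address for a Dask scheduler (8786 is Dask's default port)."""
    return f"tcp://{host}:{port}"


def run_on_scheduler(address: str) -> float:
    """Connect to a running Dask scheduler and compute the mean of a random array.

    Imports are kept inside the function so the address helper above can be
    used even where dask is not installed.
    """
    from dask.distributed import Client
    import dask.array as da

    # The context manager closes the client connection when the work is done.
    with Client(address):
        # A chunked array: each chunk becomes a task scheduled on the workers.
        x = da.random.random((10_000, 10_000), chunks=(1_000, 1_000))
        return float(x.mean().compute())


if __name__ == "__main__":
    # Hypothetical hostname; replace with the real scheduler endpoint.
    address = scheduler_address("dask-scheduler.example.internal")
    print(address)
    # Uncomment once a scheduler is actually reachable at that address:
    # print(run_on_scheduler(address))
```

Only the lightweight client runs in the notebook; the array chunks are generated and reduced on the Dask workers, and just the final scalar travels back.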