Parallelism with Python - Parallel DataFrames with Dask

Parallelism with Python - Parallel DataFrames with Dask

Tags
python
tutorial
dev
date
Mar 20, 2023
  • Data use case
  • What is pandas good for
    • Vectorized operations
  • Dask introduction
    • Dask basic example
    • What is a worker?
      • Threads vs Processes vs Cluster
      • When to use when
    • What is a partition?
  • Best use case → Map partitions
    • Dataframe example
 
Â