Data Strategie

Day-1 of learning PySpark

Reddit r/dataengineering

Summary

A Reddit user is starting to learn PySpark for ETL and plans to run and orchestrate his data pipelines with AWS Glue. To stay disciplined, he intends to post daily updates on his progress, along with any questions that come up.
