Python and PySpark starter for AWS Glue data engineering onboarding.
Data Engineering Onboarding Starter is a 10-step kit for learning data engineering with Python and PySpark on AWS Glue. It includes example ETL scripts, sample datasets, infrastructure-as-code templates, and CI/CD workflows to deploy and test Glue jobs, suited for engineers building serverless ETL pipelines on AWS.
0