Astro: SDK

Astro: SDK

Speed up your DAG development with the Astro SDK

rate limit

Code not recognized.

Overview

If you want to speed up your DAG development dramatically, this course is for you!
It covers the fundamental of the Astro SDK and what you need to manipulate data blazingly fast within your pipelines.

The course includes lectures, demonstrations, and exercises.

🎯 Objectives

By the end of this course, you will be able to:

  • Easily switch between multiple databases and file sources in your pipeline.
  • Manipulate tables from different schemas and platforms.
  • Run templated SQL queries.
  • Load and Transform tables directly in tables or using DataFrames.
  • Use Tables and Files as Datasets.
  • Mix the Astro SDK with Dynamic Task Mapping.

👥 Audience

Who should take this course:

  • Data Engineers
  • Data Analysts
  • Data Scientists
  • Software Engineers

🚦 Prerequisites

💻 Setup Requirements

You need to have the following:

  • Docker and Docker compose on your computer (cf: get Docker)
  • The Astro CLI
  • An internet connection with permission to access GitHub and download Python dependencies
  • Access to a web browser

Curriculum0 hr 50 min

  • Introduction
  • Welcome! 0 hr 3 min
  • Why Astro SDK? 0 hr 4 min
  • The Ingredients
  • Spinning the Table 0 hr 3 min
  • Bring me that File! 0 hr 7 min
  • More on Load File Operator
  • Lights Camera Action!
  • Set the Stage 0 hr 7 min
  • SDK in action 0 hr 10 min
  • Practice: Using SDK with Snowflake and S3 💪🏻
  • Data Quality Checks 0 hr 9 min
  • Power of SDK
  • SDK & Datasets 0 hr 4 min
  • Dynamic Tasks with SDK 0 hr 5 min
  • Practice: Putting Everything Together
  • Finishing Up
  • Quiz!
  • Resources
  • Summary
  • How was It?

Overview

If you want to speed up your DAG development dramatically, this course is for you!
It covers the fundamental of the Astro SDK and what you need to manipulate data blazingly fast within your pipelines.

The course includes lectures, demonstrations, and exercises.

🎯 Objectives

By the end of this course, you will be able to:

  • Easily switch between multiple databases and file sources in your pipeline.
  • Manipulate tables from different schemas and platforms.
  • Run templated SQL queries.
  • Load and Transform tables directly in tables or using DataFrames.
  • Use Tables and Files as Datasets.
  • Mix the Astro SDK with Dynamic Task Mapping.

👥 Audience

Who should take this course:

  • Data Engineers
  • Data Analysts
  • Data Scientists
  • Software Engineers

🚦 Prerequisites

💻 Setup Requirements

You need to have the following:

  • Docker and Docker compose on your computer (cf: get Docker)
  • The Astro CLI
  • An internet connection with permission to access GitHub and download Python dependencies
  • Access to a web browser

Curriculum0 hr 50 min

  • Introduction
  • Welcome! 0 hr 3 min
  • Why Astro SDK? 0 hr 4 min
  • The Ingredients
  • Spinning the Table 0 hr 3 min
  • Bring me that File! 0 hr 7 min
  • More on Load File Operator
  • Lights Camera Action!
  • Set the Stage 0 hr 7 min
  • SDK in action 0 hr 10 min
  • Practice: Using SDK with Snowflake and S3 💪🏻
  • Data Quality Checks 0 hr 9 min
  • Power of SDK
  • SDK & Datasets 0 hr 4 min
  • Dynamic Tasks with SDK 0 hr 5 min
  • Practice: Putting Everything Together
  • Finishing Up
  • Quiz!
  • Resources
  • Summary
  • How was It?