Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Intro to Airflow - From Zero to Hero
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Lee Wei
February 17, 2024
1.1k
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
Intro to Airflow - From Zero to Hero
Lee Wei
February 17, 2024
More Decks by Lee Wei
See All by Lee Wei
YourAl Is Only As Good As Your Data Pipeline
leew
0
7
觸發觸發器器,那個你可能不熟的 Apache Airflow 元件
leew
0
84
20251127 開源菜雞的隨意雜談
leew
0
49
住手啊,這樣用 Python 寫程式沒有人會獲得幸福的
leew
0
91
Unlocking the Future of Data Pipeline
leew
0
270
Hold on! You have a data team in PyCon Taiwan!
leew
0
55
朝聖之路
leew
0
130
20250328 Airflow 3.0 The First Glance
leew
0
410
踏入開源的第一步
leew
0
270
Featured
See All Featured
How To Speak Unicorn (iThemes Webinar)
marktimemedia
1
480
Bioeconomy Workshop: Dr. Julius Ecuru, Opportunities for a Bioeconomy in West Africa
akademiya2063
PRO
1
140
How to Talk to Developers About Accessibility
jct
2
230
Leading Effective Engineering Teams in the AI Era
addyosmani
9
2k
Testing 201, or: Great Expectations
jmmastey
46
8.2k
Kristin Tynski - Automating Marketing Tasks With AI
techseoconnect
PRO
0
270
DBのスキルで生き残る技術 - AI時代におけるテーブル設計の勘所
soudai
PRO
65
55k
Technical Leadership for Architectural Decision Making
baasie
3
400
End of SEO as We Know It (SMX Advanced Version)
ipullrank
3
4.2k
16th Malabo Montpellier Forum Presentation
akademiya2063
PRO
0
140
The Illustrated Guide to Node.js - THAT Conference 2024
reverentgeek
1
380
Introduction to Domain-Driven Design and Collaborative software design
baasie
1
840
Transcript
Intro to Airflow From Zero to Hero
❗Disclaimer❗
As this is an Airflow sharing, ...
None
$ cat weilee.py __name__ = 李唯 / Wei Lee __what_i_am_doing__
= [ Software Engineer @ Astronomer, Volunteers @ PyCon Taiwan, Maintainer of commitizen-tools, ] __github__ = G Lee-W __linkedin__ = l clleew __site__ = p http://wei-lee.me
File "weilee.py", line 1 __name__ = 李唯 / Wei Lee
^^^ SyntaxError: invalid syntax $ python weilee.py
Let's start with FAQ
Question 1 Are you an airflow committer?
No
Question 2 Are folks in your team all committers?
Well...You tell me. At least I'm not.
Question 3 You must have made significant contributions to Airflow
before joining Astronomer, right?
This is the sole PR I made before joining.
Question 4 Does Astronomer have enough votes to push/block whatever
feature?
The truth is I don't know. From what I heard,
we could, but we don't do it this way
What is Airflow Apache Airflow™ is an open-source platform for
developing, scheduling, and monitoring batch-oriented workflows. - from the Official Site
What Airflow is not? ❌ • The best framework to
run event-based workflows infinitely. • The best streaming solution.
Why Airflow • dynamic • scalable • interactive through UI,
CLI, REST API • extensible
Why Airflow A large user base
Why Airflow Active Development
Why Airflow Integrations with 3-party projects "JSCZUF "MJCBCB "NB[PO "QQSJTF
"TBOB "SBOHP%# "QBDIF4QBSL "QBDIF1JOPU "QBDIF1JH "QBDIF-JWZ "QBDIF,ZMJO "QBDIF,BGLB "QBDIF)JWF "QBDIF)%'4 "QBDIF'MJOL "QBDIF%SVJE "QBDIF%SJMM "QBDIF$BTTBOESB "QBDIF#FBN %PDLFS %JTDPSE %JOHEJOH ECU %BUBEPH %BUBCSJDLT $PNNPO42- $PIFSF ,VCFSOFUFT $FMFSZ +JSB *#.$MPVEBOU )551 )BTIJDPSQ H31$ (PPHMF 'BDFCPPL '51 'BDFCPPL &YBTPM &MBTUJDTFBSDI 0QFO-JOFBHF 0QFO"* 0QFO'BB4 0%#$ /FPK .Z42- .POHP%# 8JO3. .442- 1431 .JDSPTPGU1PXFS4IFMM .JDSPTPGU"[VSF +FOLJOT +%#$ *."1 *OGMVY%# 1BQFSNJMM 1BHFSEVUZ 0SBDMF 0QTHFOJF 0QFO4FBSDI 4FHNFOU 4BNCB 4BMFTGPSDF 3FEJT 1SFTUP 1PTUHSF42- 1JOFDPOF 1H7FDUPS 5BCMFBV 5BCVMBS 44) 42-JUF 4OPXGMBLF 4.51 4MBDL 4JOHVMBSJUZ 4'51 4FOEHSJE 7FSUJDB 5SJOP 5FMFHSBN ;FOEFTL :BOEFY 8FBWJBUF
Why Airflow?
Why Airflow
How does Airflow work?
How does Airflow work? DAG Example
None
How Airflow works?
How does Airflow work? Trigger a DAG
How does Airflow work?
How does Airflow work? WebUI
How does Airflow work? XCom
How does Airflow work? Components Web Server Metadata Database Scheduler
Triggerer Worker DAG Processor
How does Airflow work? DAG authors change DAGs code Web
Server Metadata Database Scheduler Triggerer Worker DAG Processor
How does Airflow work? Users trigger DAGs from WebUI Web
Server Metadata Database Scheduler Triggerer Worker DAG Processor
How does Airflow work? Users trigger DAGs from WebUI Web
Server Metadata Database Scheduler Triggerer Worker DAG Processor
How does Airflow work? Users trigger DAGs from WebUI Web
Server Metadata Database Scheduler Triggerer Worker DAG Processor
How does Airflow work? Users trigger DAGs from WebUI Web
Server Metadata Database Scheduler Triggerer Worker DAG Processor
How does Airflow work? Deferrable Operators Traditional Deferrable Ref: https://docs.astronomer.io/learn/deferrable-operators
How does Airflow work? Deferrable Operators
How does Airflow work? Deferrable Operators Web Server Metadata Database
Scheduler Triggerer Worker DAG Processor
How does Airflow work? Deferrable Operators Web Server Metadata Database
Scheduler Triggerer Worker DAG Processor
How does Airflow work? Deferrable Operators Web Server Metadata Database
Scheduler Triggerer Worker DAG Processor
How does Airflow work? Deferrable Operators Web Server Metadata Database
Scheduler Triggerer Worker DAG Processor
How does Airflow work? Deferrable Operators Web Server Metadata Database
Scheduler Triggerer Worker DAG Processor
Dataset Data-aware scheduling
Dataset Data-aware scheduling
Dataset Data-aware scheduling
None
Dataset Data-aware scheduling
Dataset Data-aware scheduling
Dataset Data-aware scheduling
Dataset Data-aware scheduling
Dataset Data-aware scheduling
None
Taskflow a cleaner way to write DAG
Upcoming things
Hybrid Execution
DAG Versioning (?)
Astronomer products
Deprecating
Deprecating (?)
Ask-Astro
Ask-Astro Slack bot in Airflow slack
astronomer-cosmos
astro-cli
astro-cli $ astro dev start
Astronomer Click "Get Started Free"!
Upcoming evens
Airflow Taipei Meetup March 9th @ Dcard
Remember I mentioned I'm a ...
Maintainer of commitizen-tools
Volunteer of PyCon Taiwan
BTW we also use Airflow
Guess who taught us Airflow
Link to this slide
None