Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Intro to Airflow - From Zero to Hero
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Lee Wei
February 17, 2024
1.1k
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
Intro to Airflow - From Zero to Hero
Lee Wei
February 17, 2024
More Decks by Lee Wei
See All by Lee Wei
YourAl Is Only As Good As Your Data Pipeline
leew
0
7
觸發觸發器器,那個你可能不熟的 Apache Airflow 元件
leew
0
84
20251127 開源菜雞的隨意雜談
leew
0
49
住手啊,這樣用 Python 寫程式沒有人會獲得幸福的
leew
0
91
Unlocking the Future of Data Pipeline
leew
0
270
Hold on! You have a data team in PyCon Taiwan!
leew
0
55
朝聖之路
leew
0
130
20250328 Airflow 3.0 The First Glance
leew
0
410
踏入開源的第一步
leew
0
270
Featured
See All Featured
Hiding What from Whom? A Critical Review of the History of Programming languages for Music
tomoyanonymous
2
850
[Rails World 2023 - Day 1 Closing Keynote] - The Magic of Rails
eileencodes
38
2.9k
B2B Lead Gen: Tactics, Traps & Triumph
marketingsoph
0
140
Utilizing Notion as your number one productivity tool
mfonobong
4
320
Game over? The fight for quality and originality in the time of robots
wayneb77
1
200
Side Projects
sachag
455
43k
Leo the Paperboy
mayatellez
7
1.8k
Jess Joyce - The Pitfalls of Following Frameworks
techseoconnect
PRO
1
170
How People are Using Generative and Agentic AI to Supercharge Their Products, Projects, Services and Value Streams Today
helenjbeal
1
210
16th Malabo Montpellier Forum Presentation
akademiya2063
PRO
0
140
Git: the NoSQL Database
bkeepers
PRO
432
67k
How to Build an AI Search Optimization Roadmap - Criteria and Steps to Take #SEOIRL
aleyda
1
2.1k
Transcript
Intro to Airflow From Zero to Hero
❗Disclaimer❗
As this is an Airflow sharing, ...
None
$ cat weilee.py __name__ = 李唯 / Wei Lee __what_i_am_doing__
= [ Software Engineer @ Astronomer, Volunteers @ PyCon Taiwan, Maintainer of commitizen-tools, ] __github__ = G Lee-W __linkedin__ = l clleew __site__ = p http://wei-lee.me
File "weilee.py", line 1 __name__ = 李唯 / Wei Lee
^^^ SyntaxError: invalid syntax $ python weilee.py
Let's start with FAQ
Question 1 Are you an airflow committer?
No
Question 2 Are folks in your team all committers?
Well...You tell me. At least I'm not.
Question 3 You must have made significant contributions to Airflow
before joining Astronomer, right?
This is the sole PR I made before joining.
Question 4 Does Astronomer have enough votes to push/block whatever
feature?
The truth is I don't know. From what I heard,
we could, but we don't do it this way
What is Airflow Apache Airflow™ is an open-source platform for
developing, scheduling, and monitoring batch-oriented workflows. - from the Official Site
What Airflow is not? ❌ • The best framework to
run event-based workflows infinitely. • The best streaming solution.
Why Airflow • dynamic • scalable • interactive through UI,
CLI, REST API • extensible
Why Airflow A large user base
Why Airflow Active Development
Why Airflow Integrations with 3-party projects "JSCZUF "MJCBCB "NB[PO "QQSJTF
"TBOB "SBOHP%# "QBDIF4QBSL "QBDIF1JOPU "QBDIF1JH "QBDIF-JWZ "QBDIF,ZMJO "QBDIF,BGLB "QBDIF)JWF "QBDIF)%'4 "QBDIF'MJOL "QBDIF%SVJE "QBDIF%SJMM "QBDIF$BTTBOESB "QBDIF#FBN %PDLFS %JTDPSE %JOHEJOH ECU %BUBEPH %BUBCSJDLT $PNNPO42- $PIFSF ,VCFSOFUFT $FMFSZ +JSB *#.$MPVEBOU )551 )BTIJDPSQ H31$ (PPHMF 'BDFCPPL '51 'BDFCPPL &YBTPM &MBTUJDTFBSDI 0QFO-JOFBHF 0QFO"* 0QFO'BB4 0%#$ /FPK .Z42- .POHP%# 8JO3. .442- 1431 .JDSPTPGU1PXFS4IFMM .JDSPTPGU"[VSF +FOLJOT +%#$ *."1 *OGMVY%# 1BQFSNJMM 1BHFSEVUZ 0SBDMF 0QTHFOJF 0QFO4FBSDI 4FHNFOU 4BNCB 4BMFTGPSDF 3FEJT 1SFTUP 1PTUHSF42- 1JOFDPOF 1H7FDUPS 5BCMFBV 5BCVMBS 44) 42-JUF 4OPXGMBLF 4.51 4MBDL 4JOHVMBSJUZ 4'51 4FOEHSJE 7FSUJDB 5SJOP 5FMFHSBN ;FOEFTL :BOEFY 8FBWJBUF
Why Airflow?
Why Airflow
How does Airflow work?
How does Airflow work? DAG Example
None
How Airflow works?
How does Airflow work? Trigger a DAG
How does Airflow work?
How does Airflow work? WebUI
How does Airflow work? XCom
How does Airflow work? Components Web Server Metadata Database Scheduler
Triggerer Worker DAG Processor
How does Airflow work? DAG authors change DAGs code Web
Server Metadata Database Scheduler Triggerer Worker DAG Processor
How does Airflow work? Users trigger DAGs from WebUI Web
Server Metadata Database Scheduler Triggerer Worker DAG Processor
How does Airflow work? Users trigger DAGs from WebUI Web
Server Metadata Database Scheduler Triggerer Worker DAG Processor
How does Airflow work? Users trigger DAGs from WebUI Web
Server Metadata Database Scheduler Triggerer Worker DAG Processor
How does Airflow work? Users trigger DAGs from WebUI Web
Server Metadata Database Scheduler Triggerer Worker DAG Processor
How does Airflow work? Deferrable Operators Traditional Deferrable Ref: https://docs.astronomer.io/learn/deferrable-operators
How does Airflow work? Deferrable Operators
How does Airflow work? Deferrable Operators Web Server Metadata Database
Scheduler Triggerer Worker DAG Processor
How does Airflow work? Deferrable Operators Web Server Metadata Database
Scheduler Triggerer Worker DAG Processor
How does Airflow work? Deferrable Operators Web Server Metadata Database
Scheduler Triggerer Worker DAG Processor
How does Airflow work? Deferrable Operators Web Server Metadata Database
Scheduler Triggerer Worker DAG Processor
How does Airflow work? Deferrable Operators Web Server Metadata Database
Scheduler Triggerer Worker DAG Processor
Dataset Data-aware scheduling
Dataset Data-aware scheduling
Dataset Data-aware scheduling
None
Dataset Data-aware scheduling
Dataset Data-aware scheduling
Dataset Data-aware scheduling
Dataset Data-aware scheduling
Dataset Data-aware scheduling
None
Taskflow a cleaner way to write DAG
Upcoming things
Hybrid Execution
DAG Versioning (?)
Astronomer products
Deprecating
Deprecating (?)
Ask-Astro
Ask-Astro Slack bot in Airflow slack
astronomer-cosmos
astro-cli
astro-cli $ astro dev start
Astronomer Click "Get Started Free"!
Upcoming evens
Airflow Taipei Meetup March 9th @ Dcard
Remember I mentioned I'm a ...
Maintainer of commitizen-tools
Volunteer of PyCon Taiwan
BTW we also use Airflow
Guess who taught us Airflow
Link to this slide
None