Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
[Journal club] Improved Mean Flows: On the Chal...
Search
Semantic Machine Intelligence Lab., Keio Univ.
PRO
December 24, 2025
Technology
200
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
[Journal club] Improved Mean Flows: On the Challenges of Fastforward Generative Models
Semantic Machine Intelligence Lab., Keio Univ.
PRO
December 24, 2025
More Decks by Semantic Machine Intelligence Lab., Keio Univ.
See All by Semantic Machine Intelligence Lab., Keio Univ.
[Journal club ] PHyCLIP: ðð-Product of Hyperbolic Factors Unifies Hierarchy and Compositionality in Vision-Language Representation Learning
keio_smilab
PRO
0
42
[Journal club] ReMEmbR: Building and Reasoning Over Long-Horizon Spatio-Temporal Memory for Robot Navigation
keio_smilab
PRO
0
100
[Journal club] ReLaGS: Relational Language Gaussian Splatting
keio_smilab
PRO
0
100
[Journal club] Flow as the Cross-Domain Manipulation Interface
keio_smilab
PRO
0
90
Mobi-ð: Mobilizing Your Robot Learning Policy
keio_smilab
PRO
0
160
A Gentle Introduction to Transformers
keio_smilab
PRO
16
6.9k
FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching
keio_smilab
PRO
0
58
[Journal club] VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model
keio_smilab
PRO
0
140
[Journal club] MemER: Scaling Up Memory for Robot Control via Experience Retrieval
keio_smilab
PRO
0
140
Other Decks in Technology
See All in Technology
ãããã·ããã¹ããããããšã³ã·ããã¢ããšã¯äœãïŒ
ryooob
0
290
è±SaaSïŒFDEãæ¯ããããããžã§ãã³ã°ãšåé¢èšèš
knih
0
300
MySQL & MySQL HeatWave Report - June 2026
freshdaz
0
100
10幎éã®ããã°çºä¿¡ãæ¯ãè¿ã£ãŠèŠããWebã¢ããªã±ãŒã·ã§ã³ãšã³ãžãã¢ãšããŠã®è»è·¡
stefafafan
0
180
WebGIS AI Agentã®ç޹ä»
_shimizu
0
550
äœãåãããèŠæ¥µããåãž â AIæä»£ã«åºãããšã³ãžãã¢ã®äŸ¡å€ãšåœ¹å²
rince
0
330
AI Agentãã·ã¹ãã ã«çµã¿èŸŒãåã«ãããåãåã£ãŠã¿ã
hayama17
0
130
GitHub Copilot ææ°ã¢ããããŒã â ãäžæ©å ãã®å®è·µæŽ»çšè¡
moulongzhang
5
1.7k
OTel à Datadog ã§ ãAI掻çšããèšæž¬ããæ¹åã«ç¹ãã
shihochan
2
630
èªåã詳ãããªãé åã§AIã䜿ã #ãããã¹2026
konifar
20
7.4k
Kiro Ambassador ãç®æã話
k_adachi_01
0
130
培åºèšè«ïŒECS vs EKSïŒ
daitak
3
1.7k
Featured
See All Featured
Into the Great Unknown - MozCon
thekraken
41
2.6k
Dominate Local Search Results - an insider guide to GBP, reviews, and Local SEO
greggifford
PRO
0
200
Evolution of real-time â Irina Nazarova, EuRuKo, 2024
irinanazarova
9
1.4k
JAMstack: Web Apps at Ludicrous Speed - All Things Open 2022
reverentgeek
1
480
Intergalactic Javascript Robots from Outer Space
tanoku
273
27k
Imperfection Machines: The Place of Print at Facebook
scottboms
270
14k
How to Grow Your eCommerce with AI & Automation
katarinadahlin
PRO
1
210
Storybookã®UI Testing Handbookãèªãã
zakiyama
31
6.8k
We Are The Robots
honzajavorek
0
250
From Ï to Pie charts
rasagy
0
220
The AI Revolution Will Not Be Monopolized: How open-source beats economies of scale, even for LLMs
inesmontani
PRO
3
3.5k
A Tale of Four Properties
chriscoyier
163
24k
Transcript
ææµŠåæç 究宀 効尟幞暹 Improved Mean Flows: On the Challenges of Fastforward
Generative Models Zhengyang Geng1,2,3,* Yiyang Lu4,2,â Zongze Wu3 Eli Shechtman3 J. Zico Kolter1 Kaiming He2 1CMU 2MIT 3Adobe 4THU Geng, Z., Lu, Y., Wu, Z., Shechtman, E., Kolter, J. Z., & He, K. (2025). Improved Mean Flows: On the Challenges of Fastforward Generative Models. arXiv preprint arXiv:2512.02012, 2025
æŠèŠ 2 âª èæ¯ïžMean Flows J 1-NFEã§âŸŒå質ãªâœ£æ L æ°åŒã«ç²ãè¿äŒŒãååš âª
ææ¡ïžImproved Mean Flows J Mean Flows ã«ãããæ°åŒçãªåé¡ãæ¹å J æè»ãª Classifier Free Guidance J in-context conditioning ã«ãã軜éå âª çµæ J 1-NFE ã§å€ãã® Multi-NFE ã¢ãã«ãäžåã
èæ¯ïžMean Flows ã®æ°åŒã¯äžæ£ç¢º 3 âª æ¡æ£ã¢ãã«ã Flow Matching ã¯âŸŒæ§èœã ãèšç®ã³ã¹ãã⟌ã âª
ODE ãè§£ãéã«å€ãã® NFE ãå¿ èŠ âª {1, few}-NFEã®ã¢ãã«ãå°é ⪠Mean Flows [Geng+, NeurIPS25] ⪠ç¬éé床ã§ã¯ãªãå¹³åé床ãäºæž¬ J 1-NFE ã§âŸŒå質ãªâœ£æãå¯èœ L GTã®èšç®ãå°é£ âª äžæ£ç¢ºãªè¿äŒŒïŒåŸè¿°ïŒ MoFlow [Fu+, CVPR25] Mean Flows [Geng+, NeurIPS25] Mean Flows [Geng+, NeurIPS25] ð§! ïžæå» ð ã«ããããã€ãºä»ãããŒã¿, ð¡, ðïžæå»
é¢é£ç ç©¶ïžMean Flows ã®æ¹å 4 âŒ¿æ³ ç¹åŸŽ AlphaFlow [Zhang+, 25] Flow
Matching ãã MeanFlow ãžæ®µéçã«ç§»âŸãã ã«ãªãã¥ã©ã åŠç¿âŒ¿æ³ Decoupled MeanFlow [Lee+, 25] äºååŠç¿æžã¿ Flow Matching ã¢ãã«ã fine-tuning ã㊠MeanFlow ã¢ãã«ãžå€æ CMT [Hu+, 25] äºååŠç¿ãšäºåŸåŠç¿ã®éã«âŒè²«æ§æå€±ã✀ããäžéåŠç¿ãå°âŒ äºåŸåŠç¿ã«ããã MeanFlow ã¢ãã«ã®æ§èœãåäž Decoupled Meanflow [Lee+, 25] AlphaFlow [Zhang+, 25]
⪠ã¢ãã«ã¯ä»»æã®æå» ð ããä»»æã®æå» ð¡ ãžã®å¹³åé床ãäºæž¬ 1ã¹ããã✣æãå¯èœïŒð = 0, ð¡
= 1ïŒ âª æå€±é¢æ° åæïŒ1/2ïŒ: Mean FlowsïŒæŠèŠïŒ 5 ð¡, ð â [0, 1] ð¥~ð!"#" ð~ð$%&'% (e.g. ã¬ãŠã¹ååž ) ð¢( ïžãã¥ãŒã©ã«ãããã¯ãŒã¯ sg ã» ïžstop gradient JVP ã» ïžJacobian Vector Product Mean Flows [Geng+, NeurIPS25]
åæïŒ2/2ïŒ: Mean FlowsïŒå°åºã»åé¡ç¹ïŒ 6 ð¢ ð§&, ð, ð¡ + ð¡
â ð ð ðð¡ ð¢ ð§&, ð, ð¡ = ð£ ð§& ð ðð¡ ð¡ â ð ð¢ ð§&, ð, ð¡ = ð ðð¡ - ' & ð£ ð§( ðð ð¢ ð§&, ð, ð¡ = ð£ ð§& â ð¡ â ð ð ðð¡ ð¢ ð§&, ð, ð¡ , where ç©ã®åŸ®å ⹠⹠⹠⎠ð§" ð¥ ð ð¡ â ð ãæããåŸïŒð¡ ã§åŸ®å
åæïŒ2/2ïŒ: Mean FlowsïŒå°åºã»åé¡ç¹ïŒ 7 ð¢ ð§&, ð, ð¡ + ð¡
â ð ð ðð¡ ð¢ ð§&, ð, ð¡ = ð£ ð§& ð ðð¡ ð¡ â ð ð¢ ð§&, ð, ð¡ = ð ðð¡ - ' & ð£ ð§( ðð ð¢ ð§&, ð, ð¡ = ð£ ð§& â ð¡ â ð ð ðð¡ ð¢ ð§&, ð, ð¡ , where ç©ã®åŸ®å = ð â ð¥, 0, 1 / ð# , ð$ , ð" ð¢ = ð â ð¥ / ð# ð¢ + 0 / ð$ ð¢ + 1 / ð" ð¢ J JVPã✀ããŠèšç®å¯èœ ð ðð¡ ð¢ ð§" , ð, ð¡ = ðð§" ðð¡ ð# ð¢ + ðð ðð¡ ð$ ð¢ + ðð¡ ðð¡ ð" ð¢ ⹠⹠⹠⎠ð§" ð¥ ð ð¡ â ð ãæããåŸïŒð¡ ã§åŸ®å
åæïŒ2/2ïŒ: Mean FlowsïŒå°åºã»åé¡ç¹ïŒ 8 ð¢ ð§&, ð, ð¡ + ð¡
â ð ð ðð¡ ð¢ ð§&, ð, ð¡ = ð£ ð§& ð ðð¡ ð¡ â ð ð¢ ð§&, ð, ð¡ = ð ðð¡ - ' & ð£ ð§( ðð ð¢ ð§&, ð, ð¡ = ð£ ð§& â ð¡ â ð ð ðð¡ ð¢ ð§&, ð, ð¡ , where ç©ã®åŸ®å = ð â ð¥, 0, 1 / ð# , ð$ , ð" ð¢ = ð â ð¥ / ð# ð¢ + 0 / ð$ ð¢ + 1 / ð" ð¢ J JVPã✀ããŠèšç®å¯èœ ð ðð¡ ð¢ ð§" , ð, ð¡ = ðð§" ðð¡ ð# ð¢ + ðð ðð¡ ð$ ð¢ + ðð¡ ðð¡ ð" ð¢ ⹠⹠⹠⎠L åé¡â ð§" ð¥ ð ð¡ â ð ãæããåŸïŒð¡ ã§åŸ®å ð ðð ð â ð L åé¡â¡ L Marginal Velocity ã Conditional Velocity ã§è¿äŒŒ
ææ¡âŒ¿æ³ (1/4) : ð-ð¥ðšð¬ð¬ (å¹³åé床ã§ã¯ãªãç¬éé床ãååž°) 9 ⪠ð-ð¥ðšð¬ð¬ ⪠ð-ð¥ðšð¬ð¬
â , where L GTã®èšç®ãå°é£ J ç¬éé床ãååž° ð£ ð§& = ð¢ ð§& , ð, ð¡ + ð¡ â ð ð ðð¡ ð¢ ð§& , ð, ð¡ ð¢ ð§&, ð, ð¡ = ð£ ð§& â ð¡ â ð ð ðð¡ ð¢ ð§&, ð, ð¡ ⎠ð¢454
ææ¡âŒ¿æ³ (2/4) : JVPãžã®äžé©åãªâŒâŒãæ¹å 10 ⪠Mean Flows (MF) âª
Improved Mean Flows (iMF) = ð â ð¥, 0, 1 / ð# , ð$ , ð" ð¢ = ð â ð¥ / ð# ð¢ + 0 / ð$ ð¢ + 1 / ð" ð¢ åæ²: Marginal Velocity ã Conditional Velocity ã§è¿äŒŒ ð ðð¡ ð¢ ð§" , ð, ð¡ = ðð§" ðð¡ ð# ð¢ + ðð ðð¡ ð$ ð¢ + ðð¡ ðð¡ ð" ð¢ L äžé©åãªè¿äŒŒ (ð¢6 ; ð£6 ) , where ð£6 (ð§4 , ð¡) = ð¢6 (ð§4 , ð¡, ð¡) L MFã®æå€±ã¯å¢å
ææ¡âŒ¿æ³ (3/4) : æšè«æã«ãã©ã¡ãŒã¿ã決å®å¯èœãªCFG 11 ⪠åæ: Classifier Free Guidance
(CFG) J æšè«æã«æ¡ä»¶âŒâŒ{æ, ç¡}ã®ã¢ãã«ã®éã¿ã¥ãåã✀ããŠæ§èœåäž âª Mean Flowsã«ãããCFG ⪠Flexible Guidance (ð ãæ¡ä»¶ãšããŠâŒâŒ) J æšè«æã« ð ãæ±ºå®å¯èœãªæè»ãªèšèš & ð£# ð§$ , ð¡ ð) = 1 + ð ð£% ð§$ , ð¡| ð â ð ð£% ð§$ , ð¡ ð) & ð£$ = ð ð â ð¥ â 1 â ð ð¢ # &'( ð§, ð¡, ð¡ where ð¢)() &'( = & ð£$ â ð¡ â ð JVP(ð¢ # &'(; & ð£$ ) , â sg ð¢!"! #$" , ðïžæ¡ä»¶ ðïžç¡æ¡ä»¶ ðïžã¬ã€ãã³ã¹ã¹ã±ãŒã« L èšç·Žæã« ð ãèšå® < | ð, Ï ð§$ | ð, Ï
ææ¡âŒ¿æ³ (4/4) : In-context Conditioning ã«ãã軜éå 12 ⪠åæïžDiT ã¢ãŒããã¯ãã£
⪠æ¡ä»¶ã¯ AdaLN-zero ã§åŠç âª å šãŠã®æ¡ä»¶ãåã«âŸããŠãã L è€æ°æ¡ä»¶ãé©åã«æ±ããªã L ãã©ã¡ãŒã¿æ°ãå€ã n ä»ã®æ¡ä»¶ä»ãã¯æ§èœãäžâŒå ⪠Improved In-context Conditioning ⪠æ¡ä»¶ããã€ãºã« concat ã㊠DiT ã«âŒâŒ J æ¡ä»¶ã®ããŒã¯ã³æ°ãè€æ°åã«ããããšã§å®âœ€å¯èœã« (class token: 8, ãã®ä»: 4) J AdaLN-zero ãåãé€ããŠ33%軜éå (e.g. 133M â 89M) DiT [Peebles+, ICCV23]
ææ¡âŒ¿æ³ (4/4) : In-context Conditioning ã«ãã軜éå 13 ⪠åæïžDiT ã¢ãŒããã¯ãã£
⪠æ¡ä»¶ã¯ AdaLN-zero ã§åŠç âª å šãŠã®æ¡ä»¶ãåã«âŸããŠãã L è€æ°æ¡ä»¶ãé©åã«æ±ããªã L ãã©ã¡ãŒã¿æ°ãå€ã n ä»ã®æ¡ä»¶ä»ãã¯æ§èœãäžâŒå ⪠Improved In-context Conditioning ⪠æ¡ä»¶ããã€ãºã« concat ã㊠Transformer ã«âŒâŒ J æ¡ä»¶ã®ããŒã¯ã³æ°ãè€æ°åã«ããããšã§å®âœ€å¯èœã« (class token: 8, ãã®ä»: 4) J AdaLN-zero ãåãé€ããŠè»œéå (e.g. 133M â 89M) DiT [Peebles+, ICCV23] AdaLN-zero Cross-Attn In-context Conditioning
ææ¡âŒ¿æ³ (4/4) : In-context Conditioning ã«ãã軜éå 14 ⪠åæïžDiT ã¢ãŒããã¯ãã£
⪠æ¡ä»¶ã¯ AdaLN-zero ã§åŠç âª å šãŠã®æ¡ä»¶ãåã«âŸããŠãã L è€æ°æ¡ä»¶ãé©åã«æ±ããªã L ãã©ã¡ãŒã¿æ°ãå€ã n ä»ã®æ¡ä»¶ä»ãã¯æ§èœãäžâŒå ⪠Improved In-context Conditioning ⪠æ¡ä»¶ããã€ãºã« concat ã㊠DiT ã«âŒâŒ J æ¡ä»¶ã®ããŒã¯ã³æ°ãè€æ°åã«ããããšã§å®âœ€å¯èœã« (class token: 8, ãã®ä»: 4) J AdaLN-zero ãåãé€ããŠ33%軜éå (e.g. 133M â 89M) DiT [Peebles+, ICCV23]
å®éšèšå® 15 ⪠ã¿ã¹ã¯ïžã¯ã©ã¹ã©ãã«ã«åºã¥ãç»å✣æ ⪠ããŒã¿ã»ããïžImageNetïŒ256x256ïŒ âª âœ£æâœ æ³ âª æœåšç©ºéã§âœ£æ
(4x32x32) ⪠äºååŠç¿æžã¿ã®VAEã䜿✀ ⪠{1,2}-NFE ✣æ ⪠è©äŸ¡ææšïžFréchet Inception Distance (FID) ⪠ããŒããŠã§ã¢æ§æïžèšèŒãªã ⪠èšç·ŽæéïžèšèŒãªã
å®éççµæïž1-NFE ã§å€ãã® Multi-NFE ã¢ãã«ãäžåã 16 ⪠Multi-NFE ã¢ãã«ãšâœèŒããŠ500~2000åã®1ã®æšè«åæ°ã ãæ§èœã¯åç J scratch
ããèšç·Žå¯èœïŒ=èžçãªãïŒãªã®ã§æåž«ã¢ãã«ãäžèŠ âŸŒå質ã ã èšç®ã³ã¹ãã⟌ã L
Ablation studies 17 J CFG (Flexible Guidance) ã✀ããããšã§æ§èœåäž J in-context
conditioning ã«ãã軜éåãã€æ§èœåäž â» aux. head ⊠ð£% ✀㮠head ã远å
宿§ççµæïžâŸŒå質ãªç»åã✣æ 18
ãŸãšã 19 âª èæ¯ïžMean Flows J 1-NFEã§âŸŒå質ãªâœ£æ L æ°åŒã«ç²ãè¿äŒŒãååš âª
ææ¡ïžImproved Mean Flows J Mean Flows ã«ãããæ°åŒçãªåé¡ãæ¹å J æè»ãª Classifier Free Guidance J in-context conditioning ã«ãã軜éå âª çµæ J 1-NFE ã§å€ãã® Multi-NFE ã¢ãã«ãäžåã