Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
第3回関東kaggler会 🤔 妙だな... (Jun Koda)
Search
Jun Koda
February 15, 2025
3.9k
12
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
第3回関東kaggler会 🤔 妙だな... (Jun Koda)
Jun Koda
February 15, 2025
More Decks by Jun Koda
See All by Jun Koda
画像ディープラーニングコンペの基本
junkoda
6
2.7k
Featured
See All Featured
Measuring Dark Social's Impact On Conversion and Attribution
stephenakadiri
2
210
The Success of Rails: Ensuring Growth for the Next 100 Years
eileencodes
47
8.2k
Reality Check: Gamification 10 Years Later
codingconduct
0
2.2k
Unsuck your backbone
ammeep
672
58k
The Curse of the Amulet
leimatthew05
1
13k
The State of eCommerce SEO: How to Win in Today's Products SERPs - #SEOweek
aleyda
2
11k
svc-hook: hooking system calls on ARM64 by binary rewriting
retrage
2
290
エンジニアに許された特別な時間の終わり
watany
107
250k
"I'm Feeling Lucky" - Building Great Search Experiences for Today's Users (#IAC19)
danielanewman
231
23k
Side Projects
sachag
455
43k
GraphQLとの向き合い方2022年版
quramy
50
15k
Digital Projects Gone Horribly Wrong (And the UX Pros Who Still Save the Day) - Dean Schuster
uxyall
0
1.6k
Transcript
🤔 ົͩͳ... ᅳాɹ३ Jun Koda
None
ܦྺ • ςΩαεେֶͰཧͷ PhD • ӉͷγϛϡϨʔγϣϯ • ͦͷޙɺΦʔετϥϦΞɺΠλϦΞͰ ϙευΫݚڀऀ Ϗοάόϯޙ΄΅Ұ༷ͩͬͨμʔΫϚλʔ͕
ສ༗ҾྗͰҾ͔Ε͍͋ͬͯ͘
ܦྺ • ςΩαεେֶͰཧͷ PhD • ӉͷγϛϡϨʔγϣϯʢμʔΫϚλʔಉ࢜ͷສ༗Ҿྗʣ • ͦͷޙɺΦʔετϥϦΞɺΠλϦΞͰϙευΫݚڀऀ • ίϩφՒͷϩοΫμϯͰମௐΛ่͠ຊʹΓྍཆ
• Kaggle ͷۚͰ Kaggle ΛΕྑ͍ͷͰʁˠ ಇ͖ʹग़Α͏ͱͯ͠Δ
গͷਓ͔͠ؾ͍͍ͮͯͳ͍͜ͱʹؾ͍ͮͯҰಥഁͰۚϝμϧΛऔΔλΠϓ
ճస augmentationͰείΞ͕Լ͕Δʁ 🤔 ົͩͳ... • ࢲͷҰ൪ͷۀ • ͜ΕΛ͖͔͚ͬʹ1Ґʹ Google Research
- Identify Contrails to Reduce Global Warming (2023)
Benetech 4 + 3 min
ਪͷ͕࣌ؒ͘ͳͬͨ 🤔 ົͩͳ... • άϥϑը૾͔Β x y ͷʢɾจࣈʣΛಡΉ • ࢲؾ͔ͮͳ͔͚ͬͨͲ
discussion ʹ͋ͬͨ • A. Ϟσϧͷੑೳ্͕͕Εͦ͏͍͏͜ͱ͋ΔΑ => Ұཧ͋Δ • ੜϞσϧͷ͍ҙຯෆ໌ͷग़ྗͰ࣌ؒΛແବʹ͏ Benetech - Making Graphs Accessible (2023)
https://x.com/aryyyyy221/status/1670952873872744449 Private ςετσʔλ͕ͬͦ͜Γফ͑ͯͨ
Line Scatter 50 - 65% Dot Δҙຯͳ͠ Horizontal Bar Δҙຯͳ͠
Vertical Bar ͻͲ͍
• Scatter ͷ৳ͼͨ͘͞Μ͋ͬͨͷͰɺۚϝμϧऔΕ͔ͨ • άϥϑͷׂ߹Ѳ͓͖ͯͩͬͨ͘͠ • ͰݮΔલʹௐ͍ͯͨΒɺݮͬͨ͜ͱΛͰ͖͔ͨʁ 15Ґ → ௐΔํ๏
"LB Probing" bilzard ͞Μʮࠓ·ͰʹKaggleίϯϖͰͬͨLB Probingख๏ʹ͍ͭͯʯ https://zenn.dev/bilzard/articles/lb-probing-technique
USPTO 7 + 4 min
είΞֶ͕తʹ͋Γ͑ͳ͍ 🤔 ົͩͳ... USPTO - Explainable AI for Patent Professionals
ʢಛڐίϯϖʣ ͜ͷ໘ੵ 0.5 λʔήοτৗʹ 50 ݸͰ ਖ਼ղ࠷େͰ 25ݸ ֶతʹ࠷ߴ0.5ͩͱࢥͬͨΒ submission Ͱ 0.55? 0.8 ͑ΔΒ͍͠ ͕࣌ؒͳͯ͘͏Ήʹ ؒҧͬͨධՁࢦඪΛ࠷దԽͯ͠͠·ͬͨ 🔺ʮKaggle ͱධՁࢦඪΛ࠷దԽ͢Δڝٕʯ ਅͷධՁࢦඪΛѲͰ͖ͳ͔ͬͨͱল ؔ౦ Kagger ձʢୈ1ճʣcharmq ͞Μ KaggleͷऔΓΈํ ~validationฤ~ TP 25ݸͰείΞ 0.8 Λӽ͑Δ Theo Viel https://www.kaggle.com/competitions/uspto-explainable-ai/discussion/522199 Competition metric ͕ͱҧ͏ 1? ධՁࢦඪ MAP@50 ໘ੵ ~ 0.85
είΞ͕ 0.99? 🤔 ົͩͳ... ख࣋ͪͷΞϧΰϦζϜ࠷దԽख๏͕ҧ͍͗͢ΔͷͰɺ ͦ͏ݴ͏͜ͱ͋Δ͞ => ഊऀ ࣮ࢲʹͰ͖Δ hack
/ magic ͕͋ͬͨ
աૄίϯϖͰϫϯνϟϯۚϝμϧ Ͱɺ̍̌͘Β͍͔͠ࢀՃͯ͠ͳ͍͔Βʙ ͱݴ͏ಀ͛ಓ͢Βഁյ͞ΕΔ ← ίί
Ariel Ͱ͋ͬͨ Competition metric ͕ͱҧ͏ OK Ṗͷఆ NeurIPS - Ariel
Data Challenge 2024 Metric ͕յΕ͍ͯΔͱࢥͬͨΒʁ Test set ͷෆ֬ఆੑͱ metric ͷ͍͕ٙࠞ͟ΔͷͰ͍͠ LB probing ͱಉ͘͡ ੍ޚͰ͖ΔͷΛ (σ) ΛมԽͤͯ͞ είΞͷมԽΛݟΔ AmbrosM https://www.kaggle.com/competitions/ariel-data-challenge-2024/discussion/528114
select յΕ͍ͯͳ͍ The Pragmatic Programmer ɹʰୡਓϓϩάϥϚʔʱ • ·ͣࣗΛ͓ٙ͏ • ྫྷ੩ʹߟ͓͔͔͑ͯͬͨ͠Β
discussion Ͱ૬ஊͯ͠ΈΑ͏ʢಛʹ evaluation metric ͷ͜ͱͳΒʣ • CZIIΛٹͬͨέϩοϐઌੜྲྀੴ
Ventilator 11 + 5 min ۚϝμϧʹ݁ͼ͍ͭͨكͳྫ (1)
ਫ਼͕ྑ͗͢Δ 🤔 ົͩͳ... Google Brain - Ventilator Pressure Prediction (2021)
https://www.kaggle.com/competitions/ventilator-pressure-prediction/ Input: ਓݺٵثͷٵೖόϧϒ ut Target: ഏʢܕʣͷۭؾѹྗ pt ࣌ܥྻίϯϖ u0 p0 u1 p1 u2 p2 ut pt ɾɾɾ ɾɾɾ ํ LSTM ͳͲ Input Target ࣌ؒ →
1. ࠷ॳײ 2. ٻɾ୯७Խ: ࠷ॳͷ p0 ͳΜͯΘ͔ΔΘ͚ͳ͍ͷͰʁ 3. ࣮ݧ: ࣮ࡍ
u0 ͚ͩͰ p0 ͷޡࠩ 0.5 (MAE) 4. Bi-LSTM ͩͱ p0 ͷޡࠩ 0.15 5. ·͢·͓͔͍͢͠ u0 p0 t0 ͚ͩ ޡࠩ Δp0 ~ 0.5 u0 p0 u1 u2 ut Input Target Bi-LSTM ޡࠩ Δp0 ~ 0.15 (MAE) ະདྷͷೖྗ͕աڈͷѹྗʹӨڹΛ༩͑Δ͕ͣͳ͍ Կނʁ ? ɾɾɾ ਫ਼͕ྑ͗͢Δ ࣌ؒ → ୯७Խ
PID ੍ޚ Proportional - Integral - Derivative Controller u0 p0
u1 u2 ut Input Target • աڈͷग़ྗ͕ະདྷͷೖྗʹӨڹΛ༩͍͑ͯͨʂ • ग़ྗͷઢܕࣸ૾ͷ݁ՌͰ͋Δೖྗ͔Βग़ྗΛٯࢉ ͢Δઢܕͷͩͬͨʂ • K ͕৭ʑͳͷͰ LSTM શʹղ͚ͳ͍ ɾɾɾ ࣌ؒ → <latexit sha1_base64="cb481LZ0rfS2icOpNjk8woetUrI=">AAACKHicbZDNSgMxFIUz/tb6V3XpJlhERSgzIupGLLoR3FSwttDWIZPJtMFMJiR3hDL0cdz4Km5EFOnWJzFTu9DqhcDHOfdyc0+gBDfgukNnanpmdm6+sFBcXFpeWS2trd+aJNWU1WkiEt0MiGGCS1YHDoI1lWYkDgRrBPcXud94YNrwRN5AX7FOTLqSR5wSsJJfOkt3YQ+f4itfYZXjvkWO21yC795Bru3s4RB2RkaI25EmNAvVIAthYPv9UtmtuKPCf8EbQxmNq+aXXtthQtOYSaCCGNPyXAWdjGjgVLBBsZ0apgi9J13WsihJzEwnGx06wNtWCXGUaPsk4JH6cyIjsTH9OLCdMYGemfRy8T+vlUJ00sm4VCkwSb8XRanAkOA8NRxyzSiIvgVCNbd/xbRHbBRgsy3aELzJk//C7UHFO6ocXh+Wq+fjOApoE22hXeShY1RFl6iG6oiiR/SM3tC78+S8OB/O8Lt1yhnPbKBf5Xx+ATQ9ooM=</latexit> u(t) = Kpp(t) + Ki Z t 0 p(t0)dt0 + Kd dp dt (t) ग़ྗ (target)ɺͦͷඍੵΛೖྗʹ͏ͱ͍͍ײ͡ʹ੍ޚͰ͖Δ
Ұ෦ͷޡࠩΛશʹ 0 ʹͯ͠4Ґ ࠷ऴ
Ventilator ·ͱΊ 1. ҧײɾײ 2. ҧײΛߜΓࠐΉɻݴޠԽ 3. ࣮ݧͯ͠Λ໌֬ʹ͢Δ 4. ԾઆʢLeakage?ʣɾݕূ
=> ͦ͏Ͱͳ͍ 5. ͑Λݟͨʢओ࠵ऀͷจΛಡΜͩʣ
ඈߦػӢ 16 + 8 min
ճస augmentationͰੑೳ͕Լ͕Δʁ 🤔 ົͩͳ... • Ӵը૾͔ΒඈߦػӢΛݟ͚ͭΔ Semantic Segmention λεΫ •
7 timestep ͘Β͍ͷ࣌ؒํ͋Δ 2 + 1 ࣍ݩσʔλɻσʔλٿʹଟΊ • ࣌ͷ࣮ݧϩά Gmail ΛৼΓฦͬͯΈΑ͏ Google Research - Identify Contrails to Reduce Global Warming (2023) https://www.kaggle.com/competitions/google-research-identify-contrails-reduce-global-warming/
ΜͰΔ D4 (rot90 ͱస) augmentation Λ ͬͨΒείΞ͕མͪͯɺ͍Ζ͍ ΖͬͯݩʹΒͳ͍ ʮରশੑΛ࣋ͨͳ͍͜ͱͳΜͯ͋Δʁʯ ཧֶରশੑେ͖
ճసରশੑ͕͋Δͣͱͩ͜ΘͬͯΔ rot90 ͱస augmentation Ͱ Dice score 0.666 → 0.638 Լ͕Δ 7 / 3 U-Net ʹ rot90 + స augmentation ΛೖΕͨ
ཌ 7 / 4 ށͬͯΔ ॳΊͯͷ͜ͱͰށ͍ͬͯΔ ՄࢹԽͷ༧ఆ
• ࣌ؒ࣍ݩΛ͏ϓϩδΣΫτฒߦͯͬͯͨ͠ͷͰɺͦͬͪʹͬͯΔ • ؾసେࣄ • ͻͱͭͷࣄʹϋϚͬͯ͠·ͬͯྑ͘ͳ͍ 7 / 5 ಀආͨ͠ʂ
7 / 6 ಥવͯ͢Λཧղͯ͠Δ 0.1 pixel ޡ২ɻ0.5 pixel ͷ͜ͱͩͱࢥ͏ ೖྗΛճస͢ΔͱͲ͏ͳΔʁམͪண͍ͯϖϯͱࢴΛͬͯߟ͑ͯΈΔඞཁ͕͋Δͱࢥ͏
1st-place solution Jun Koda https://www.kaggle.com/competitions/google-research-identify-contrails-reduce-global-warming/discussion/430618 ճసͨ͠ೖྗΛϞσϧʹೖΕΔ࣮ݧ ճసͨ͠ϥϕϧͱൺֱ ՄࢹԽ & είΞܭࢉ ճసͯ͠είΞಉ͡Ͱ͋ͬͯ΄͍͠
·ͩԿϐΫηϧͣΕͯΔ͔͔ͬͯͳ͍ ※ 0.5 pixel γϑτ torch grid_sample() ΛͬͯΔ ೖྗΛճస͢Δͱճసͨ͠ग़ྗͱҰக͢ΔΑ͏ʹͳͬͨ ճసͨ͠ೖྗΛϞσϧʹೖΕΔ࣮ݧ
ճసͨ͠ϥϕϧͱൺֱ ՄࢹԽ & είΞܭࢉ ճసͯ͠είΞಉ͡Ͱ͋ͬͯ΄͍͠
resnest26d Local CV No augmentation: 0.666 (10 epochs) Bad rot
aug: 0.638 0.5 pixel fi xed rot: 0.687 (60 epochs) ͨͿΜ͜ΕͰۜݍʹೖͬͨͷͰɺۜݍҎ্ͷਓΈΜͳ 0.5 pixel ิਖ਼ͯ͠Δͷ͔ͱࢥͬͨɻ Discussion ͰճసTTAͱ͔ݴͬͯͨ͠ɻ ໎ݴʮΘΓͱΈΜͳؾ͍ͨͱࢥ͏͚Ͳʯ 7 / 8 ζϨΛิਖ਼ͨ͠ϞσϧͰείΞ͕৳ͼͯΔ
1. ճస augmentation ͰείΞ͕େ͖͘མͪΔ͕ͣͳ͍ͱͩ͜ΘΔɻগͳ͘ͱؙҰߟ͑ͨ 2. είΞ͕ѱԽ͢ΔϝΧχζϜΛߟ͑Δɻ 3. ࿈ଓ࠲ඪ͔Β pixel ͷ
fl oor ʁʢԾઆʣ 4. ճసͨ͠ೖྗΛϞσϧʹೖΕͯΈΔͱ͍͏࣮ݧʢݕূʣ 5. ཧղ ඈߦػӢ·ͱΊ • ແҙࣝʹͣͬͱॲཧ͞Ε͍ͯΔͷ͔ɺ;ͱؾͮ͘ɻࢄาϩʔυόΠΫ • ͷ݈߁ͷͨΊʹӡಈେࣄ • ߹ʹΑͬͯకΛܾΊͯɺͩ͜ΘΓΛࣺͯͯఫୀ͢Δඞཁ • ॏྗίϯϖͰ͗͢͠Δ͜ͱʹͩ͜Θͬͯɺਖ਼͍͠ಓʹΪϦΪϦͰ෮ؼͰ͖ͨ
🤔 ົͩͳ • ົͩͳ Kaggle ͷʹཱͭ • ົͩͳത࢜՝ఔͰ͑ΒΕͨ • ਖ਼ղΛ୭Βͳ͍ݚڀͰɺ݁Ռ͕ਖ਼͍͔͠ɾཧղͰ͖Δ͔ɾ͓͔͍͠ࣄແ͍͔ٞ͢Δ
• ςΩαεͷࢣঊʹ͑ΒΕͨ • ົͩͳݚڀۀͷʹͨͭ • ͲΜͳ༏ΕͨਓͰʢಛʹઐ֎Ͱʣؒҧ͏ɻAI͕ᘳʹͳͬͯ͏ਓؒෆશ • ົͩͳҧײʹؾ͖ͮɺṖΛղ͍ͯɺؒҧ͍Λగਖ਼͢Δྗ • ࣗͷؒҧ͍͋ΕνʔϜͷऑΧόʔ͢Δ͜ͱ • ࠷ॳͷҧײ͕͍͠ɻֶΜͰߟ͑ͨܦݧ 24 + 1 min
1000ԁࡳͰλόί1ݸ… ົͩͳ… ໊୳ఁίφϯ (18) ੨ࢁ߶ণ গαϯσʔίϛοΫε
None