Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Lux AI 34th Place Solution
Search
Kyohei Uto
December 28, 2021
Technology
430
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
Lux AI 34th Place Solution
My 34th place solution in Lux AI competition @kaggle
https://www.kaggle.com/c/lux-ai-2021
Kyohei Uto
December 28, 2021
More Decks by Kyohei Uto
See All by Kyohei Uto
Kaggle過去コンペ上位解法をAIエージェントでレポートする
kuto5046
5
3.6k
Kaggle - Lux AI season3 9th place solution
kuto5046
0
210
kaggle Eedi solution
kuto5046
0
390
Kaggle Eediコンペ振り返り
kuto5046
7
2k
CMI 13th place solution
kuto5046
0
290
Kaggle H&Mコンペ振り返り
kuto5046
0
3.3k
Kaggleシミュレーションコンペで強化学習に取り組むときのTips
kuto5046
22
12k
タクシー予約を支えるMLモデルの継続的改善
kuto5046
1
3.9k
H&M 23th place solution
kuto5046
0
520
Other Decks in Technology
See All in Technology
Claude Codeとのおしゃべりでセマンティックモデルの定義からダッシュボード作成まで完成させる
nic_sugiyama
0
110
Oracle AI Database@Azure:サービス概要のご紹介
oracle4engineer
PRO
6
2k
SONiCのLinuxベースを活かしたZabbix監視
sonic
0
180
Socrates × Looker 〜セマンティックレイヤーで進化するデータ分析エージェント〜
hanon52_
3
2.4k
ACE-Step-1.5で見る 音楽生成AIのしくみと“破綻だけ直す”Retake機能の開発【zennfes spring 2026 登壇資料】
personabb
1
480
新しいVibe Codingと”自走”について
watany
6
330
【セミナー資料】Claude Code をセキュアに使うための考え方と設定の勘どころ / Claude Code Webinar 20260616
masahirokawahara
2
360
エラーバジェットのアラートのタイミングを考える.pdf
kairim0
0
150
【2026年版】 ベクトル検索䛸 Embedding最前線
mocobeta
0
180
Claude Code の Sandbox 機能を Anthropic Sandbox Runtime(srt) で試そう!/lets-play-anthropic-sandbox-runtime
tomoki10
1
620
アンオフィシャルな、オフィシャルからのお願い
wyamazak_devrel
0
110
AmazonRoute 53ではじめてのドメイン取得!HTTPS化までの道のりを整理してみた
usanchuu
3
140
Featured
See All Featured
GraphQLの誤解/rethinking-graphql
sonatard
75
12k
Building the Perfect Custom Keyboard
takai
2
790
Sharpening the Axe: The Primacy of Toolmaking
bcantrill
46
2.9k
Mozcon NYC 2025: Stop Losing SEO Traffic
samtorres
1
250
A better future with KSS
kneath
240
18k
Getting science done with accelerated Python computing platforms
jacobtomlinson
2
230
Paper Plane (Part 1)
katiecoart
PRO
0
9k
Music & Morning Musume
bryan
47
7.2k
How to audit for AI Accessibility on your Front & Back End
davetheseo
0
430
ピンチをチャンスに:未来をつくるプロダクトロードマップ #pmconf2020
aki_iinuma
128
56k
So, you think you're a good person
axbom
PRO
2
2.1k
How To Speak Unicorn (iThemes Webinar)
marktimemedia
1
480
Transcript
Lux AI Challenge Copyright 2021 @kuto_bopro Meta Kaggle Collection of
episodes ・Team: Toad Brigade ・LB score > 1900 ・only win game ・about 1000 episodes(3 submissions) Unet Imitation Learning approach inspired by nosound(@zharch) obs horizontal flip vertical flip random roll(-5~5) TTA obs obs Global features (8ch,4,4) Observation map (17ch, 32, 32) Policy map (3ch,32,32) ・Units counts (×2) ・Citytiles counts (×2) ・Research points (×2) ・turn / cycle Data Sampling ・Random sampling up to 4 units actions in each turn ・Downsampling center actions Extract units policy from each units position Image reference: https://www.lux-ai.org/ ・Units position/cooldown/resource (×2) ・Citytiles position/cooldown/fuel-lightupkeep ratio (×2) ・Wood/Coal/Uranium positions ・Road level ・Effective map area Create 8 pattern policy maps and apply mean UNet model Decide citytile actions by simple rule Create 4 batch by rotation input (4 batches) Policy maps (4batch, 3ch, 32, 32) Final policy map (6ch, 32, 32) Hierarchize move actions (shared by nosound) output 3ch policy map (4 batches) 90° 180° 270° 90° 180° 270° 0ch: Center Action → batch mean 1ch: Move Action 1st batch: north 2nd batch: west 3rd batch: south 4th batch: east 2ch: Build City Action → batch mean kuto(@kuto0633) Final policy map (6ch,32,32) Observation maps(4batch, 17ch, 32,32) 0ch: Move Center 1ch: Move North 2ch: Move West 3ch: Move South 4ch: Move East 5ch: Build City Calculate 4 move actions as one direction State Value (for RL and MCTS but not work) 16 64 64 128 128 256 256 256 256 8 256 256 +8 256 256 128 + 256 128 128 64 + 128 64 64 3 32×32 32×32 32×32 16×16 16×16 16×16 8×8 8×8 8×8 4×4 4×4 4×4 32×32 32×32 32×32 16×16 16×16 8×8 8×8 FC BN ReLU FC 264→64 64→1 Conv2d BatchNorm2d, ReLU MaxPooling2d Upsample Concatenate Private LB: 34th (score 1570)