Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
定規とコンパスと ChainerRL
Search
horiem
June 09, 2018
Technology
0
1.1k
定規とコンパスと ChainerRL
強化学習を使って作図問題を解く
Chainer Meetup #07, 9th Jun 2018
horiem
June 09, 2018
Tweet
Share
More Decks by horiem
See All by horiem
局所保存性・相似変換対称性を満たす機械学習モデルによる数値流体力学
yellowshippo
1
89
ICML 読み会: Graph Neural PDE Solvers with Conservation and Similarity-Equivariance
yellowshippo
0
300
物理シミュレーションと数理最適化の知見を導入した機械学習手法
yellowshippo
1
1.7k
対称性のある機械学習による物理現象の解析
yellowshippo
4
2.4k
Physics-Embedded Neural Networks: Graph Neural PDE Solvers with Mixed Boundary Conditions
yellowshippo
1
620
物理現象の性質を反映させたグラフニューラルネットワークによる偏微分方程式の学習
yellowshippo
2
1.1k
物理シミュレーションの機械学習 に関する近年の動向と研究紹介
yellowshippo
4
14k
有限要素法を機械学習したい!
yellowshippo
0
3.7k
Julia と微分方程式でメシを食う
yellowshippo
1
2.9k
Other Decks in Technology
See All in Technology
Wantedly での Datadog 活用事例
bgpat
1
510
podman_update_2024-12
orimanabu
1
280
re:Invent 2024 Innovation Talks(NET201)で語られた大切なこと
shotashiratori
0
310
Amazon VPC Lattice 最新アップデート紹介 - PrivateLink も似たようなアップデートあったけど違いとは
bigmuramura
0
200
祝!Iceberg祭開幕!re:Invent 2024データレイク関連アップデート10分総ざらい
kniino
3
310
DevFest 2024 Incheon / Songdo - Compose UI 조합 심화
wisemuji
0
110
KubeCon NA 2024 Recap: How to Move from Ingress to Gateway API with Minimal Hassle
ysakotch
0
210
alecthomas/kong はいいぞ / kamakura.go#7
fujiwara3
1
300
Turing × atmaCup #18 - 1st Place Solution
hakubishin3
0
490
レンジャーシステムズ | 会社紹介(採用ピッチ)
rssytems
0
150
5分でわかるDuckDB
chanyou0311
10
3.2k
WACATE2024冬セッション資料(ユーザビリティ)
scarletplover
0
210
Featured
See All Featured
Art, The Web, and Tiny UX
lynnandtonic
298
20k
[Rails World 2023 - Day 1 Closing Keynote] - The Magic of Rails
eileencodes
33
2k
Git: the NoSQL Database
bkeepers
PRO
427
64k
A better future with KSS
kneath
238
17k
GraphQLとの向き合い方2022年版
quramy
44
13k
The Psychology of Web Performance [Beyond Tellerrand 2023]
tammyeverts
45
2.2k
Documentation Writing (for coders)
carmenintech
66
4.5k
Building a Scalable Design System with Sketch
lauravandoore
460
33k
A Modern Web Designer's Workflow
chriscoyier
693
190k
Rebuilding a faster, lazier Slack
samanthasiow
79
8.7k
Being A Developer After 40
akosma
87
590k
Side Projects
sachag
452
42k
Transcript
ఆنͱίϯύεͱ ChainerRL Chainer Meetup #07, 9th Jun 2018 horiem@yellowshippo
ChainerRL Ͱ
࡞ਤΛղ͖͍ͨ
࡞ਤ • ఆنͱίϯύε͚ͩΛͬͯతͷਤܗΛඳ͘ http://mathworld.wolfram.com/GeometricConstruction.html
σϞ
ͷલʹ
ਤͷݟํ ֶशϞσϧʹ͢ใ ʢObservationʣ ਓؒ༻ తͷਤܗ ར༻Մೳͳ
σϞ
શମ૾ ڥ ΤʔδΣϯτ ߦಈ ؍ଌ
શମ૾ ڥ ΤʔδΣϯτ [p0_x, p0_y] [p1_x, p1_y] …… ը૾ ͷใ
[shape_flag, pi, pj] ৽͍͠ਤܗ
ωοτϫʔΫΞʔΩςΫνϟ Conv MLP MLP Conv MLP [p0_x, p0_y] [p1_x, p1_y]
…… ը૾ ͷใ [shape_flag, pi, pj] ৽͍͠ਤܗ
ωοτϫʔΫΞʔΩςΫνϟ (100, 100) (12, 3) Conv MLP MLP Conv MLP
(2, 12, 12) [p0_x, p0_y] [p1_x, p1_y] …… ը૾ ͷใ [shape_flag, pi, pj] ৽͍͠ਤܗ
ωοτϫʔΫΞʔΩςΫνϟ (100, 100) (12, 3) Conv MLP MLP Conv MLP
(2, 12, 12) [p0_x, p0_y] [p1_x, p1_y] …… ը૾ ͷใ [shape_flag, pi, pj] ৽͍͠ਤܗ = 288
ࢥͬͨ͜ͱͳͲ • ڧԽֶशͬͨ͜ͱͳ͔͚ͬͨͲָ͍͠ • ChainerRL ϥΫͰΑ͍ • ߦಈۭ͕ؒେ͖͍ͷͰݮΒ͍ͨ͠ • AlphaGO
͕ࢀߟʹͳΔ͔ʁ • ίʔυ͖Ε͍ʹͨ͠Βެ։ && ղઆ͠·͢ • n ࣍ํఔࣜΛ ChainerRL Ͱղ͚Δ͔ʁ • ՝֎׆ಈ͖ͳਓɺҰॹʹΓ·͠ΐ͏ʂ