Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
定規とコンパスと ChainerRL
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
horiem
June 09, 2018
Technology
1.2k
0
Share
定規とコンパスと ChainerRL
強化学習を使って作図問題を解く
Chainer Meetup #07, 9th Jun 2018
horiem
June 09, 2018
More Decks by horiem
See All by horiem
Continuous Simplicial Neural Networks
yellowshippo
1
720
局所保存性・相似変換対称性を満たす機械学習モデルによる数値流体力学
yellowshippo
1
400
ICML 読み会: Graph Neural PDE Solvers with Conservation and Similarity-Equivariance
yellowshippo
1
580
物理シミュレーションと数理最適化の知見を導入した機械学習手法
yellowshippo
1
1.8k
対称性のある機械学習による物理現象の解析
yellowshippo
5
3.2k
Physics-Embedded Neural Networks: Graph Neural PDE Solvers with Mixed Boundary Conditions
yellowshippo
1
810
物理現象の性質を反映させたグラフニューラルネットワークによる偏微分方程式の学習
yellowshippo
2
1.2k
物理シミュレーションの機械学習 に関する近年の動向と研究紹介
yellowshippo
4
15k
有限要素法を機械学習したい!
yellowshippo
0
3.9k
Other Decks in Technology
See All in Technology
サイボウズ 開発本部採用ピッチ / Cybozu Engineer Recruit
cybozuinsideout
PRO
10
78k
音声言語モデル手法に関する発表の紹介
kzinmr
0
110
コードや知識を組み込む / Incorporate Code and Knowledge
ks91
PRO
0
160
みんなの「データ活用」を支えるストレージ担当から持ち込むAWS活用/コミュニティー設計TIPS 10選~「作れる」より、「続けられる」設計へ~
yoshiki0705
0
250
社内エンジニア勉強会の醍醐味と苦しみ/tamadev
nishiuma
0
220
「誰一人取り残されない」 AIエージェント時代のプロダクト設計思想 Product Management Summit 2026
mizushimac
1
420
Route 53 Global Resolver で高額課金発生!
otanikohei2023
0
110
20年前の「OSS革命」に学ぶ AI時代の生存戦略
samakada
0
450
クラウドネイティブな開発 ~ 認知負荷に立ち向かうためのコンテナ活用
literalice
0
140
ServiceNow Knowledge 26 の歩き方
manarobot
0
110
エージェントスキルを作って自分のインプットに役立てよう
tsubakimoto_s
0
390
Keeping Ruby Running on Cygwin
fd0
0
170
Featured
See All Featured
How to optimise 3,500 product descriptions for ecommerce in one day using ChatGPT
katarinadahlin
PRO
1
3.5k
Why Our Code Smells
bkeepers
PRO
340
58k
The State of eCommerce SEO: How to Win in Today's Products SERPs - #SEOweek
aleyda
2
10k
Believing is Seeing
oripsolob
1
110
YesSQL, Process and Tooling at Scale
rocio
174
15k
Fantastic passwords and where to find them - at NoRuKo
philnash
52
3.6k
Producing Creativity
orderedlist
PRO
348
40k
How to Talk to Developers About Accessibility
jct
2
180
Taking LLMs out of the black box: A practical guide to human-in-the-loop distillation
inesmontani
PRO
3
2.1k
svc-hook: hooking system calls on ARM64 by binary rewriting
retrage
2
220
Distributed Sagas: A Protocol for Coordinating Microservices
caitiem20
333
22k
Connecting the Dots Between Site Speed, User Experience & Your Business [WebExpo 2025]
tammyeverts
11
890
Transcript
ఆنͱίϯύεͱ ChainerRL Chainer Meetup #07, 9th Jun 2018 horiem@yellowshippo
ChainerRL Ͱ
࡞ਤΛղ͖͍ͨ
࡞ਤ • ఆنͱίϯύε͚ͩΛͬͯతͷਤܗΛඳ͘ http://mathworld.wolfram.com/GeometricConstruction.html
σϞ
ͷલʹ
ਤͷݟํ ֶशϞσϧʹ͢ใ ʢObservationʣ ਓؒ༻ తͷਤܗ ར༻Մೳͳ
σϞ
શମ૾ ڥ ΤʔδΣϯτ ߦಈ ؍ଌ
શମ૾ ڥ ΤʔδΣϯτ [p0_x, p0_y] [p1_x, p1_y] …… ը૾ ͷใ
[shape_flag, pi, pj] ৽͍͠ਤܗ
ωοτϫʔΫΞʔΩςΫνϟ Conv MLP MLP Conv MLP [p0_x, p0_y] [p1_x, p1_y]
…… ը૾ ͷใ [shape_flag, pi, pj] ৽͍͠ਤܗ
ωοτϫʔΫΞʔΩςΫνϟ (100, 100) (12, 3) Conv MLP MLP Conv MLP
(2, 12, 12) [p0_x, p0_y] [p1_x, p1_y] …… ը૾ ͷใ [shape_flag, pi, pj] ৽͍͠ਤܗ
ωοτϫʔΫΞʔΩςΫνϟ (100, 100) (12, 3) Conv MLP MLP Conv MLP
(2, 12, 12) [p0_x, p0_y] [p1_x, p1_y] …… ը૾ ͷใ [shape_flag, pi, pj] ৽͍͠ਤܗ = 288
ࢥͬͨ͜ͱͳͲ • ڧԽֶशͬͨ͜ͱͳ͔͚ͬͨͲָ͍͠ • ChainerRL ϥΫͰΑ͍ • ߦಈۭ͕ؒେ͖͍ͷͰݮΒ͍ͨ͠ • AlphaGO
͕ࢀߟʹͳΔ͔ʁ • ίʔυ͖Ε͍ʹͨ͠Βެ։ && ղઆ͠·͢ • n ࣍ํఔࣜΛ ChainerRL Ͱղ͚Δ͔ʁ • ՝֎׆ಈ͖ͳਓɺҰॹʹΓ·͠ΐ͏ʂ