Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
定規とコンパスと ChainerRL
Search
Sponsored
·
SiteGround - Reliable hosting with speed, security, and support you can count on.
→
horiem
June 09, 2018
Technology
0
1.2k
定規とコンパスと ChainerRL
強化学習を使って作図問題を解く
Chainer Meetup #07, 9th Jun 2018
horiem
June 09, 2018
Tweet
Share
More Decks by horiem
See All by horiem
Continuous Simplicial Neural Networks
yellowshippo
1
680
局所保存性・相似変換対称性を満たす機械学習モデルによる数値流体力学
yellowshippo
1
370
ICML 読み会: Graph Neural PDE Solvers with Conservation and Similarity-Equivariance
yellowshippo
1
540
物理シミュレーションと数理最適化の知見を導入した機械学習手法
yellowshippo
1
1.8k
対称性のある機械学習による物理現象の解析
yellowshippo
5
3.1k
Physics-Embedded Neural Networks: Graph Neural PDE Solvers with Mixed Boundary Conditions
yellowshippo
1
770
物理現象の性質を反映させたグラフニューラルネットワークによる偏微分方程式の学習
yellowshippo
2
1.2k
物理シミュレーションの機械学習 に関する近年の動向と研究紹介
yellowshippo
4
15k
有限要素法を機械学習したい!
yellowshippo
0
3.9k
Other Decks in Technology
See All in Technology
CDKで始めるTypeScript開発のススメ
tsukuboshi
1
210
Azure SRE Agent x PagerDutyによる近未来インシデント対応への期待 / The Future of Incident Response: Azure SRE Agent x PagerDuty
aeonpeople
0
260
AI時代、1年目エンジニアの悩み
jin4
1
140
メルカリのAI活用を支えるAIセキュリティ
s3h
8
5.8k
名刺メーカーDevグループ 紹介資料
sansan33
PRO
0
1k
しろおびセキュリティへ ようこそ
log0417
0
240
いよいよ仕事を奪われそうな波が来たぜ
kazzpapa3
3
320
Sansan Engineering Unit 紹介資料
sansan33
PRO
1
3.8k
Introduction to Sansan for Engineers / エンジニア向け会社紹介
sansan33
PRO
6
66k
EventBridge API Destination × AgentCore Runtimeで実現するLambdaレスなイベント駆動エージェント
har1101
7
290
セキュリティについて学ぶ会 / 2026 01 25 Takamatsu WordPress Meetup
rocketmartue
1
230
Kubecon NA 2025: DRA 関連の Recap と社内 GPU 基盤での課題
kevin_namba
0
110
Featured
See All Featured
Building the Perfect Custom Keyboard
takai
2
680
How to make the Groovebox
asonas
2
1.9k
What's in a price? How to price your products and services
michaelherold
247
13k
The Power of CSS Pseudo Elements
geoffreycrofte
80
6.1k
What’s in a name? Adding method to the madness
productmarketing
PRO
24
3.9k
GraphQLの誤解/rethinking-graphql
sonatard
74
11k
Writing Fast Ruby
sferik
630
62k
Beyond borders and beyond the search box: How to win the global "messy middle" with AI-driven SEO
davidcarrasco
1
46
How to Build an AI Search Optimization Roadmap - Criteria and Steps to Take #SEOIRL
aleyda
1
1.9k
Exploring anti-patterns in Rails
aemeredith
2
240
Jess Joyce - The Pitfalls of Following Frameworks
techseoconnect
PRO
1
62
Are puppies a ranking factor?
jonoalderson
1
2.7k
Transcript
ఆنͱίϯύεͱ ChainerRL Chainer Meetup #07, 9th Jun 2018 horiem@yellowshippo
ChainerRL Ͱ
࡞ਤΛղ͖͍ͨ
࡞ਤ • ఆنͱίϯύε͚ͩΛͬͯతͷਤܗΛඳ͘ http://mathworld.wolfram.com/GeometricConstruction.html
σϞ
ͷલʹ
ਤͷݟํ ֶशϞσϧʹ͢ใ ʢObservationʣ ਓؒ༻ తͷਤܗ ར༻Մೳͳ
σϞ
શମ૾ ڥ ΤʔδΣϯτ ߦಈ ؍ଌ
શମ૾ ڥ ΤʔδΣϯτ [p0_x, p0_y] [p1_x, p1_y] …… ը૾ ͷใ
[shape_flag, pi, pj] ৽͍͠ਤܗ
ωοτϫʔΫΞʔΩςΫνϟ Conv MLP MLP Conv MLP [p0_x, p0_y] [p1_x, p1_y]
…… ը૾ ͷใ [shape_flag, pi, pj] ৽͍͠ਤܗ
ωοτϫʔΫΞʔΩςΫνϟ (100, 100) (12, 3) Conv MLP MLP Conv MLP
(2, 12, 12) [p0_x, p0_y] [p1_x, p1_y] …… ը૾ ͷใ [shape_flag, pi, pj] ৽͍͠ਤܗ
ωοτϫʔΫΞʔΩςΫνϟ (100, 100) (12, 3) Conv MLP MLP Conv MLP
(2, 12, 12) [p0_x, p0_y] [p1_x, p1_y] …… ը૾ ͷใ [shape_flag, pi, pj] ৽͍͠ਤܗ = 288
ࢥͬͨ͜ͱͳͲ • ڧԽֶशͬͨ͜ͱͳ͔͚ͬͨͲָ͍͠ • ChainerRL ϥΫͰΑ͍ • ߦಈۭ͕ؒେ͖͍ͷͰݮΒ͍ͨ͠ • AlphaGO
͕ࢀߟʹͳΔ͔ʁ • ίʔυ͖Ε͍ʹͨ͠Βެ։ && ղઆ͠·͢ • n ࣍ํఔࣜΛ ChainerRL Ͱղ͚Δ͔ʁ • ՝֎׆ಈ͖ͳਓɺҰॹʹΓ·͠ΐ͏ʂ