Ruler, Compass, and ChainerRL (定規とコンパスと ChainerRL)
horiem
June 09, 2018
Technology
Solving geometric construction problems with reinforcement learning
Chainer Meetup #07, 9th Jun 2018
Transcript
Ruler, Compass, and ChainerRL
Chainer Meetup #07, 9th Jun 2018
horiem@yellowshippo
I want to solve geometric construction problems with ChainerRL
Geometric construction
• Draw a target figure using only a ruler and a compass
http://mathworld.wolfram.com/GeometricConstruction.html
Before the demo
How to read the figure: information passed to the learning model (Observation); for human viewers; target figure; available points
Demo
Overview: Environment, Agent, Action, Observation
Overview: Environment, Agent. Observation: image and point information ([p0_x, p0_y], [p1_x, p1_y], ……). Action: [shape_flag, pi, pj], which yields a new figure.
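The action format on this slide can be illustrated with a small sketch. It assumes, purely for illustration, that shape_flag 0 means "line through points pi and pj" and shape_flag 1 means "circle centered at pi passing through pj". The slides do not define the encoding, and the function name apply_action is hypothetical, not part of ChainerRL.

```python
import math
from typing import List, Tuple

Point = Tuple[float, float]


def apply_action(points: List[Point], action: Tuple[int, int, int]):
    """Turn an action [shape_flag, pi, pj] into a new figure.

    Assumed encoding (not confirmed by the slides):
      shape_flag == 0 -> line through points[pi] and points[pj],
                         returned as (a, b, c) with a*x + b*y = c
      shape_flag == 1 -> circle centered at points[pi] passing through
                         points[pj], returned as (cx, cy, r)
    """
    shape_flag, i, j = action
    if i == j:
        raise ValueError("two distinct points are required")
    (xi, yi), (xj, yj) = points[i], points[j]
    if shape_flag == 0:
        # Normal form of the line through both points.
        a = yj - yi
        b = xi - xj
        c = a * xi + b * yi
        return ("line", (a, b, c))
    if shape_flag == 1:
        # Radius is the distance between the two points.
        r = math.hypot(xj - xi, yj - yi)
        return ("circle", (xi, yi, r))
    raise ValueError("shape_flag must be 0 or 1")
```

In an environment built this way, the step function would add the returned figure to the drawing and then render the updated image for the next observation.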
Network architecture (diagram blocks: Conv, MLP, MLP, Conv, MLP). Inputs: image and point information ([p0_x, p0_y], [p1_x, p1_y], ……). Output: [shape_flag, pi, pj], which yields a new figure.
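Network architecture with tensor shapes (diagram blocks: Conv, MLP, MLP, Conv, MLP). Inputs: image (100, 100) and point information (12, 3) ([p0_x, p0_y], [p1_x, p1_y], ……). Output: (2, 12, 12) for [shape_flag, pi, pj], which yields a new figure.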
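Network architecture with tensor shapes (diagram blocks: Conv, MLP, MLP, Conv, MLP). Inputs: image (100, 100) and point information (12, 3) ([p0_x, p0_y], [p1_x, p1_y], ……). Output: (2, 12, 12) = 288 for [shape_flag, pi, pj], which yields a new figure.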
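The count on this slide follows from the output shape: 2 × 12 × 12 = 288 discrete actions. Below is a minimal sketch of converting between a flat action index, as a discrete-action agent would emit, and the [shape_flag, pi, pj] triple. It assumes row-major ordering; the function names are hypothetical, and the indexing actually used by the author is not shown in the slides.

```python
ACTION_SHAPE = (2, 12, 12)  # (shape_flag, pi, pj), taken from the slide
N_ACTIONS = ACTION_SHAPE[0] * ACTION_SHAPE[1] * ACTION_SHAPE[2]  # 288


def decode_action(index: int):
    """Map a flat index in [0, 288) to (shape_flag, pi, pj), row-major."""
    if not 0 <= index < N_ACTIONS:
        raise ValueError("action index out of range")
    n_points = ACTION_SHAPE[2]
    shape_flag, rest = divmod(index, ACTION_SHAPE[1] * n_points)
    pi, pj = divmod(rest, n_points)
    return shape_flag, pi, pj


def encode_action(shape_flag: int, pi: int, pj: int) -> int:
    """Inverse of decode_action."""
    return (shape_flag * ACTION_SHAPE[1] + pi) * ACTION_SHAPE[2] + pj
```

For example, index 145 decodes to shape_flag 1, pi 0, pj 1, because 145 = 1 × 144 + 0 × 12 + 1.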
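Thoughts and next steps
• I had never done reinforcement learning before, but it is fun
• ChainerRL is easy to use, which is nice
• The action space is large, so I would like to reduce it
• Could AlphaGo serve as a reference?
• Once the code is cleaned up, I will publish it && write an explanation
• Can degree-n equations be solved with ChainerRL?
• If you enjoy extracurricular projects, let's work on this together!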
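As one illustration of how the 288-action space might be reduced (not the author's method, which the slides do not describe), redundant and degenerate actions can be dropped. The sketch assumes shape_flag 0 is a line through two points, so the order of the points is irrelevant, and shape_flag 1 is a circle centered at the first point passing through the second, so the order matters. Both the encoding and the function name valid_actions are assumptions.

```python
from itertools import combinations, permutations

N_POINTS = 12  # number of points, taken from the (2, 12, 12) output shape


def valid_actions(n_points: int = N_POINTS):
    """List (shape_flag, pi, pj) triples without duplicates or pi == pj.

    Assumed encoding: 0 = line (unordered pair), 1 = circle (ordered pair).
    """
    lines = [(0, i, j) for i, j in combinations(range(n_points), 2)]
    circles = [(1, i, j) for i, j in permutations(range(n_points), 2)]
    return lines + circles


actions = valid_actions()
print(len(actions))  # 66 lines + 132 circles = 198
```

Under these assumptions the agent would choose among 198 actions instead of 288. Masking actions that refer to points not yet present in the drawing would shrink the set further in early steps.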