DeepRacer for learning RL
貞松政史
April 06, 2019
2019.4.6 Developers.IO at OKAYAMA.
Transcript
Attention
3 #cmdevio2019
4
5 DeepRacer
6
7
8 …
9 DeepRacer
10 DeepRacer
11
12 DeepRacer
13
14 DeepRacer: 1/18 scale race car, 3D racing simulator, AWS DeepRacer League
15 DeepRacer https://aws.amazon.com/jp/deepracer/
16 DeepRacer
17 3D simulator: AWS RoboMaker, Robot Operating System (ROS), Gazebo, rqt
18 AWS DeepRacer League https://aws.amazon.com/jp/deepracer/league/
19
20 Artificial Intelligence (AI), Machine Learning (ML), Neural Network, Deep Learning
21
22
23
24 DeepRacer uses Clipped PPO. PPO (Proximal Policy Optimization) was published by OpenAI in 2017.
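To make the "clipped" part of Clipped PPO concrete, here is a minimal sketch of the per-sample clipped surrogate objective from the PPO paper. It is an illustration in plain Python, not DeepRacer's actual training code; the function names and the default `epsilon=0.2` are assumptions chosen for the example.

```python
def clipped_surrogate(ratio, advantage, epsilon=0.2):
    """Per-sample PPO clipped surrogate objective (a value to maximize).

    ratio     : pi_new(a|s) / pi_old(a|s), the probability ratio
    advantage : estimated advantage of the action taken
    epsilon   : clip range (0.2 is an illustrative default, an assumption)
    """
    # Keep the ratio inside [1 - epsilon, 1 + epsilon].
    clipped_ratio = max(1.0 - epsilon, min(ratio, 1.0 + epsilon))
    # Take the more pessimistic of the unclipped and clipped terms.
    return min(ratio * advantage, clipped_ratio * advantage)


def mean_clipped_objective(ratios, advantages, epsilon=0.2):
    """Average the clipped objective over a batch of samples."""
    values = [clipped_surrogate(r, a, epsilon) for r, a in zip(ratios, advantages)]
    return sum(values) / len(values)
```

With a positive advantage, the objective stops growing once the ratio exceeds `1 + epsilon`, so a single update has no incentive to move the policy far from the old one.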
25
26
27
28 DeepRacer
29 DeepRacer
30 DeepRacer
31 DeepRacer …
32 DeepRacer
33 orz
34 DeepRacer
35 DeepRacer
36
37 https://docs.aws.amazon.com/ja_jp/deepracer/latest/developerguide/deepracer-train-models-define-reward-function.html
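As a rough illustration of what a reward function for this guide can look like, the sketch below rewards staying near the center line. It assumes the `params` dictionary form of `reward_function` with the keys `track_width`, `distance_from_center`, and `all_wheels_on_track`; the console version available at the time of this talk may have passed its inputs differently, and this is not the reward function used in the slides.

```python
def reward_function(params):
    """Reward the agent for staying close to the center line.

    Assumes the params-dict interface; the scoring rule itself is an
    illustrative choice, not the one from the talk.
    """
    track_width = params["track_width"]
    distance_from_center = params["distance_from_center"]
    all_wheels_on_track = params["all_wheels_on_track"]

    # Near-zero reward when the car has left the track.
    if not all_wheels_on_track:
        return 1e-3

    # Linear falloff: 1.0 on the center line, approaching 0 at the track edge.
    half_width = 0.5 * track_width
    closeness = max(0.0, 1.0 - distance_from_center / half_width)
    return float(max(1e-3, closeness))
```

A quick local check before pasting it into the console could be `reward_function({"track_width": 1.0, "distance_from_center": 0.25, "all_wheels_on_track": True})`.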
38
39
40 SageMaker RL + RoboMaker
41 SageMaker RL and RoboMaker are generally available (GA)
42 SageMaker “RL”
43 DeepRacer
44 SageMaker https://dev.classmethod.jp/machine-learning/sagemaker-robomaker-deepracer-sample/
45 https://github.com/awslabs/amazon-sagemaker-examples
46 Jupyter
47
48
49 2 1
50 2 2
51
52 https://docs.aws.amazon.com/ja_jp/deepracer/latest/developerguide/deepracer-iteratively-enhance-reward-functions.html
53 Best Practices when training with PPO
(Unity Technologies) https://github.com/Unity-Technologies/ml-agents/blob/master/docs/Training-PPO.md
54 DeepRacer
55 DeepRacer
56 DeepRacer
57 DeepRacer
58 DeepRacer
59 DeepRacer
60
61
62 DeepRacer