Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
DeepRacer for learning RL
Search
貞松政史
April 06, 2019
Technology
0
1.4k
DeepRacer for learning RL
2019.4.6 Developers.IO at OKAYAMA.
貞松政史
April 06, 2019
Tweet
Share
More Decks by 貞松政史
See All by 貞松政史
Amazon Forecast亡き今、我々がマネージドサービスに頼らず時系列予測を実行する方法
sadynitro
0
1.2k
今日のハイライトをシステマティックに
sadynitro
1
84
はじめてのレコメンド〜Amazon Personalizeを使った推薦システム超超超入門〜
sadynitro
2
2.5k
予知保全利用を目指した外観検査AIの開発 〜画像処理AIを用いた外観画像に対する異常検知〜
sadynitro
0
1.2k
20230904_GoogleCloudNext23_Recap_AI_ML
sadynitro
0
930
Foundation Model全盛時代を生きるAI/MLエンジニアの生存戦略
sadynitro
0
1k
Amazon SageMakerが存在しない世界線 のAWS上で実現する機械学習基盤
sadynitro
0
310
Amazon SageMakerが存在しない世界線のAWS上で実現する機械学習基盤
sadynitro
0
2.1k
みんな大好き強化学習
sadynitro
0
1.3k
Other Decks in Technology
See All in Technology
AIエージェント、 社内展開の前に知っておきたいこと
oracle4engineer
PRO
2
140
Google系サービスで文字起こしから勝手にカレンダーを埋めるエージェントを作った話
risatube
0
180
GCASアップデート(202601-202603)
techniczna
0
130
JAWS Days 2026 楽しく学ぼう! 認証認可 入門/20260307-jaws-days-novice-lane-auth
opelab
11
2.3k
[JAWSDAYS2026]Who is responsible for IAM
mizukibbb
0
680
身体を持ったパーソナルAIエージェントの 可能性を探る開発
yokomachi
1
120
最強のAIエージェントを諦めたら品質が上がった話 / how quality improved after giving up on the strongest AI agent
kt2mikan
0
190
OCI Security サービス 概要
oracle4engineer
PRO
2
13k
AI時代の「本当の」ハイブリッドクラウド — エージェントが実現した、あの頃の夢
ebibibi
0
120
[JAWSDAYS2026][D8]その起票、愛が足りてますか?AWSサポートを味方につける、技術的「ラブレター」の書き方
hirosys_
3
180
Abuse report だけじゃない。AWS から緊急連絡が来る状況とは?昨今の攻撃や被害の事例の紹介と備えておきたい考え方について
kazzpapa3
1
720
Scrumは歪む — 組織設計の原理原則
dashi
0
180
Featured
See All Featured
The Language of Interfaces
destraynor
162
26k
Large-scale JavaScript Application Architecture
addyosmani
515
110k
Become a Pro
speakerdeck
PRO
31
5.8k
More Than Pixels: Becoming A User Experience Designer
marktimemedia
3
350
How to train your dragon (web standard)
notwaldorf
97
6.6k
The Power of CSS Pseudo Elements
geoffreycrofte
82
6.2k
Making Projects Easy
brettharned
120
6.6k
How STYLIGHT went responsive
nonsquared
100
6k
Agile that works and the tools we love
rasmusluckow
331
21k
Done Done
chrislema
186
16k
個人開発の失敗を避けるイケてる考え方 / tips for indie hackers
panda_program
122
21k
Darren the Foodie - Storyboard
khoart
PRO
3
2.9k
Transcript
4 D 29 26 1 . 0 I 1
& .-* (2 ,0'/4"# 51 83;7 +)
!&%$9( 6: Attention
3 #cmdevio2019
4 os t m ( L @S g E b
i _d L rMI D @ E ( ( ( ) ( e a n k AWS E
5 DeepRacer
6 D
7 ) (
8 …
9 DeepRacer 4 D 26 9 01 .
10 DeepRacer A A A
11
1 2 3
12 DeepRacer
13
14 DeepRacer 1/18
3D AWS DeepRacer League
15 DeepRacer https://aws.amazon.com/jp/deepracer/
16 DeepRacer ! &%$ +)*2 1 '/*2
(#-, 0. "
17 3D AWS RoboMaker Robot Operating System (ROS) Gazebo rqt
18 AWS DeepRacer League ⁻ 0 1 : 9 A
⁻ 9 2 R ⁻ ⁻ D I ⁻ 1 2 ⁻ https://aws.amazon.com/jp/deepracer/league/
19
20 (Artificial Intelligence, AI) (Machine Learning, ML)
NeuralNetwork DeepLearning
21
22 = 1 (
) ( (
23 L N - ) ( - D Q
24 DeepRacer Cliped PPO PPO (Proximal
Policy Optimization) OpenAI2017
25 ( ( )
)
26 1
27 ) () (
28 DeepRacer
29 DeepRacer + + +
30 DeepRacer
31 DeepRacer …
32 DeepRacer
33 orz
34 DeepRacer + + +
35 DeepRacer D A D
36 $ ' + (# &!
%"
37 ( ) ) https://docs.aws.amazon.com/ja_jp /deepracer/latest/developerguide/ deepracer-train-models-define- reward-function.html
38
39 ⁻ 10 ⁻
:
40 SageMaker RL + RoboMaker
41 SageMeker RLRoboMakerGA
42 SageMaker “RL” ⁻ ⁻ M ⁻ M M
⁻ M ⁻ ⁻ ⁻ J M S
43 DeepRacer ) D ( ) ) ( )
44 SageMaker
https://dev.classmethod.jp/machine -learning/sagemaker-robomaker- deepracer-sample/
45 $# "
! https://github.com/awslabs/amazon-sagemaker-examples
46 Jupyter !
47 ( ( )
48
49 2 1
50 2 2
51 (. ( )(
52 $2# " /1(+ $2#,- ! https://docs.aws.amazon.com/ja_jp/deepracer/latest/developerguide/deepracer -iteratively-enhance-reward-functions.html *
$2%) '.0& )
53 Best Practices when training with PPO
(Unity Technologies) https://github.com/Unity-Technologies/ml-agents/blob/master/docs/Training-PPO.md
54 DeepRacer "% ! "%
$# ! !
55 DeepRacer
56 DeepRacer
57 DeepRacer
58 DeepRacer
59 DeepRacer
60
61 • g • + + • M D c
• R S D a k • b • LL e
62 DeepRacer
None