Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
DeepRacer for learning RL
Search
貞松政史
April 06, 2019
Technology
0
1.3k
DeepRacer for learning RL
2019.4.6 Developers.IO at OKAYAMA.
貞松政史
April 06, 2019
Tweet
Share
More Decks by 貞松政史
See All by 貞松政史
Amazon Forecast亡き今、我々がマネージドサービスに頼らず時系列予測を実行する方法
sadynitro
0
340
今日のハイライトをシステマティックに
sadynitro
1
46
はじめてのレコメンド〜Amazon Personalizeを使った推薦システム超超超入門〜
sadynitro
1
1.3k
予知保全利用を目指した外観検査AIの開発 〜画像処理AIを用いた外観画像に対する異常検知〜
sadynitro
0
660
20230904_GoogleCloudNext23_Recap_AI_ML
sadynitro
0
730
Foundation Model全盛時代を生きるAI/MLエンジニアの生存戦略
sadynitro
0
830
Amazon SageMakerが存在しない世界線 のAWS上で実現する機械学習基盤
sadynitro
0
190
Amazon SageMakerが存在しない世界線のAWS上で実現する機械学習基盤
sadynitro
0
1.7k
みんな大好き強化学習
sadynitro
0
1k
Other Decks in Technology
See All in Technology
インシデントキーメトリクスによるインシデント対応の改善 / Improving Incident Response using Incident Key Metrics
nari_ex
0
3.9k
ObservabilityCON on the Road Tokyoの見どころ
hamadakoji
0
210
ココナラのセキュリティ組織の体制・役割・今後目指す世界
coconala_engineer
0
220
ソフトウェア開発現代史:製造業とソフトウェアは本当に共存できていたのか?品質とスピードを問い直す
takabow
15
5.3k
サービスローンチを成功させろ! 〜SREが教える30日間の攻略ガイド〜
mmmatsuda
2
4.4k
20250129 Findy_テスト高活用化
dshirae
0
220
Site Reliability Engineering on Kubernetes
nwiizo
6
4.4k
レイクハウスとはなんだったのか?
akuwano
15
2k
SIEMによるセキュリティログの可視化と分析を通じた信頼性向上プロセスと実践
coconala_engineer
1
2.9k
消し忘れリソースゼロへ!私のResource Explorer活用法
cuorain
0
140
プロダクト価値を引き上げる、「課題の再定義」という習慣
moeka__c
0
200
panicを深ぼってみる
kworkdev
PRO
2
140
Featured
See All Featured
[Rails World 2023 - Day 1 Closing Keynote] - The Magic of Rails
eileencodes
33
2k
The Pragmatic Product Professional
lauravandoore
32
6.4k
What's in a price? How to price your products and services
michaelherold
244
12k
What’s in a name? Adding method to the madness
productmarketing
PRO
22
3.3k
Design and Strategy: How to Deal with People Who Don’t "Get" Design
morganepeng
127
19k
A Tale of Four Properties
chriscoyier
157
23k
Producing Creativity
orderedlist
PRO
343
39k
How to train your dragon (web standard)
notwaldorf
89
5.8k
Why Our Code Smells
bkeepers
PRO
335
57k
Responsive Adventures: Dirty Tricks From The Dark Corners of Front-End
smashingmag
251
21k
Learning to Love Humans: Emotional Interface Design
aarron
274
40k
Build The Right Thing And Hit Your Dates
maggiecrowley
34
2.5k
Transcript
4 D 29 26 1 . 0 I 1
& .-* (2 ,0'/4"# 51 83;7 +)
!&%$9( 6: Attention
3 #cmdevio2019
4 os t m ( L @S g E b
i _d L rMI D @ E ( ( ( ) ( e a n k AWS E
5 DeepRacer
6 D
7 ) (
8 …
9 DeepRacer 4 D 26 9 01 .
10 DeepRacer A A A
11
1 2 3
12 DeepRacer
13
14 DeepRacer 1/18
3D AWS DeepRacer League
15 DeepRacer https://aws.amazon.com/jp/deepracer/
16 DeepRacer ! &%$ +)*2 1 '/*2
(#-, 0. "
17 3D AWS RoboMaker Robot Operating System (ROS) Gazebo rqt
18 AWS DeepRacer League ⁻ 0 1 : 9 A
⁻ 9 2 R ⁻ ⁻ D I ⁻ 1 2 ⁻ https://aws.amazon.com/jp/deepracer/league/
19
20 (Artificial Intelligence, AI) (Machine Learning, ML)
NeuralNetwork DeepLearning
21
22 = 1 (
) ( (
23 L N - ) ( - D Q
24 DeepRacer Cliped PPO PPO (Proximal
Policy Optimization) OpenAI2017
25 ( ( )
)
26 1
27 ) () (
28 DeepRacer
29 DeepRacer + + +
30 DeepRacer
31 DeepRacer …
32 DeepRacer
33 orz
34 DeepRacer + + +
35 DeepRacer D A D
36 $ ' + (# &!
%"
37 ( ) ) https://docs.aws.amazon.com/ja_jp /deepracer/latest/developerguide/ deepracer-train-models-define- reward-function.html
38
39 ⁻ 10 ⁻
:
40 SageMaker RL + RoboMaker
41 SageMeker RLRoboMakerGA
42 SageMaker “RL” ⁻ ⁻ M ⁻ M M
⁻ M ⁻ ⁻ ⁻ J M S
43 DeepRacer ) D ( ) ) ( )
44 SageMaker
https://dev.classmethod.jp/machine -learning/sagemaker-robomaker- deepracer-sample/
45 $# "
! https://github.com/awslabs/amazon-sagemaker-examples
46 Jupyter !
47 ( ( )
48
49 2 1
50 2 2
51 (. ( )(
52 $2# " /1(+ $2#,- ! https://docs.aws.amazon.com/ja_jp/deepracer/latest/developerguide/deepracer -iteratively-enhance-reward-functions.html *
$2%) '.0& )
53 Best Practices when training with PPO
(Unity Technologies) https://github.com/Unity-Technologies/ml-agents/blob/master/docs/Training-PPO.md
54 DeepRacer "% ! "%
$# ! !
55 DeepRacer
56 DeepRacer
57 DeepRacer
58 DeepRacer
59 DeepRacer
60
61 • g • + + • M D c
• R S D a k • b • LL e
62 DeepRacer
None