Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
DeepRacer for learning RL
Search
貞松政史
April 06, 2019
Technology
0
1.4k
DeepRacer for learning RL
2019.4.6 Developers.IO at OKAYAMA.
貞松政史
April 06, 2019
Tweet
Share
More Decks by 貞松政史
See All by 貞松政史
Amazon Forecast亡き今、我々がマネージドサービスに頼らず時系列予測を実行する方法
sadynitro
0
1k
今日のハイライトをシステマティックに
sadynitro
1
72
はじめてのレコメンド〜Amazon Personalizeを使った推薦システム超超超入門〜
sadynitro
2
2.2k
予知保全利用を目指した外観検査AIの開発 〜画像処理AIを用いた外観画像に対する異常検知〜
sadynitro
0
1.1k
20230904_GoogleCloudNext23_Recap_AI_ML
sadynitro
0
880
Foundation Model全盛時代を生きるAI/MLエンジニアの生存戦略
sadynitro
0
980
Amazon SageMakerが存在しない世界線 のAWS上で実現する機械学習基盤
sadynitro
0
270
Amazon SageMakerが存在しない世界線のAWS上で実現する機械学習基盤
sadynitro
0
2k
みんな大好き強化学習
sadynitro
0
1.2k
Other Decks in Technology
See All in Technology
アノテーション作業書作成のGood Practice
cierpa0905
PRO
0
310
組織全員で向き合うAI Readyなデータ利活用
gappy50
5
1.8k
GraphRAG グラフDBを使ったLLM生成(自作漫画DBを用いた具体例を用いて)
seaturt1e
1
160
ざっくり学ぶ 『エンジニアリングリーダー 技術組織を育てるリーダーシップと セルフマネジメント』 / 50 minute Engineering Leader
iwashi86
6
3.5k
From Natural Language to K8s Operations: The MCP Architecture and Practice of kubectl-ai
appleboy
0
370
Azure Well-Architected Framework入門
tomokusaba
1
150
RemoteFunctionを使ったコロケーション
mkazutaka
1
150
re:Inventに行くまでにやっておきたいこと
nagisa53
0
750
JSConf JPのwebsiteをGatsbyからNext.jsに移行した話 - Next.jsの多言語静的サイトと課題
leko
2
190
AIの個性を理解し、指揮する
shoota
3
490
dbtとAIエージェントを組み合わせて見えたデータ調査の新しい形
10xinc
7
1.5k
IBC 2025 動画技術関連レポート / IBC 2025 Report
cyberagentdevelopers
PRO
2
220
Featured
See All Featured
GraphQLの誤解/rethinking-graphql
sonatard
73
11k
The Language of Interfaces
destraynor
162
25k
Leading Effective Engineering Teams in the AI Era
addyosmani
7
670
Fashionably flexible responsive web design (full day workshop)
malarkey
407
66k
The Web Performance Landscape in 2024 [PerfNow 2024]
tammyeverts
10
890
Sharpening the Axe: The Primacy of Toolmaking
bcantrill
46
2.5k
Making the Leap to Tech Lead
cromwellryan
135
9.6k
Why You Should Never Use an ORM
jnunemaker
PRO
59
9.6k
Building Better People: How to give real-time feedback that sticks.
wjessup
370
20k
Agile that works and the tools we love
rasmusluckow
331
21k
The Illustrated Children's Guide to Kubernetes
chrisshort
49
51k
How To Stay Up To Date on Web Technology
chriscoyier
791
250k
Transcript
4 D 29 26 1 . 0 I 1
& .-* (2 ,0'/4"# 51 83;7 +)
!&%$9( 6: Attention
3 #cmdevio2019
4 os t m ( L @S g E b
i _d L rMI D @ E ( ( ( ) ( e a n k AWS E
5 DeepRacer
6 D
7 ) (
8 …
9 DeepRacer 4 D 26 9 01 .
10 DeepRacer A A A
11
1 2 3
12 DeepRacer
13
14 DeepRacer 1/18
3D AWS DeepRacer League
15 DeepRacer https://aws.amazon.com/jp/deepracer/
16 DeepRacer ! &%$ +)*2 1 '/*2
(#-, 0. "
17 3D AWS RoboMaker Robot Operating System (ROS) Gazebo rqt
18 AWS DeepRacer League ⁻ 0 1 : 9 A
⁻ 9 2 R ⁻ ⁻ D I ⁻ 1 2 ⁻ https://aws.amazon.com/jp/deepracer/league/
19
20 (Artificial Intelligence, AI) (Machine Learning, ML)
NeuralNetwork DeepLearning
21
22 = 1 (
) ( (
23 L N - ) ( - D Q
24 DeepRacer Cliped PPO PPO (Proximal
Policy Optimization) OpenAI2017
25 ( ( )
)
26 1
27 ) () (
28 DeepRacer
29 DeepRacer + + +
30 DeepRacer
31 DeepRacer …
32 DeepRacer
33 orz
34 DeepRacer + + +
35 DeepRacer D A D
36 $ ' + (# &!
%"
37 ( ) ) https://docs.aws.amazon.com/ja_jp /deepracer/latest/developerguide/ deepracer-train-models-define- reward-function.html
38
39 ⁻ 10 ⁻
:
40 SageMaker RL + RoboMaker
41 SageMeker RLRoboMakerGA
42 SageMaker “RL” ⁻ ⁻ M ⁻ M M
⁻ M ⁻ ⁻ ⁻ J M S
43 DeepRacer ) D ( ) ) ( )
44 SageMaker
https://dev.classmethod.jp/machine -learning/sagemaker-robomaker- deepracer-sample/
45 $# "
! https://github.com/awslabs/amazon-sagemaker-examples
46 Jupyter !
47 ( ( )
48
49 2 1
50 2 2
51 (. ( )(
52 $2# " /1(+ $2#,- ! https://docs.aws.amazon.com/ja_jp/deepracer/latest/developerguide/deepracer -iteratively-enhance-reward-functions.html *
$2%) '.0& )
53 Best Practices when training with PPO
(Unity Technologies) https://github.com/Unity-Technologies/ml-agents/blob/master/docs/Training-PPO.md
54 DeepRacer "% ! "%
$# ! !
55 DeepRacer
56 DeepRacer
57 DeepRacer
58 DeepRacer
59 DeepRacer
60
61 • g • + + • M D c
• R S D a k • b • LL e
62 DeepRacer
None