Building kubeflow with Rancher
nakayamam
March 16, 2019
Technology
Slides presented at Rancher Meetup #07 in Osaka.
Transcript
Building kubeflow with Rancher
Rancher Meetup #07 in Osaka
Masaki-Nakayama
Self-introduction
• Masaki-Nakayama @nakayamam2
• KAGOYA JAPAN
• Rancher Meetup, CNJP Kansai
kubeflow? https://www.kubeflow.org/docs/about/kubeflow/
kubeflow? A machine learning toolkit for Kubernetes
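kubeflow? Recommended for the following people (per the official docs)
• You want to train/serve TensorFlow models in different environments (local, on-premises, cloud, etc.)
• You want to use Jupyter notebooks to manage TensorFlow training jobs
• You want to combine TensorFlow with other processes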
kubeflow? The kubeflow dashboard
kubeflow? The following components come preinstalled
• JupyterHub: Jupyter Notebook with user authentication added so that several people can use it
• TFjob Dashboard: manages TensorFlow training jobs on k8s
• Katib Dashboard: a hyperparameter tuning tool (https://www.slideshare.net/Oshima0x3fd/katib seems to cover it in detail)
The road to getting it built looks long...
One day, while browsing the Rancher catalog...
Oh!
Could it be that Chainer is usable right away...?
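Flow
1. Build a GPU cluster on GKE
2. Import the cluster into Rancher
3. Deploy kubeflow from the Rancher catalog
Note: if you only want to install it on GKE, a dedicated one-click deploy is available, so using that may be faster: https://deploy.kubeflow.cloud/#/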
Build a GPU cluster on GKE
Select a GPU and create the cluster. JupyterHub eats resources, so the cluster dies if the spec is too low.
The GPU cluster is ready
Import the cluster into Rancher
The certificate is self-signed, so run this one / Bind cluster-admin to the user
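The binding step on this slide can be sketched as follows. This is only an illustration: the slide shows a screenshot rather than the exact command, so the binding name `cluster-admin-binding` and the user `you@example.com` are placeholders, and the helper `cluster_admin_binding_cmd` is hypothetical. It only builds the `kubectl create clusterrolebinding` command line and does not run it.

```python
import shlex


def cluster_admin_binding_cmd(user, binding_name="cluster-admin-binding"):
    """Build the kubectl argv that binds the cluster-admin ClusterRole to `user`.

    `binding_name` is an arbitrary, assumed name for the new ClusterRoleBinding.
    """
    return [
        "kubectl", "create", "clusterrolebinding", binding_name,
        "--clusterrole=cluster-admin",
        f"--user={user}",
    ]


if __name__ == "__main__":
    # "you@example.com" is a placeholder for your own cloud account.
    print(shlex.join(cluster_admin_binding_cmd("you@example.com")))
```

Printing the command instead of executing it keeps the sketch safe to run anywhere; paste the output into a shell that is already authenticated against the cluster.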
The import succeeded
Deploy kubeflow from the Rancher catalog
It looks like each feature can be switched ON/OFF?
UI
Create a user
Specify an image and spawn
A workspace is created
The workspace is ready
Python code can be run right away
TensorFlow jobs can be created
> kubectl get crd
NAME                                    AGE
backendconfigs.cloud.google.com         2h
scalingpolicies.scalingpolicy.kope.io   2h
studyjobs.kubeflow.org                  1h
tfjobs.kubeflow.org                     1h
Huh, the chainer operator is missing...
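The same check can be done programmatically. The sketch below parses the `kubectl get crd` output from the slide and reports which expected CRDs are absent. The CRD name `chainerjobs.kubeflow.org` is an assumption about what the chainer operator would register (the slide does not name it), and `crd_names` and `missing_crds` are hypothetical helpers.

```python
SAMPLE_OUTPUT = """\
NAME                                    AGE
backendconfigs.cloud.google.com         2h
scalingpolicies.scalingpolicy.kope.io   2h
studyjobs.kubeflow.org                  1h
tfjobs.kubeflow.org                     1h
"""


def crd_names(kubectl_output):
    """Return the set of CRD names from `kubectl get crd` table output."""
    lines = [ln for ln in kubectl_output.splitlines() if ln.strip()]
    # The first non-empty line is the NAME/AGE header, so skip it.
    return {ln.split()[0] for ln in lines[1:]}


def missing_crds(kubectl_output, expected):
    """Return the expected CRD names that do not appear in the output."""
    present = crd_names(kubectl_output)
    return [name for name in expected if name not in present]


if __name__ == "__main__":
    # "chainerjobs.kubeflow.org" is an assumed name, not taken from the slides.
    expected = ["tfjobs.kubeflow.org", "chainerjobs.kubeflow.org"]
    print(missing_crds(SAMPLE_OUTPUT, expected))
```

On a live cluster the text would come from running `kubectl get crd` yourself; the sample here is the output shown on the slide.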
{"log":"2019/03/14 17:57:17 info: manifest \"kubeflow/templates/chainer-rbac.yaml\" is empty. Skipping. \n","stream":"stderr","time":"2019-03-14T17:57:17.702047111Z"}
{"log":"2019/03/14 17:57:17 info: manifest \"kubeflow/templates/chainer-crd.yaml\" is empty. Skipping. \n","stream":"stderr","time":"2019-03-14T17:57:17.702898257Z"}
{"log":"2019/03/14 17:57:17 info: manifest \"kubeflow/templates/chainer-operator.yaml\" is empty. Skipping. \n","stream":"stderr","time":"2019-03-14T17:57:17.702904447Z"}
Looking at the Rancher server log, the chainer templates apparently are not being reflected in the manifest
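Log lines in this shape (one JSON object per line with `log`, `stream`, and `time` fields) are easy to scan for skipped templates. The following is a minimal sketch, assuming the log text has already been saved from the Rancher server container. The sample lines are copied from the slide with the wrap-induced space in the template path removed, and `skipped_manifests` is a hypothetical helper name.

```python
import json
import re

# Matches the "is empty. Skipping." message seen in the slide's log lines.
SKIPPED = re.compile(r'manifest "([^"]+)" is empty\. Skipping\.')

SAMPLE_LOG = r"""
{"log":"2019/03/14 17:57:17 info: manifest \"kubeflow/templates/chainer-rbac.yaml\" is empty. Skipping. \n","stream":"stderr","time":"2019-03-14T17:57:17.702047111Z"}
{"log":"2019/03/14 17:57:17 info: manifest \"kubeflow/templates/chainer-crd.yaml\" is empty. Skipping. \n","stream":"stderr","time":"2019-03-14T17:57:17.702898257Z"}
"""


def skipped_manifests(log_text):
    """Return template paths reported as empty and skipped, in log order."""
    found = []
    for line in log_text.splitlines():
        line = line.strip()
        if not line:
            continue
        try:
            entry = json.loads(line)
        except json.JSONDecodeError:
            continue  # ignore lines that are not JSON log entries
        if not isinstance(entry, dict):
            continue
        match = SKIPPED.search(entry.get("log", ""))
        if match:
            found.append(match.group(1))
    return found


if __name__ == "__main__":
    for path in skipped_manifests(SAMPLE_LOG):
        print(path)
```

Listing the skipped paths this way makes it quick to confirm that every chainer template was dropped, which is the kind of detail worth attaching to an issue report.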
I filed an issue!
Thanks!