Kubernetes: Self-Hosted On-Premises vs. GKE, Which One Suits You Better? @DevFest Taipei 2024

Kubernetes self-hosted on-premises vs. GKE: which one suits you better? Taipei, Johnny Sung

Full stack developer Johnny Sung (宋岡諺)
https://fb.com/j796160836
https://blog.jks.coffee/
https://www.slideshare.net/j796160836
https://github.com/j796160836

Outline
• Kubernetes (K8s) basic concepts
• Kubernetes (K8s) component concepts
• On-premises setup in practice
• Let's spin up a GKE cluster
• About GPUs

High Availability https://soco-st.com/18158

https://blog.whmcs.com/133514/demystifying-high-availability-for-whmcs ( Active / Standby )

CAP theorem
• Consistency
• Availability
• Partition tolerance
https://zh.wikipedia.org/zh-tw/CAP%E5%AE%9A%E7%90%86 https://medium.com/nerd-for-tech/understand-cap-theorem-751f0672890e

https://medium.com/how-gipi-learn/%E5%B0%8A%E9%87%8D-%E9%9C%80%E8%A6%81%E9%9D%A0%E5%B0%88%E6%A5%AD%E5%8E%BB%E8%B4%8F%E5%9B%9E%E4%BE%86-8fdecf676fe5

https://medium.com/%E5%BE%8C%E7%AB%AF%E6%96%B0%E6%89%8B%E6%9D%91/cap%E5%AE%9A%E7%90%86101-3fdd10e0b9a

Achieving high availability usually means changing the application code; this talk looks at it from the infrastructure side. https://soco-st.com/18158

https://javascript.plainenglish.io/what-is-a-server-explanation-for-young-developers-2511d8b313b7

https://ithelp.ithome.com.tw/articles/10250841 Virtual Machine (VM) vs Docker

https://upload.wikimedia.org/wikipedia/commons/6/67/Kubernetes_logo.svg

https://www.cncf.io/blog/2024/06/06/unveiling-the-10-year-kubernetes-anniversary-logo/

https://www.linuxfoundation.org/kubernetes-10-year-logo-contest

Kubernetes from a developer's point of view. Could this be you? https://soco-st.com/20498

"I know! It's just files!" https://soco-st.com/20498

"What is that? Can you eat it?" https://soco-st.com/20498

Think back to the Docker days

docker run command: starts a single service at a time
docker run -v ./www:/usr/share/nginx/html:ro -p 80:80 -d nginx
Created by hanis tusiyani from Noun Project https://thenounproject.com/icon/server-7086299/ https://thenounproject.com/icon/data-center-7086329/ https://www.pngwing.com/en/free-png-ztqam

docker-compose.yml: starts multiple services at once (the docker run command starts only one)
version: "3"
services:
  nginx:
    image: nginx
    volumes:
      - ./www:/usr/share/nginx/html:ro
    ports:
      - 80:80

Kubernetes: deploys multiple services across multiple hosts (docker run starts a single service; docker-compose.yml starts multiple services at once)
• deployment.yml
• services.yml
• rbac.yml
• config-map.yml
• …

docker-compose
version: "3"
services:
  nginx:
    image: nginx
    volumes:
      - ./www:/usr/share/nginx/html:ro
    ports:
      - 80:80
• Service deployment (image)
• Disk (volumes)
• Network (ports)

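Corresponding Kubernetes components
• Service deployment → Deployment / Pod
• Disk → PersistentVolumeClaim (PVC) / ConfigMap / Secret (a persistent storage claim is automatically matched 1:1 to a PersistentVolume (PV))
• Network → Service / Ingress (on-premises K8s has no LoadBalancer available by default)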

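Kustomize
Kustomize is a configuration management tool for Kubernetes that simplifies deployments by customizing resource configuration. It focuses on modifying and managing Kubernetes manifest files declaratively, with no need to generate configuration dynamically. Users create a "base" of shared configuration and then apply customized overlays for different environments (such as development, testing, and production). Kustomize can merge or replace parts of YAML files, which makes configuration more modular and reusable. It is now part of Kubernetes and can be used directly through the kubectl command-line tool.
https://zlaval.medium.com/kustomize-template-free-kubernetes-application-management-3d70ca9d2e05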

Kustomize file structure
deployment.yml
services.yml
config-map.yml
…
kustomization.yaml
https://thenounproject.com/icon/file-6897025/ https://thenounproject.com/icon/puzzle-6850847/
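As a rough illustration of how the files above fit together, a kustomization.yaml might look like the sketch below. The resource file names come from the slide; the namespace and the image tag override are assumptions added only for illustration.

```yaml
# kustomization.yaml (a minimal sketch, not the speaker's actual file)
apiVersion: kustomize.config.k8s.io/v1beta1
kind: Kustomization

# Assumed namespace, applied to every resource listed below
namespace: my-namespace

# Manifest files that make up the base (names taken from the slide)
resources:
  - deployment.yml
  - services.yml
  - config-map.yml

# Hypothetical override: change the image tag without editing deployment.yml
images:
  - name: my_image
    newTag: "1.1"
```

With this file in the same directory as the manifests, `kubectl apply -k .` builds and applies the combined result.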

Basic components of a website service

Basic components of a website service: Pod, Container
https://thenounproject.com/icon/ram-7094983/ https://thenounproject.com/icon/hard-disk-7094988/ https://thenounproject.com/icon/network-5355161/ https://thenounproject.com/icon/history-5019532/ https://thenounproject.com/icon/central-processing-unit-7095000/ https://thenounproject.com/icon/form-6622708/ https://thenounproject.com/icon/approval-6293848/

Basic components of a website service: Pod, Container, Service
Created by Mada Creative

Basic components of a website service: Pod, Container, ReplicaSet, Deployment, Service
by Muhammad Naufal Subhiansyah from Noun Project

Basic components of a website service: Service, PersistentVolumeClaim (PVC), PersistentVolume (PV); a PVC maps 1:1 to a PV

Deployment
• Defines how a Pod is deployed
• Replicas: how many copies
• Configuration parameters
• ConfigMap, Secret
• Resource limits (CPU, memory)
• VolumeMounts (the PersistentVolumeClaim (PVC) to use)

apiVersion: apps/v1
kind: Deployment
metadata:
  labels:
    app: my-deployment
  name: my-deployment
  namespace: my-namespace
spec:
  replicas: 1                    # how many Pod copies to run
  selector:
    matchLabels:
      app: my-deployment         # must match the Pod template labels below
  template:
    metadata:
      labels:
        app: my-deployment
    spec:
      containers:
      - image: my_image:1.0
        name: my-image           # container names cannot contain underscores
        resources:
          requests:              # guaranteed minimum, used for scheduling
            memory: 64Mi
            cpu: 250m
          limits:                # hard ceiling for the container
            memory: 128Mi
            cpu: 500m
        ports:
        - containerPort: 5000
          name: my-image         # port names cannot contain underscores either
        volumeMounts:            # the same PVC is mounted at two paths
        - name: my-pvc
          mountPath: /mydata
        - name: my-pvc
          mountPath: /data/output
      volumes:
      - name: my-pvc
        persistentVolumeClaim:
          claimName: my-pvc
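To show how the network side pairs with the Deployment above, here is a minimal Service sketch. The Service name and the exposed port 80 are assumptions for illustration; the selector and target port follow the Deployment on the slide.

```yaml
apiVersion: v1
kind: Service
metadata:
  name: my-service            # hypothetical name
  namespace: my-namespace
spec:
  type: ClusterIP             # reachable only inside the cluster
  selector:
    app: my-deployment        # matches the Pod template label in the Deployment
  ports:
    - name: http
      port: 80                # assumed port exposed by the Service
      targetPort: 5000        # containerPort declared in the Deployment
```

ClusterIP is used here because, as the deck notes, an on-premises cluster has no LoadBalancer available by default.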

Basic components of a website service, and there are more...: Service, PersistentVolumeClaim (PVC), PersistentVolume (PV) (1:1), StorageClass, Provisioner
Created by Andika Cahya Fitriani from the Noun Project
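The PVC, StorageClass, and Provisioner relationship can be sketched with a claim like the one below. The claim name matches the `my-pvc` referenced by the Deployment slide; the StorageClass name and the requested size are assumptions, and the class must already exist in the cluster for its provisioner to create the matching PV.

```yaml
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: my-pvc                  # referenced by claimName in the Deployment
  namespace: my-namespace
spec:
  accessModes:
    - ReadWriteOnce             # mounted read-write by a single node
  storageClassName: standard    # hypothetical StorageClass name
  resources:
    requests:
      storage: 1Gi              # assumed size
```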


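Corresponding Kubernetes components
• Service deployment → Deployment / StatefulSet / Pod
• Disk → PersistentVolumeClaim (PVC) / ConfigMap / Secret (a persistent storage claim is automatically matched 1:1 to a PersistentVolume (PV))
• Network → Service / Ingress (on-premises K8s has no LoadBalancer available by default)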

Kubernetes from an operations engineer's point of view. Could this still be you? https://soco-st.com/18158

The many choices in K8s
• Operating system (OS)
• K8s distro
• Container runtime
• CNI (Container Network Interface)
• CRI (Container Runtime Interface)

The many choices in K8s
• Operating system (OS): Ubuntu? Red Hat?

The many choices in K8s
• Container runtime: Docker? containerd? CRI-O?

The many choices in K8s
• K8s distro
• Community: kubeadm? Rancher?
• Commercial: OpenShift? VMware Tanzu?

The many choices in K8s
• CNI (Container Network Interface): Flannel? Calico? Cilium?

Put it all together... https://soco-st.com/18158

No need! Just use Google Cloud ☺

Let me give you a default set of choices!
• Operating system (OS): Ubuntu
• K8s distro: kubeadm
• Container runtime: Docker
• CNI (Container Network Interface): Flannel
• CRI (Container Runtime Interface): cri-dockerd
https://soco-st.com/21673 https://en.m.wikipedia.org/wiki/File:UbuntuCoF.svg https://www.docker.com/company/newsroom/media-resources/

And that is not all https://soco-st.com/18158

Components commonly installed on K8s
• StorageClass (storage)
• Metrics Server (K8s scaling)
• Argo CD (deployment)
• Prometheus + Grafana (monitoring)

Introduction to Kubernetes components

https://mrdevops.hashnode.dev/kubernetes-architecture

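Key K8s components
• kubelet: the main service on every node; makes sure the components there are running properly
• kube-apiserver: the core; serves the Kubernetes HTTP API
• kube-scheduler: the scheduler; assigns Pods to suitable nodes
https://github.com/coredns https://github.com/etcd-io/etcd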

Key K8s components
• etcd: a key-value database, characterized by consistency and high availability
• CoreDNS
• Network CNI
• Container runtime (CRI)
https://github.com/coredns https://github.com/etcd-io/etcd

https://www.cncf.io/

That is already a lot, and these are only the core K8s components

Count the surrounding systems and there are even more. And more…

https://blog.jks.coffee/on-premise-self-host-kubernetes-k8s-setup-redhat
https://blog.jks.coffee/on-premise-self-host-kubernetes-k8s-setup-ubuntu

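Rough steps
• <on every node> Disable swap
• <on every node> Install Docker
• <on every node> Install kubelet, kubeadm, kubectl
• <on every node> Install cri-dockerd
• <on every node> Set up /etc/hosts
• Set up the control plane node
• Set up the worker nodes
• <on the control plane> Install the Helm package manager
• <on the control plane> Install the Flannel CNI
• <on the control plane> Test and check the cluster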
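For the "set up the control plane node" step, the choices above (cri-dockerd as the CRI, Flannel as the CNI) could be captured in a kubeadm configuration file roughly like this. It is a sketch under assumptions: the file name is hypothetical, the socket path is cri-dockerd's usual default, the Pod subnet is the one Flannel's stock manifest expects, and newer kubeadm releases prefer the `v1beta4` API version over `v1beta3`.

```yaml
# kubeadm-config.yaml (hypothetical file name)
apiVersion: kubeadm.k8s.io/v1beta3
kind: InitConfiguration
nodeRegistration:
  # Talk to Docker through cri-dockerd (assumed default socket path)
  criSocket: unix:///var/run/cri-dockerd.sock
---
apiVersion: kubeadm.k8s.io/v1beta3
kind: ClusterConfiguration
networking:
  # Pod network range expected by Flannel's default manifest
  podSubnet: 10.244.0.0/16
```

On the control plane node this would be passed as `kubeadm init --config kubeadm-config.yaml`; the linked blog posts describe the speaker's actual procedure.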

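GKE vs. on-premises
• One is in the cloud, the other on-premises (obviously)
• Pay-as-you-go (elastic billing) vs. N million per physical machine
• Data center power, air conditioning, access control, fire protection, compliance...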

https://www.ithome.com.tw/tech/87704

https://www.kubecost.com/kubernetes-autoscaling/kubernetes-hpa/

When a service is at full load and needs to scale out... (diagram: Worker node, Worker node, ?)
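Scaling out at the Pod level is what a HorizontalPodAutoscaler handles; adding worker nodes is a separate concern. The sketch below targets the `my-deployment` Deployment shown earlier in the deck. The HPA name, replica bounds, and CPU threshold are assumptions, and it relies on Metrics Server being installed and on the containers declaring CPU requests.

```yaml
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: my-deployment-hpa       # hypothetical name
  namespace: my-namespace
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: my-deployment         # the Deployment to scale
  minReplicas: 1
  maxReplicas: 5                # assumed upper bound
  metrics:
    - type: Resource
      resource:
        name: cpu
        target:
          type: Utilization
          averageUtilization: 70   # assumed target, in percent of CPU requests
```

Once the Pods no longer fit on the existing nodes, the cluster itself has to grow, which is where the cost difference between cloud and physical machines shows up.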

What if they are physical machines? 💰💰💰 (diagram: physical machine, physical machine, ?)

No need! Just use Google Cloud ☺

Google Kubernetes Engine (GKE) architecture

https://cloud.google.com/kubernetes-engine/docs/concepts/cluster-architecture

Let's spin up a GKE cluster
• Choose Autopilot or Standard mode
• Choose a region
• Choose a Kubernetes version
https://soco-st.com/21673

The two GKE modes
• Autopilot mode
• Standard mode (choose this one for GPUs)
https://console.cloud.google.com/kubernetes/add

https://k8s.ithome.com.tw/2024/workshop-page/3348

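Region
• Distance (response time)
• Spanning AZs (availability zones) adds fault tolerance (partition tolerance), but cross-zone traffic is billed
• All of this shows up directly in the bill 💰
Taiwan region (asia-east1)
https://mapsvg.com/maps/world https://cloud.google.com/kubernetes-engine/pricing?hl=zh-tw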

K8s version
• Availability vs. version stability
• The release just before the latest is available and is the more stable choice

Experienced users can use Terraform or CLI commands https://soco-st.com/18158

Connecting to GKE
• Open Cloud Shell
• Select the project
• Load the credentials
gcloud config set project [PROJECT_ID]
gcloud container clusters get-credentials [CLUSTER_NAME] \
    --region=[COMPUTE_REGION]
https://cloud.google.com/kubernetes-engine/docs/how-to/cluster-access-for-kubectl

Cloud: GKE

On-premises: kubeadm + Flannel

About GPUs https://www.onlogic.com/blog/what-is-a-gpu-a-beginners-guide/

Want to play with an on-premises LLM?

First, you need an NVIDIA card (just kidding)

https://mises.org/mises-daily/understanding-price-money

GPU-related pieces
• NVIDIA driver
• NVIDIA CUDA
• GPU Operator

Key GPU Operator components
• Device Plugin
• GPU Feature Discovery (GFD)
• DCGM
• DCGM Exporter
• …

https://info.nvidia.com/how-to-use-gpus-on-kubernetes-webinar.html

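Rough steps for GPUs on K8s
• Install the NVIDIA driver (the .run installer)
• Install NVIDIA CUDA
• Install the NVIDIA Container Toolkit
• Run the command that patches the config to bind it to containerd
• Install Kubernetes
• Install the GPU Operator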
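Once the steps above are done, a quick way to check that Pods can actually see the GPU is a one-shot Pod that requests one `nvidia.com/gpu` and runs `nvidia-smi`. This is only a sketch: the Pod name is hypothetical and the CUDA image tag is an assumption, so substitute one that matches the installed driver.

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: gpu-smoke-test                 # hypothetical name
spec:
  restartPolicy: Never                 # run once and stop
  containers:
    - name: cuda
      # Assumed image tag; pick one compatible with your driver version
      image: nvidia/cuda:12.4.1-base-ubuntu22.04
      command: ["nvidia-smi"]
      resources:
        limits:
          nvidia.com/gpu: 1            # resource advertised by the NVIDIA device plugin
```

If scheduling succeeds, `kubectl logs gpu-smoke-test` should show the usual `nvidia-smi` table for the allocated card.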

How do you slice a GPU? https://realfood.tesco.com/recipes/rainbow-cake.html

https://aws.amazon.com/tw/blogs/containers/gpu-sharing-on-amazon-eks-with-nvidia-time-slicing-and-accelerated-ec2-instances/ https://developer.nvidia.com/blog/improving-gpu-utilization-in-kubernetes/

There appear to be five ways to slice a GPU, but really there are only two https://soco-st.com/18158

Time slicing
• Works by time-division multiplexing
• vRAM is not limited
MPS (Multi-Process Service)
• Allocates in a multi-thread fashion
• Each share gets a fixed amount of vRAM

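MIG (Multi-Instance GPU)
• Partitions the GPU at the hardware level
• Only available on specific models (e.g. A100, H100)
• Ampere, Hopper™, or Blackwell series
vGPU (virtual GPU)
• NVIDIA's supported GPU virtualization
• Requires a software license
https://www.nvidia.com/en-us/technologies/multi-instance-gpu/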

https://www.nvidia.com/zh-tw/data-center/resources/vgpu-evaluation/

GPU mode comparison
• Time slicing: memory is not limited, so processes crowd each other out
• MPS: an even split done in software
• MIG: partitioning at the hardware level
https://cloud.google.com/kubernetes-engine/docs/concepts/timesharing-gpus
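For the time-slicing mode on an on-premises cluster, the NVIDIA device plugin reads a sharing configuration from a ConfigMap. The sketch below assumes the GPU Operator runs in a `gpu-operator` namespace; the ConfigMap name, the `any` key, and the replica count are illustrative choices, and the Operator's ClusterPolicy still has to be pointed at this ConfigMap as described in NVIDIA's documentation.

```yaml
apiVersion: v1
kind: ConfigMap
metadata:
  name: time-slicing-config       # hypothetical name
  namespace: gpu-operator         # assumed GPU Operator namespace
data:
  # "any" is an arbitrary key naming this configuration
  any: |-
    version: v1
    sharing:
      timeSlicing:
        resources:
          - name: nvidia.com/gpu
            replicas: 4           # each physical GPU is advertised as 4 shares
```

Consistent with the comparison above, these shares take turns on the GPU and get no memory isolation from one another.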

https://www.youtube.com/watch?v=Q2GuTUO170w

Adding GPUs to the cluster
• Choose the card
• Share the GPU
• Watch the budget 💰

Adding GPUs to the cluster https://cloud.google.com/kubernetes-engine/docs/how-to/gpus#gcloud

Are they available in the Taiwan region (asia-east1)? Yes!
List the GPUs/TPUs available on Google Cloud:
gcloud compute accelerator-types list

NAME                   ZONE          DESCRIPTION
nvidia-l4              asia-east1-a  NVIDIA L4
nvidia-l4-vws          asia-east1-a  NVIDIA L4 Virtual Workstation
nvidia-tesla-p100      asia-east1-a  NVIDIA Tesla P100
nvidia-tesla-p100-vws  asia-east1-a  NVIDIA Tesla P100 Virtual Workstation
nvidia-tesla-t4        asia-east1-a  NVIDIA T4
nvidia-tesla-t4-vws    asia-east1-a  NVIDIA Tesla T4 Virtual Workstation
nvidia-l4              asia-east1-b  NVIDIA L4
nvidia-l4-vws          asia-east1-b  NVIDIA L4 Virtual Workstation
nvidia-l4              asia-east1-c  NVIDIA L4
nvidia-l4-vws          asia-east1-c  NVIDIA L4 Virtual Workstation
nvidia-tesla-p100      asia-east1-c  NVIDIA Tesla P100
nvidia-tesla-p100-vws  asia-east1-c  NVIDIA Tesla P100 Virtual Workstation
nvidia-tesla-t4        asia-east1-c  NVIDIA T4
nvidia-tesla-t4-vws    asia-east1-c  NVIDIA Tesla T4 Virtual Workstation
nvidia-tesla-v100      asia-east1-c  NVIDIA V100
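On GKE, a workload can be steered to a specific card from the list above with a node selector on the accelerator label. This sketch assumes a Standard-mode cluster that already has a node pool with NVIDIA L4 GPUs attached; the Pod name and image tag are illustrative.

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: l4-smoke-test                       # hypothetical name
spec:
  restartPolicy: Never
  nodeSelector:
    # GKE node label carrying the accelerator type of the node pool
    cloud.google.com/gke-accelerator: nvidia-l4
  containers:
    - name: cuda
      # Assumed image tag; any CUDA base image with nvidia-smi will do
      image: nvidia/cuda:12.4.1-base-ubuntu22.04
      command: ["nvidia-smi"]
      resources:
        limits:
          nvidia.com/gpu: 1                 # one GPU from the L4 node pool
```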

Playing with open-source LLMs
• Gemma: an open LLM built from the same research and technology used to create the Gemini models
• Ollama https://ollama.com/
• Open WebUI https://openwebui.com/
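One way to try this on a GPU-enabled cluster is to run Ollama as a Deployment and pull a Gemma model into it. The sketch below is an assumption-heavy starting point, not the speaker's setup: the names are hypothetical, the image tag is left at its default, and the model cache uses an emptyDir, so downloaded models are lost when the Pod is replaced.

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: ollama                      # hypothetical name
spec:
  replicas: 1
  selector:
    matchLabels:
      app: ollama
  template:
    metadata:
      labels:
        app: ollama
    spec:
      containers:
        - name: ollama
          image: ollama/ollama      # official image; pin a tag for real use
          ports:
            - containerPort: 11434  # Ollama's default API port
          resources:
            limits:
              nvidia.com/gpu: 1     # assumes a GPU node is available
          volumeMounts:
            - name: models
              mountPath: /root/.ollama   # where Ollama stores pulled models
      volumes:
        - name: models
          emptyDir: {}              # swap for a PVC to keep models across restarts
```

Open WebUI can then be deployed alongside it and pointed at the Ollama API on port 11434.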

https://medium.com/@dilipkashyap15/googles-new-ai-model-gemini-now-available-in-bard-here-is-how-to-use-259386d6bd68

https://ai.google.dev/gemma?hl=zh-tw#gemma-2

Takeaways
• The basic components of a website service
• The components inside Kubernetes
• How to consume GPUs

Q & A https://www.sherpany.com/en/resources/digital-transformation/cloud-computing/cloud-computing-definition/

If you want to use AI
• Vertex AI: integrate through its API https://cloud.google.com/vertex-ai
• Cloud Run: run one-off services https://cloud.google.com/run
• GKE: suited to long-running deployment architectures https://cloud.google.com/kubernetes-engine
https://theaiagency.co.nz/
