Exploring best practices for a greener Kubernetes [Cloud Native Bergen]

A Greener Kubernetes Exploring Best Practices

3Rs: Reduce, Reuse, Recycle Image: RawPixel

Refuse Rethink Reduce Reuse Repair Updated: 9Rs Refurbish Remanufacture Repurpose
Recycle Recover

From waste management to workload orchestration Brought to you by
the CNCF’s Technical Advisory Group ENV project Sustainable Kubernetes.

Refuse Reduce Repair The 5Rs of Kubernetes Resize Reschedule Repeat

Run as few Nodes as possible

Read the Energy Proportionality Whitepaper from Google Utilize each Node
as much as possible

https://cast.ai/kubernetes-cost-benchmark Each abstraction layer comes with waste

Refuse: you can say no to Pods

Refuse: you can say no to Pods Use admission controllers
to enforce good practices. Use-case #1: you offer node pools tailored to specific workloads. Use-case #2: You want to enforce better utilization from the start.

Refuse: you can say no to Pods Benefits? - Potentially
better node/resource utilization. Drawbacks? - Setting requests & limits isn’t a one-time action. - Will only fit particular use-cases. Use admission controllers to enforce good practices.

Reduce: run what you truly need

Reduce: run what you truly need Turn off unneeded workloads,
permanently or temporarily. The most environmentally- -friendly code is the code we choose not to write. Identify unnecessary work. Prevent zombie drift. Employ the scream test.

Reduce: run what you truly need Turn off unneeded workloads,
permanently or temporarily. Few services need to be always-on. - Employ kube-green (namespaces on a schedule). - Leverage KEDA (event-driven autoscaler).

Repair: broken workloads are a waste

Repair: broken workloads are a waste Workloads can be visibly
or invisibly broken, and they waste resources. A Pod that doesn’t heal will reserve resources on a Node each time it’s restarted and scheduled. Mutating webhooks can scale down Crash-looping workloads down to 0.

Repair: broken workloads are a waste Workloads can be visibly
or invisibly broken, and they waste resources. Invisible for Kubernetes, visible for well-defined alerts that recognize apps running idle but using a lot of resources.

Resize: adjust capacity as needs change

Resize: adjust capacity as needs change Rightsizing is at the
heart of sustainable system architecture. Reasons for cloud waste: - Lack of visibility, - Overprovisioning (wrong request/limit settings), - Leaving cloud resources idle, - Low usage of spot machines.

Screenshot from Kubecost recommending instance types: https://www.kubecost.com

Resize: adjust capacity as needs change Optimize by rightsizing. Ensure
visibility. Pick newer machine types. Employ advanced autoscaling. Periodically review resource requests/limits. Use leftover capacity by spot nodes (temporary solution).

Reserving capacity shows cost can be a bad metric for
sustainability*.

Reschedule: even out usage spikes

Reschedule: even out usage spikes Shift demand in time or
in space. Distribute your workloads around the clock based on resource usage (easier) or grid carbon intensity (harder). - move around job schedules - Karpenter, WattTime, …

Reschedule: even out usage spikes Shift demand in time or
in space. Spacial shifting might be the single most impactful decision you make, but has challenges: - latency, - feature availability.

Screenshot from Cloud Carbon Footprint: https://www.cloudcarbonfootprint.org

Repeat: instead of running non-stop

Repeat: instead of running non-stop Identify workloads that can run
on a schedule. 200MB of RAM 24/24h (Deployment) vs 2/24h (CronJob) Good use-case: - Exports, batch jobs, database backups. Bad use-case: - API server.

Repeat: instead of running non-stop Identify workloads that can run
on a schedule. Benefits? - Cut down on idling workloads. Drawbacks? - Potentially time-intensive changes to code & architecture.

Refuse Reduce Repair Once again: the 5Rs of Kubernetes Resize
Reschedule Repeat

without having a lot of time. Centering environmental sustainability in
our everyday work

Full talk at the GSF Oslo Github Kristina Devochko Platform
Engineer GSF Oslo group founder https://github.com/gsf-oslo

CREDITS: This presentation template was created by Slidesgo , including
icons by Flaticon , infographics & images by Freepik Thank you! Marta Paciorkowska Platform Engineer @ Oda [email protected] Mastodon: @[email protected]

Exploring best practices for a greener Kubernet...

Exploring best practices for a greener Kubernetes [Cloud Native Bergen]

Marta Paciorkowska

More Decks by Marta Paciorkowska

Other Decks in Technology

Featured

Transcript

A Greener Kubernetes Exploring Best Practices

3Rs: Reduce, Reuse, Recycle Image: RawPixel

Refuse Rethink Reduce Reuse Repair Updated: 9Rs Refurbish Remanufacture Repurpose

From waste management to workload orchestration Brought to you by

Refuse Reduce Repair The 5Rs of Kubernetes Resize Reschedule Repeat

Run as few Nodes as possible

Read the Energy Proportionality Whitepaper from Google Utilize each Node

https://cast.ai/kubernetes-cost-benchmark Each abstraction layer comes with waste

Refuse: you can say no to Pods

Refuse: you can say no to Pods Use admission controllers

Refuse: you can say no to Pods Benefits? - Potentially

Reduce: run what you truly need

Reduce: run what you truly need Turn off unneeded workloads,

Reduce: run what you truly need Turn off unneeded workloads,

Repair: broken workloads are a waste

Repair: broken workloads are a waste Workloads can be visibly

Repair: broken workloads are a waste Workloads can be visibly

Resize: adjust capacity as needs change

Resize: adjust capacity as needs change Rightsizing is at the

Screenshot from Kubecost recommending instance types: https://www.kubecost.com

Resize: adjust capacity as needs change Optimize by rightsizing. Ensure

Reserving capacity shows cost can be a bad metric for

Reschedule: even out usage spikes

Reschedule: even out usage spikes Shift demand in time or

Reschedule: even out usage spikes Shift demand in time or

Screenshot from Cloud Carbon Footprint: https://www.cloudcarbonfootprint.org

Repeat: instead of running non-stop

Repeat: instead of running non-stop Identify workloads that can run

Repeat: instead of running non-stop Identify workloads that can run

Refuse Reduce Repair Once again: the 5Rs of Kubernetes Resize

without having a lot of time. Centering environmental sustainability in

Full talk at the GSF Oslo Github Kristina Devochko Platform

CREDITS: This presentation template was created by Slidesgo , including