Pod Autoscaling

Autoscaling is a widely accepted practice in cloud computing which entails automatically adjusting the amount of computational resources based on load.

Autoscaling automates resource management and as such has a significant impact on cluster costs.

Well configured autoscaling can make your cluster lean, efficient and reliable.

Badly configured autoscaling can generate waste and reduce availability.

Kubernetes comes with add-ons that allow us to autoscale pods in 2 distinct ways:

Horizontally

By changing the number of pod replicas in a workload (Deployment, StatefulSet)
Vertically

By changing the amount of resources (CPU, memory) allocated to each individual pod in a workload.

Horizontal Pod Autoscaling

Horizontal pod autoscaling is enabled by the following add-ons:

(Follow the links for autoscaling optimization recommendations)

HPA
KEDA

Vertical Pod Autoscaling

Horizontal pod autosscaling is enabled by the following add-ons:

(Follow the links for autoscaling optimization recommendations)

Brought to you by PerfectScale