Horizontal Pod Autoscaling with Kubernetes using an external metric (cluster worker node count)
In the world of Kubernetes and microservices, you might have heard of two types of scaling:
1 - Vertical (adding more power to existing nodes, i.e. more CPU and memory)
2 - Horizontal (adding more instances of a resource, such as worker nodes, to handle the demand)
In this article, we will look at an example of horizontal scaling using the Kubernetes HorizontalPodAutoscaler (HPA), which autoscales a Kubernetes deployment based on some underlying metric. In this case the resource being autoscaled is a Kubernetes pod: its replica count will increase or decrease with demand.
Requirement: consider a deployment running an nginx container that we need to scale based on the number of worker nodes in the cluster. If there are 2 worker nodes, nginx should scale to 2 replicas; with 3 worker nodes, 3 replicas; and so on.
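For context, the nginx deployment being scaled might look like the following. This is a minimal sketch; the name `nginx` and namespace `kube-system` are assumed to match the HPA definition below, and the label names are illustrative:

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: nginx
  namespace: kube-system
spec:
  replicas: 2            # the HPA will take over managing this field
  selector:
    matchLabels:
      app: nginx
  template:
    metadata:
      labels:
        app: nginx
    spec:
      containers:
      - name: nginx
        image: nginx:stable
        ports:
        - containerPort: 80
```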
A typical HPA definition looks like this:
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: nginx-hpa
  namespace: kube-system
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: nginx
  minReplicas: 2
  maxReplicas: 3
  metrics:
  - type: External
    external:
      metric:
        name: node_scale_indicator
      target:
        type: Value
        value: 2
where:
- scaleTargetRef: the target deployment that needs to be autoscaled (nginx)
- minReplicas / maxReplicas: the lower and upper bounds on the replica count the HPA will manage
- metrics.type: the type of metric the HPA bases its autoscaling decision on. Here we use an External metric; in this example it comes from kube-state-metrics.
- metric.name: the name of the external metric
- target.value: if the metric's current value is higher or lower than this target, the HPA scales up or down respectively.
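Under the hood, for a Value-type target the HPA controller uses the standard scaling formula desiredReplicas = ceil(currentReplicas * currentMetricValue / targetValue), clamped to the min/max bounds from the spec. A minimal sketch of that calculation (the function name and argument names are illustrative, not part of any Kubernetes API):

```python
import math

def desired_replicas(current_replicas, metric_value, target_value,
                     min_replicas, max_replicas):
    # Core HPA formula: scale proportionally to how far the metric
    # is above or below the target.
    desired = math.ceil(current_replicas * metric_value / target_value)
    # Clamp to the minReplicas/maxReplicas bounds in the HPA spec.
    return max(min_replicas, min(desired, max_replicas))

# With target value 2 (as in the HPA above): when the node-count metric
# reads 3 and we currently run 2 replicas, the HPA scales up to 3.
print(desired_replicas(2, 3, 2, 2, 3))  # -> 3
```

This is why, with the target set to 2, the replica count tracks the worker node count within the 2-to-3 bounds: the ratio metric/target rises above 1 as nodes are added, and the controller scales the deployment up accordingly.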
How HPA works / Requirements for HPA to work
1. If we want to use the HPA based on resource (CPU and memory) metrics: