#kubernetes

5 posts

Kubernetes Resource Requests And Limits: The Numbers That Decide If Your Cluster Is Stable
Most teams set CPU and memory requests by guessing. The result is over-provisioning that wastes money or under-provisioning that causes evictions. Here is the practical method for picking each number, the difference between requests and limits, and why CPU limits are often a mistake.

October 11, 2024
kubernetes devops performance
Service Mesh: When Istio Or Linkerd Earns Its Operational Cost, And When Not
Service mesh promises automatic mTLS, traffic shifting, and observability. The operational cost is real — Istio doubles a cluster's control-plane complexity. Here is the honest framework for whether your team needs a mesh, the lighter alternatives, and the migration that doesn't break production.

April 12, 2024
kubernetes devops distributed-systems
Pod Disruption Budgets: The K8s Object That Keeps Your Service Up During Cluster Maintenance
You set up rolling deploys carefully. Then a node drains during cluster upgrade and takes 80% of your pods at once. PodDisruptionBudget is the manifest that says “never evict more than N at a time.” Three lines of YAML, real production benefits.

January 5, 2024
kubernetes devops reliability
Kubernetes Autoscaling Beyond CPU: The Custom-Metric HPA Pattern That Actually Works
Default HPA scales on CPU, which is wrong for most modern workloads. Memory, queue depth, request rate, and custom business metrics are what actually correlate with “need more pods.” Here is the working setup with custom metrics, the formula HPA uses, and the four mistakes that cause flapping.

September 15, 2023
kubernetes devops reliability
Kubernetes Liveness And Readiness Probes: The Difference That Causes Half Your Outages
Most teams configure liveness and readiness probes identically and wonder why a slow database makes Kubernetes restart their pods in a death spiral. Here is what each probe is actually for, the right endpoint shape for each, and the four-line config that turns an outage into a non-event.

January 20, 2023
kubernetes devops reliability