Integrating VPA with HPA and Karpenter in Production

JR

3 minute read

Adding VPA to HPA and Karpenter requires careful policy configuration, resource allocation adjustments, and monitoring to avoid conflicts and ensure stability.

Problem Context

Running HPA for application scaling and Karpenter for node provisioning introduces dynamic resource management. Adding Vertical Pod Autoscaler (VPA) automates CPU/memory requests but risks over-provisioning, scheduling conflicts, or eviction storms if misconfigured.
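
The core conflict is the two autoscalers acting on the same signal: if HPA scales on CPU utilization while VPA rewrites CPU requests, the utilization ratio HPA reacts to shifts underneath it. A minimal sketch of that overlap check (a hypothetical helper operating on already-parsed manifest data, not part of any of these projects):

```python
# Flag workloads where an HPA scales on a resource metric
# that a VPA on the same workload also manages.
def find_conflicts(hpa_metrics, vpa_resources):
    """hpa_metrics: metric names the HPA scales on, e.g. ["cpu"].
    vpa_resources: resources the VPA may rewrite, e.g. {"cpu", "memory"}.
    Returns the set of resources both controllers act on."""
    return set(hpa_metrics) & set(vpa_resources)

# Scaling the HPA on a custom metric instead avoids the overlap.
print(find_conflicts(["cpu"], {"cpu", "memory"}))                  # overlap on cpu
print(find_conflicts(["requests_per_second"], {"cpu", "memory"}))  # no overlap
```

If the overlap set is non-empty, either switch the HPA to a custom/external metric or exclude that resource from VPA's container policy.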

Actionable Workflow

  1. Audit Existing HPA/Karpenter Behavior

    • Check current HPA metrics and scaling thresholds:
      kubectl get hpa --all-namespaces  
      kubectl describe hpa <hpa-name>  
      
    • Review Karpenter NodePool definitions and scaling limits (on Karpenter versions before v1beta1, these were Provisioners):
      kubectl get nodepools  
      kubectl describe nodepool default  
      
  2. Create VPA Policy with Constraints

    • Choose the update mode deliberately (e.g., "Auto" for stateless workloads, "Off" or "Initial" for stateful ones) to control pod restarts.
    • Example policy (YAML):
      apiVersion: autoscaling.k8s.io/v1  
      kind: VerticalPodAutoscaler  
      metadata:  
        name: my-app-vpa  
      spec:  
        targetRef:  
          apiVersion: apps/v1  
          kind: Deployment  
          name: my-app  
        updatePolicy:  
          updateMode: "Auto"  
        resourcePolicy:  
          containerPolicies:  
          - containerName: "*"  
            minAllowed:  
              cpu: 100m  
              memory: 256Mi  
            maxAllowed:  
              cpu: 2  
              memory: 4Gi  
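
The effect of minAllowed/maxAllowed is simply to clamp whatever the recommender produces into the policy range. As an illustration (not VPA source code), with CPU expressed in millicores:

```python
def clamp(recommended, min_allowed, max_allowed):
    """Bound a per-resource recommendation to the [min_allowed, max_allowed] range,
    mirroring how minAllowed/maxAllowed constrain VPA's output."""
    return max(min_allowed, min(recommended, max_allowed))

# Policy range from the example above: 100m to 2000m (2 cores)
print(clamp(3500, 100, 2000))  # a 3.5-core recommendation is capped to 2000m
print(clamp(50, 100, 2000))    # a tiny recommendation is raised to the 100m floor
```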
      
  3. Adjust Karpenter Node Selectors

    • Ensure Karpenter’s NodePools can provision instance types large enough for VPA’s maxAllowed values; otherwise pods become unschedulable.
    • Example NodePool (Karpenter v1 schema) that taints VPA-managed capacity:
      apiVersion: karpenter.sh/v1  
      kind: NodePool  
      metadata:  
        name: vpa-aware  
      spec:  
        template:  
          spec:  
            requirements:  
            - key: karpenter.sh/capacity-type  
              operator: In  
              values: ["on-demand"]  
            taints:  
            - key: vpa.toleration  
              value: "true"  
              effect: NoSchedule  
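
A quick sanity check that VPA's maxAllowed can actually land on the nodes Karpenter provisions is to compare it against node allocatable capacity. A rough sketch (real scheduling also accounts for daemonsets and system reserves, hence the headroom factor; the node numbers below are illustrative):

```python
def fits_on_node(pod_cpu_m, pod_mem_mi, node_cpu_m, node_mem_mi, headroom=0.9):
    """Check whether a pod's requests (CPU in millicores, memory in Mi) fit
    within a node's allocatable capacity, keeping some headroom free for
    daemonsets and system pods."""
    return (pod_cpu_m <= node_cpu_m * headroom and
            pod_mem_mi <= node_mem_mi * headroom)

# VPA maxAllowed from the example policy: cpu 2 (2000m), memory 4Gi (4096Mi)
# vs. a 4 vCPU / 8 GiB node with illustrative allocatable of ~3920m / ~7000Mi
print(fits_on_node(2000, 4096, 3920, 7000))  # → True
```

Run this against the smallest instance type a NodePool allows; if it returns False, either raise the instance floor or lower maxAllowed.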
      
  4. Monitor and Validate

    • Watch for VPA recommendations and HPA interactions:
      kubectl get vpa --watch  
      kubectl logs -f <vpa-recommender-pod> -n kube-system  
      
    • Check for evictions or scheduling issues:
      kubectl get events --sort-by=.metadata.creationTimestamp  
      kubectl describe pod <problematic-pod>  
      

Tooling

  • kubectl: For direct inspection of VPA/HPA/Karpenter states.
  • Karpenter Dashboard: Visualize node provisioning decisions.
  • Prometheus/Grafana: Alert on sudden resource spikes or evictions.
  • VPA Controller Logs: Debug recommendation cycles.

Tradeoffs and Caveats

  • Eviction Risk: VPA’s Auto mode may trigger unnecessary pod restarts during traffic spikes, conflicting with HPA’s horizontal scaling.
  • Resource Overhead: VPA’s recommendations can exceed what Karpenter’s allowed instance types provide, leaving pods unschedulable until larger nodes are provisioned.
  • Delay in Adaptation: VPA’s recommendation loop (the recommender runs roughly once per minute by default) can lag behind sudden load changes, so HPA should handle short-term spikes.

Troubleshooting Common Issues

  • Symptom: Pods stuck in Terminating
    Cause: VPA updates conflict with node draining
    Fix: Raise the node termination grace period (e.g., NodePool terminationGracePeriod: 5m)
  • Symptom: VPA not updating recommendations
    Cause: Controller not deployed or RBAC misconfigured
    Fix: Verify the VPA components and their permissions: kubectl get pods -n kube-system | grep vpa
  • Symptom: Karpenter ignoring VPA constraints
    Cause: NodePool instance types too small for recommended requests
    Fix: Align allowed instance types with VPA’s maxAllowed values
  • Symptom: HPA scaling down during VPA updates
    Cause: VPA raises requests, lowering per-pod utilization and prompting HPA to scale in
    Fix: Delay scale-down with behavior.scaleDown.stabilizationWindowSeconds: kubectl edit hpa <name>
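
The scale-down fix in the last row refers to HPA v2's behavior field. A hedged sketch (names like my-app-hpa are illustrative) of a stabilization window that keeps HPA from scaling in while VPA is rewriting requests:

```yaml
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: my-app-hpa
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: my-app
  minReplicas: 2
  maxReplicas: 10
  behavior:
    scaleDown:
      stabilizationWindowSeconds: 300  # wait 5m of sustained low load before scaling in
```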

Final Notes

Integrating VPA with HPA and Karpenter works best when:

  • Workloads are stateless or have graceful restart mechanisms.
  • Resource constraints are explicitly set to prevent over-provisioning.
  • Monitoring covers all three controllers to detect conflicts early.

Test in staging with synthetic load before enabling in production.

Source thread: Running HPA + Karpenter. How can I add VPA to the mix?
