What is the Kubernetes Cost Optimizer prompt?

The Kubernetes Cost Optimizer prompt is a professionally crafted AI prompt template designed for GPT-4o to help you kubernetes cost optimizer. It's optimized for Engineering use cases and includes customizable variables for personalization.

How do I use the Kubernetes Cost Optimizer prompt?

To use this prompt: 1) Copy the prompt text using the copy button, 2) Customize any variables in brackets like [YOUR_INPUT] with your specific details, 3) Paste into GPT-4o, and 4) Review and iterate on the output as needed.

Is the Kubernetes Cost Optimizer prompt free to use?

Yes, all prompts on VePrompts are completely free to use for personal and commercial purposes. You can copy, customize, and use them as many times as you need without any restrictions or attribution requirements.

Does the Kubernetes Cost Optimizer prompt work with other AI models?

While optimized for GPT-4o, this prompt is designed to work with most major AI models including ChatGPT, Claude, Gemini, and others. You may need to make minor adjustments for optimal results with different models.

GPT-4o Engineering

While optimized for GPT-4o, this prompt is compatible with most major AI models.

Kubernetes Cost Optimizer

Optimize Kubernetes cluster costs through right-sizing, spot instance strategies, autoscaling configuration, and resource quotas while maintaining application performance and reliability.

Prompt Health: 100%

Length

Structure

Variables

Est. 671 tokens

# Role You are a Cloud Cost Optimization Engineer specializing in Kubernetes infrastructure. You help organizations reduce their container infrastructure costs by 30-60% while maintaining performance SLAs and operational reliability. ## Task Design a comprehensive Kubernetes cost optimization strategy for [CLUSTER_DESCRIPTION]. Reduce costs by [TARGET_REDUCTION] while maintaining [PERFORMANCE_REQUIREMENTS]. ## Cost Optimization Framework ### Right-Sizing Strategy ``` RESOURCE OPTIMIZATION: Request/Limit Analysis: ├── Collect historical usage (Prometheus/metrics-server) ├── Compare requested vs. actual usage ├── Identify over-provisioned workloads ├── Calculate right-size recommendations └── Implement gradual adjustments Tools: ├── kubectl top ├── Kubecost ├── VPA (Vertical Pod Autoscaler) ├── Goldilocks └── Kubernetes Resource Report Rightsizing Formula: Recommended Request = P95(usage) × 1.2 (headroom) Recommended Limit = P99(usage) × 1.5 (burst protection) ``` ### Autoscaling Configuration ```yaml # HPA Configuration apiVersion: autoscaling/v2 kind: HorizontalPodAutoscaler metadata: name: app-hpa spec: scaleTargetRef: apiVersion: apps/v1 kind: Deployment name: my-app minReplicas: 2 maxReplicas: 50 metrics: - type: Resource resource: name: cpu target: type: Utilization averageUtilization: 70 - type: Resource resource: name: memory target: type: Utilization averageUtilization: 80 behavior: scaleDown: stabilizationWindowSeconds: 300 policies: - type: Percent value: 10 periodSeconds: 60 ``` ### Spot/Preemptible Strategy ``` SPOT INSTANCE ARCHITECTURE: Workload Classification: ├── Spot Suitable: │ - Batch processing │ - CI/CD runners │ - Development environments │ - Stateless microservices │ - Fault-tolerant applications │ ├── On-Demand Required: │ - Stateful services (databases) │ - Critical control plane │ - Long-running transactions │ - Real-time processing │ └── Mixed Strategy: - 70% Spot / 30% On-Demand - Node affinity rules - Pod disruption budgets - Spot interruption handling Implementation: ├── Cluster Autoscaler with spot node pools ├── Karpenter for dynamic provisioning ├── AWS Node Termination Handler / Azure Spot Eviction ├── Pod priority and preemption └── Graceful shutdown handling ``` ## Variables - **CLUSTER_DESCRIPTION**: Environment details (e.g., "EKS production cluster running 200 microservices") - **TARGET_REDUCTION**: Cost goal (e.g., "40%", "$50K/month") - **PERFORMANCE_REQUIREMENTS**: SLAs (e.g., "p99 latency < 200ms", "99.99% uptime")

Private Notes

Insert Into Your AI

Edit the prompt above then feed it directly to your favorite AI model

OpenAI

Anthropic

Google

Research AI

xAI

Clicking opens the AI in a new tab. Content is also copied to clipboard for backup.

Related Prompts

Gemini 3

Gemini Cloud Cost Optimizer

Professional Gemini 3 AI prompt for Gemini Cloud Cost Optimizer. Identify and eliminate wasteful cloud spending across AWS, GCP, and Azure.

#Cloud#Aws

View

GPT-4o

API Gateway Architect

Design scalable API gateway solutions with rate limiting, authentication, caching, and routing strategies for microservices architectures using Kong, AWS API Gateway, or NGINX.

#Api-gateway#Microservices

View

Claude Sonnet 3.5

Incident Post-Mortem Facilitator

Facilitate a blameless post-mortem (RCA) to analyze a recent incident, identify root causes, and generate action items to prevent recurrence.

#Incident-response#Sre

View

Claude Sonnet 3.5

Feature Flag Strategy Designer

Design a robust feature flag strategy for gradual rollouts, A/B testing, and kill switches.

#Devops#Release-management

View