Kubernetes v1.34, codenamed “Of Wind & Will (O’ WaW)”, brings a wide range of enhancements aimed at making clusters more efficient, secure, and easier to manage.
This release delivers 58 enhancements: 23 graduating to Stable, 22 entering Beta, and 13 in Alpha. The mix reflects the platform’s continued maturation as enterprises scale their container orchestration needs. The release focuses on hardening existing capabilities rather than introducing radical new features, a clear signal of Kubernetes’ evolution toward enterprise-grade reliability.
The standout theme is resource optimization for AI/ML workloads, particularly around GPU management. Dynamic Resource Allocation (DRA) graduating to General Availability matters most to organizations struggling with GPU scarcity and utilization. As GPUs become scarcer and more expensive across cloud providers, DRA’s support for GPU sharing, partial allocation, and device-aware scheduling addresses one of today’s most pressing infrastructure bottlenecks.
Behind the scenes, the “boring” performance improvements are equally critical: streaming API responses, watch cache optimizations, and scheduler throughput enhancements keep massive enterprise clusters running smoothly under load. Notably, this version introduces no breaking changes, greatly reducing upgrade risks for production environments.
Here’s a breakdown of the most notable new features for cluster operators. You can also check out a breakdown of the previous Kubernetes release (v1.33) by our own Udi Hofesh to compare the two and see the velocity and momentum of the world’s second-largest open-source project after Linux.
Dynamic Resource Allocation (GA). The Dynamic Resource Allocation (DRA) framework has graduated to General Availability. This lets workloads request hardware devices such as GPUs or FPGAs through Kubernetes APIs in a way that mirrors how Persistent Volumes are provisioned. The resource.k8s.io/v1 APIs (ResourceClaim, DeviceClass, ResourceClaimTemplate, ResourceSlice) are now stable, giving operators a consistent way to expose and manage device resources across clusters.
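As a rough illustration of the claim-based model, the sketch below builds a ResourceClaimTemplate and a Pod that references it, then prints both as JSON, which kubectl apply -f accepts. The device class name gpu.example.com, the object names, and the image are hypothetical placeholders. A real cluster also needs a DRA driver that publishes a matching DeviceClass.

```python
import json

# Hypothetical names throughout: "gpu.example.com", "single-gpu", "trainer",
# and the image are placeholders, not values taken from the release notes.
claim_template = {
    "apiVersion": "resource.k8s.io/v1",
    "kind": "ResourceClaimTemplate",
    "metadata": {"name": "single-gpu"},
    "spec": {
        "spec": {
            "devices": {
                "requests": [
                    # Ask for exactly one device from the named DeviceClass.
                    {"name": "gpu", "exactly": {"deviceClassName": "gpu.example.com"}}
                ]
            }
        }
    },
}

pod = {
    "apiVersion": "v1",
    "kind": "Pod",
    "metadata": {"name": "trainer"},
    "spec": {
        # Pod-level claim, generated per Pod from the template above.
        "resourceClaims": [
            {"name": "gpu", "resourceClaimTemplateName": "single-gpu"}
        ],
        "containers": [
            {
                "name": "train",
                "image": "registry.example.com/train:latest",
                # The container opts in to the Pod-level claim by name.
                "resources": {"claims": [{"name": "gpu"}]},
            }
        ],
    },
}

manifests = {"apiVersion": "v1", "kind": "List", "items": [claim_template, pod]}
print(json.dumps(manifests, indent=2))
```

The Pod never names a specific device or node. As with a PersistentVolumeClaim, it states what it needs and leaves the matching to the scheduler and the driver.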
Linux Swap Support (GA). Kubernetes now officially supports swap on Linux nodes. In LimitedSwap mode, Pods can use swap within their memory requests, which helps avoid abrupt evictions when a node runs out of RAM. By default, swap remains disabled (NoSwap), but operators can enable it to improve stability for workloads with infrequently accessed memory.
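Enabling swap is a kubelet-level setting. The sketch below prints a minimal kubelet configuration fragment with LimitedSwap turned on, assuming the node already has swap provisioned at the OS level. It is emitted as JSON, which is also valid YAML, and would need to be merged into the node's existing kubelet configuration rather than used as a complete file.

```python
import json

# Minimal fragment only; a real kubelet config carries many more settings.
kubelet_config = {
    "apiVersion": "kubelet.config.k8s.io/v1beta1",
    "kind": "KubeletConfiguration",
    # Allow the kubelet to start on a node where swap is enabled.
    "failSwapOn": False,
    # Opt in to swap; the default behavior is NoSwap.
    "memorySwap": {"swapBehavior": "LimitedSwap"},
}

print(json.dumps(kubelet_config, indent=2))
```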
Pod-Level Resource Requests and Limits (Beta). Instead of only defining CPU/memory requests per container, you can now set them at the Pod level. This allows for a “shared budget” across all containers, reducing over-provisioning and improving scheduling accuracy. The Horizontal Pod Autoscaler has also been updated to recognize these new Pod-scoped requests and limits.
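To make the “shared budget” idea concrete, here is a small sketch of a Pod manifest that declares requests and limits once, under spec.resources, and leaves the individual containers without their own values. The Pod name, container names, images, and quantities are made up for illustration.

```python
import json

# Hypothetical workload: an app container plus a logging sidecar that
# draw from one Pod-wide CPU and memory budget.
pod = {
    "apiVersion": "v1",
    "kind": "Pod",
    "metadata": {"name": "shared-budget-demo"},
    "spec": {
        # Pod-level requests and limits, shared by every container below.
        "resources": {
            "requests": {"cpu": "1", "memory": "1Gi"},
            "limits": {"cpu": "2", "memory": "2Gi"},
        },
        "containers": [
            {"name": "app", "image": "registry.example.com/app:latest"},
            {"name": "log-shipper", "image": "registry.example.com/logs:latest"},
        ],
    },
}

print(json.dumps(pod, indent=2))
```

With per-container settings, the sidecar would need its own worst-case reservation. Here, whatever the sidecar does not use stays available to the app container within the same Pod-level limit.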
DRA Extensions (Beta). Several DRA features have entered Beta:
Storage Improvements. Two storage features are now GA:
Short-Lived ServiceAccount Tokens for Image Pulls (Beta). Pods can now pull images using short-lived, projected ServiceAccount tokens instead of long-lived node credentials. This reduces the attack surface and ensures tokens are rotated automatically per workload.
External ServiceAccount Token Signing (Beta). The API server can delegate ServiceAccount token signing to an external key management system via a gRPC interface. This improves compliance and security by centralizing signing in HSMs or cloud KMS services.
Authorization by Object Selector (GA). You can now enforce authorization policies that only apply if a specific label or field selector is provided. For example, operators can permit listing Pods only when requests are scoped to certain labels.
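The “list Pods only when scoped by label” example could be implemented in an authorization webhook along the following lines. This is a sketch of the decision logic only, not a complete webhook server: it assumes the request arrives as a SubjectAccessReview-shaped dictionary whose resourceAttributes carry the parsed labelSelector requirements, and the label key team and its permitted values are hypothetical.

```python
REQUIRED_KEY = "team"                    # hypothetical label key
ALLOWED_VALUES = {"payments", "search"}  # hypothetical permitted values


def is_scoped(requirements):
    """True if some requirement pins REQUIRED_KEY to permitted values only."""
    for req in requirements:
        values = req.get("values") or []
        if (
            req.get("key") == REQUIRED_KEY
            and req.get("operator") == "In"
            and values
            and set(values) <= ALLOWED_VALUES
        ):
            return True
    return False


def decide(review):
    """Return a status stanza for a SubjectAccessReview-shaped dict."""
    attrs = review.get("spec", {}).get("resourceAttributes") or {}
    if attrs.get("resource") != "pods" or attrs.get("verb") != "list":
        return {"allowed": False, "reason": "outside the scope of this sketch"}
    selector = attrs.get("labelSelector") or {}
    if is_scoped(selector.get("requirements") or []):
        return {"allowed": True}
    return {"allowed": False, "reason": "listing pods requires a team label selector"}


# Usage: one request scoped to team=payments, one with no selector at all.
scoped = {
    "spec": {
        "resourceAttributes": {
            "verb": "list",
            "resource": "pods",
            "labelSelector": {
                "requirements": [
                    {"key": "team", "operator": "In", "values": ["payments"]}
                ]
            },
        }
    }
}
unscoped = {"spec": {"resourceAttributes": {"verb": "list", "resource": "pods"}}}

print(decide(scoped))
print(decide(unscoped))
```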
Restrict Anonymous Requests (GA). Instead of the old all-or-nothing model, you can now explicitly define which API endpoints are allowed anonymously (such as /healthz). This tightens cluster security while still supporting basic unauthenticated probes.
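A configuration along these lines is one way to express that allowlist. The sketch prints an AuthenticationConfiguration, the file passed to the API server through --authentication-config, that keeps anonymous access enabled only for the health endpoints. The apiVersion shown is an assumption; check which versions your API server accepts before using it.

```python
import json

# Paths that unauthenticated probes are allowed to reach.
HEALTH_PATHS = ["/healthz", "/livez", "/readyz"]

auth_config = {
    # Assumed version string; newer API servers may also serve a v1 of this kind.
    "apiVersion": "apiserver.config.k8s.io/v1beta1",
    "kind": "AuthenticationConfiguration",
    "anonymous": {
        "enabled": True,
        # Anonymous requests are accepted only for these paths.
        "conditions": [{"path": p} for p in HEALTH_PATHS],
    },
}

print(json.dumps(auth_config, indent=2))
```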
Mutating Admission Policies (Beta). Operators no longer need to run custom webhooks for request mutation. With this new feature, policies can be written declaratively using CEL and applied directly in the API server, simplifying admission control and reducing operational overhead.
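As a hedged sketch of what such a declarative policy might look like, the code below builds a MutatingAdmissionPolicy that adds a label to newly created Pods, plus the binding that activates it. The policy name, the label, and the CEL expression are illustrative assumptions and have not been validated against a live API server; treat the field layout as a starting point to check against the API reference for your cluster version.

```python
import json

POLICY_NAME = "add-managed-by-label.example.com"  # hypothetical name

policy = {
    "apiVersion": "admissionregistration.k8s.io/v1beta1",
    "kind": "MutatingAdmissionPolicy",
    "metadata": {"name": POLICY_NAME},
    "spec": {
        # Apply only to Pod creation requests.
        "matchConstraints": {
            "resourceRules": [
                {
                    "apiGroups": [""],
                    "apiVersions": ["v1"],
                    "operations": ["CREATE"],
                    "resources": ["pods"],
                }
            ]
        },
        "failurePolicy": "Fail",
        "reinvocationPolicy": "Never",
        "mutations": [
            {
                "patchType": "ApplyConfiguration",
                "applyConfiguration": {
                    # Illustrative CEL: merge one label into the object's metadata.
                    "expression": (
                        'Object{metadata: Object.metadata{'
                        'labels: {"managed-by": "platform"}}}'
                    )
                },
            }
        ],
    },
}

binding = {
    "apiVersion": "admissionregistration.k8s.io/v1beta1",
    "kind": "MutatingAdmissionPolicyBinding",
    "metadata": {"name": "add-managed-by-label-binding"},
    "spec": {"policyName": POLICY_NAME},
}

print(json.dumps({"apiVersion": "v1", "kind": "List", "items": [policy, binding]}, indent=2))
```

The equivalent webhook would need a deployed service, TLS certificates, and its own availability story. Here the whole mutation is two API objects.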
Pod Security Updates.
Job Pod Replacement (GA). A new .spec.podReplacementPolicy allows Jobs to create replacement Pods only after the previous one has fully terminated, avoiding resource contention and unintended parallel execution.
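For reference, a Job that opts in to this behavior might look like the sketch below, where podReplacementPolicy is set to Failed so that a replacement is created only once the old Pod has reached a terminal phase. The Job name and image are placeholders.

```python
import json

job = {
    "apiVersion": "batch/v1",
    "kind": "Job",
    "metadata": {"name": "exclusive-worker"},  # hypothetical name
    "spec": {
        # Wait until the previous Pod has fully terminated (phase Failed)
        # before creating its replacement.
        "podReplacementPolicy": "Failed",
        "completions": 1,
        "parallelism": 1,
        "template": {
            "spec": {
                "restartPolicy": "Never",
                "containers": [
                    {"name": "work", "image": "registry.example.com/work:latest"}
                ],
            }
        },
    },
}

print(json.dumps(job, indent=2))
```

This matters most for workloads that hold an exclusive resource, such as a GPU or a lock, where an old Pod that is still terminating and its replacement must never overlap.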
Namespace Deletion Ordering (GA). Resources within a namespace are now deleted in a deterministic order. Pods are terminated before their associated policies (like NetworkPolicies) are removed, preventing security gaps and addressing issues like CVE-2024-7598.
Scheduler Improvements.
API Server Performance.
KYAML (Alpha). Kubernetes now supports KYAML, a restricted YAML dialect that eliminates many common YAML pitfalls (e.g., ambiguous parsing). You can try it by setting KUBECTL_KYAML=true and using kubectl get -o kyaml.
Kubectl Preferences (.kuberc, Beta). Users can now configure CLI preferences in a .kuberc file (separate from kubeconfig). This allows operators to define default flags, output formats, and aliases.
Per-Container Restart Policies (Alpha). Pods can now specify different restart behaviors per container, including exit code–based rules. This enables finer-grained fault handling, especially for multi-container workloads.
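A minimal sketch of what this could look like in a Pod spec follows. One container is restarted only when it exits with code 42, while a second container carries its own restart policy. The names, images, and exit code are made up, and because the feature is Alpha it has to be enabled through its feature gate (assumed here to be ContainerRestartRules) before the API server will accept these fields.

```python
import json

pod = {
    "apiVersion": "v1",
    "kind": "Pod",
    "metadata": {"name": "restart-rules-demo"},  # hypothetical name
    "spec": {
        # Pod-wide default: do not restart containers.
        "restartPolicy": "Never",
        "containers": [
            {
                "name": "worker",
                "image": "registry.example.com/worker:latest",
                "restartPolicy": "Never",
                # Exception: restart in place if the exit code is 42
                # (for example, a retriable error the app signals itself).
                "restartPolicyRules": [
                    {
                        "action": "Restart",
                        "exitCodes": {"operator": "In", "values": [42]},
                    }
                ],
            },
            {
                "name": "helper",
                "image": "registry.example.com/helper:latest",
                # This container overrides the Pod-wide default on its own.
                "restartPolicy": "OnFailure",
            },
        ],
    },
}

print(json.dumps(pod, indent=2))
```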
Other Notables. Environment variable names are now less restricted (all printable ASCII characters except = are valid), and the Taint Manager has been refactored into its own controller to improve reliability.
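The relaxed naming rule is simple enough to express as a small check. The function below is an illustration of the rule as described here, assuming “printable” means the ASCII range 0x20 to 0x7E. It is not the validation code Kubernetes itself runs.

```python
def is_valid_env_name(name: str) -> bool:
    """Check a name against the relaxed rule: non-empty, printable ASCII, no '='."""
    if not name:
        return False
    return all(0x20 <= ord(ch) <= 0x7E and ch != "=" for ch in name)


# Usage: names with dots and dashes pass; '=', empty, and non-ASCII names do not.
for candidate in ["DB_HOST", "my.app-config", "a=b", "", "caf\u00e9"]:
    print(repr(candidate), is_valid_env_name(candidate))
```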
Kubernetes v1.34 focuses on maturing long-requested features while strengthening the security and reliability of cluster operations. With Dynamic Resource Allocation, swap support, pod-level resource limits, and significant scheduling and API server improvements, operators have new levers to optimize workloads at scale. Meanwhile, the security updates, ranging from short-lived tokens to external key signing, offer stronger guarantees in multi-tenant and regulated environments. For the full set of details, you can review the official release blog and the release notes.