As part of our client's migration to a Kubernetes (K8S) cluster, the client had reached its compute allocation limit with over 6,000 cores consumed. This created a bottleneck: there wasn’t enough capacity to migrate an additional 150 target pipelines, and job failures increased due to resource constraints.
We performed in-depth analysis of reserved CPU limits versus actual CPU requests across the cluster. Based on those insights, we implemented targeted optimizations to align resource reservations more closely with actual usage.