Installation
- Before you begin
- Install a released version
- Install a custom-configured released version
- Install the latest development version
- Build and install from source
- Install via Helm
- Change the feature gates configuration
- What’s next
Before you begin
Make sure the following conditions are met:
- A Kubernetes cluster with version 1.25 or newer is running. Learn how to install the Kubernetes tools.
- The
SuspendJob
feature gate is enabled. In Kubernetes 1.22 or newer, the feature gate is enabled by default. - (Optional) The
JobMutableNodeSchedulingDirectives
feature gate (available in Kubernetes 1.22 or newer) is enabled. In Kubernetes 1.23 or newer, the feature gate is enabled by default. - The kubectl command-line tool has communication with your cluster.
Kueue publishes metrics to monitor its operators. You can scrape these metrics with Prometheus. Use kube-prometheus if you don’t have your own monitoring system.
The webhook server in kueue uses an internal cert management for provisioning certificates. If you want to use a third-party one, e.g. cert-manager, follow these steps:
- Set
internalCertManagement.enable
tofalse
in config file. - Comment out the
internalcert
folder inconfig/default/kustomization.yaml
. - Enable
cert-manager
inconfig/default/kustomization.yaml
and uncomment all sections with ‘CERTMANAGER’.
Install a released version
To install a released version of Kueue in your cluster, run the following command:
kubectl apply --server-side -f https://github.com/kubernetes-sigs/kueue/releases/download/v0.8.1/manifests.yaml
To wait for Kueue to be fully available, run:
kubectl wait deploy/kueue-controller-manager -nkueue-system --for=condition=available --timeout=5m
Add metrics scraping for prometheus-operator
To allow prometheus-operator to scrape metrics from kueue components, run the following command:
Note
This feature depends on servicemonitor CRD, please ensure that CRD is installed first.
We can follow https://prometheus-operator.dev/docs/prologue/quick-start/
to install it.
kubectl apply --server-side -f https://github.com/kubernetes-sigs/kueue/releases/download/v0.8.1/prometheus.yaml
Add visibility API to monitor pending workloads
To add the visibility API that enables monitoring pending workloads, change the feature gates configuration and set VisibilityOnDemand=true
, and run the following command
kubectl apply --server-side -f https://github.com/kubernetes-sigs/kueue/releases/download/v0.8.1/visibility-api.yaml
See the visibility API for more details.
Uninstall
To uninstall a released version of Kueue from your cluster, run the following command:
kubectl delete -f https://github.com/kubernetes-sigs/kueue/releases/download/v0.8.1/manifests.yaml
Install a custom-configured released version
To install a custom-configured released version of Kueue in your cluster, execute the following steps:
- Download the release’s
manifests.yaml
file:
wget https://github.com/kubernetes-sigs/kueue/releases/download/v0.8.1/manifests.yaml
- With an editor of your preference, open
manifests.yaml
. - In the
kueue-manager-config
ConfigMap manifest, edit thecontroller_manager_config.yaml
data entry. The entry represents the default KueueConfiguration. The contents of the ConfigMap are similar to the following:
apiVersion: v1
kind: ConfigMap
metadata:
name: kueue-manager-config
namespace: kueue-system
data:
controller_manager_config.yaml: |
apiVersion: config.kueue.x-k8s.io/v1beta1
kind: Configuration
namespace: kueue-system
health:
healthProbeBindAddress: :8081
metrics:
bindAddress: :8080
# enableClusterQueueResources: true
webhook:
port: 9443
manageJobsWithoutQueueName: true
internalCertManagement:
enable: true
webhookServiceName: kueue-webhook-service
webhookSecretName: kueue-webhook-server-cert
waitForPodsReady:
enable: true
timeout: 10m
integrations:
frameworks:
- "batch/job"
The integrations.externalFrameworks
field is available in Kueue v0.7.0 and later.
Note
Certain Kubernetes distributions might use batch/jobs to perform maintenance operations. For these distributions, settingmanageJobsWithoutQueueName
to true
without disabling the
batch/job
integration may prevent system-created jobs from executing.- Apply the customized manifests to the cluster:
kubectl apply --server-side -f manifests.yaml
Install the latest development version
To install the latest development version of Kueue in your cluster, run the following command:
kubectl apply --server-side -k "github.com/kubernetes-sigs/kueue/config/default?ref=main"
The controller runs in the kueue-system
namespace.
Uninstall
To uninstall Kueue, run the following command:
kubectl delete -k "github.com/kubernetes-sigs/kueue/config/default?ref=main"
Build and install from source
To build Kueue from source and install Kueue in your cluster, run the following commands:
git clone https://github.com/kubernetes-sigs/kueue.git
cd kueue
IMAGE_REGISTRY=registry.example.com/my-user make image-local-push deploy
Add metrics scraping for prometheus-operator
To allow prometheus-operator to scrape metrics from kueue components, run the following command:
make prometheus
Uninstall
To uninstall Kueue, run the following command:
make undeploy
Install via Helm
To install and configure Kueue with Helm, follow the instructions.
Change the feature gates configuration
Kueue uses a similar mechanism to configure features as described in Kubernetes Feature Gates.
In order to change the default of a feature, you need to edit the kueue-controller-manager
deployment within the kueue installation namespace and change the manager
container arguments to include
--feature-gates=...,<FeatureName>=<true|false>
For example, to enable PartialAdmission
, you should change the manager deployment as follows:
kind: Deployment
...
spec:
...
template:
...
spec:
containers:
- name: manager
args:
- --config=/controller_manager_config.yaml
- --zap-log-level=2
+ - --feature-gates=PartialAdmission=true
The currently supported features are:
Feature | Default | Stage | Since | Until |
---|---|---|---|---|
FlavorFungibility | true | Beta | 0.5 | |
MultiKueue | false | Alpha | 0.6 | |
MultiKueueBatchJobWithManagedBy | false | Alpha | 0.8 | |
PartialAdmission | false | Alpha | 0.4 | 0.4 |
PartialAdmission | true | Beta | 0.5 | |
ProvisioningACC | false | Alpha | 0.5 | 0.6 |
ProvisioningACC | true | Beta | 0.7 | |
QueueVisibility | false | Alpha | 0.5 | |
VisibilityOnDemand | false | Alpha | 0.6 | |
PrioritySortingWithinCohort | true | Beta | 0.6 | |
LendingLimit | false | Alpha | 0.6 | |
MultiplePreemptions | false | Alpha | 0.8 |
What’s next
- Read the API reference for
Configuration
Feedback
Was this page helpful?
Glad to hear it! Please tell us how we can improve.
Sorry to hear that. Please tell us how we can improve.