NVIDIA NIM Cluster Template
Overview
The NVIDIA NIM template provisions a Kubernetes cluster for GPU-accelerated AI/LLM inference using NVIDIA NIM microservices. Use this template when you need to deploy and serve models for inference from prebuilt, optimized containers with MLOps integration.
Accessing the cluster
Bridge provides two ways to work with your cluster after it is created:
- Download kubeconfig: Download the cluster kubeconfig from the cluster menu. Use it to access the cluster from your machine or external tools (e.g., kubectl, IDEs, or CI/CD) by setting KUBECONFIG or merging it into your default kubeconfig.
- Kubectl Terminal: Use the Kubectl Terminal in the Bridge UI to run kubectl commands, manage resources, and operate the cluster without a separate command-line session.
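The two kubeconfig options above can be sketched in shell as follows. This is a minimal sketch rather than Bridge's documented procedure: the file path and the function names `use_nim_kubeconfig` and `merge_nim_kubeconfig` are hypothetical, and it assumes kubectl is installed on your machine.

```shell
# Hypothetical path: wherever you saved the kubeconfig downloaded from Bridge.
NIM_KUBECONFIG="${NIM_KUBECONFIG:-$HOME/Downloads/nim-cluster-kubeconfig.yaml}"

# Option 1: point kubectl at the downloaded file for this shell session only.
use_nim_kubeconfig() {
  export KUBECONFIG="$NIM_KUBECONFIG"
}

# Option 2: merge the downloaded file into the default kubeconfig.
# kubectl reads every file in a colon-separated KUBECONFIG list, and
# `kubectl config view --flatten` writes the result as one self-contained file.
merge_nim_kubeconfig() {
  default_cfg="$HOME/.kube/config"
  mkdir -p "$HOME/.kube"
  merged_tmp="$(mktemp)" || return 1
  KUBECONFIG="$default_cfg:$NIM_KUBECONFIG" kubectl config view --flatten > "$merged_tmp" || return 1
  # Keep a backup of the previous default before replacing it.
  if [ -f "$default_cfg" ]; then
    cp "$default_cfg" "$default_cfg.bak"
  fi
  mv "$merged_tmp" "$default_cfg"
}
```

After merging, `kubectl config get-contexts` lists the available contexts and `kubectl config use-context <name>` switches between them.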
This guide covers:
- Configuring cluster name, version, and CNI
- Selecting the NVIDIA NIM template and cluster nodes
- Monitoring creation until the cluster is Running
- Viewing cluster details, downloading kubeconfig, and using the Kubectl Terminal
Prerequisites
- Tenant Admin access — Log in as a Tenant Admin to create clusters.
- Compute resources — Bare Metal or Virtual Machine resources allocated to your tenant.
- Port-forward on Bridge node — For cluster creation to succeed, port-forward must be running on the Bridge-deployed node (see note below).
Note: Run the following port-forward command on the Bridge node so that cluster creation can complete:
kubectl -n amcop-system port-forward --address 0.0.0.0 svc/ingress-ingress-nginx-controller 443:443
- Keep this command running during cluster creation.
- If you do not have access to the Bridge node, contact your Bridge Super Administrator.
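Because the port-forward must stay up for the whole creation process, it can help to restart it automatically if it exits. The wrapper below is a sketch under assumptions, not part of Bridge: `keep_port_forward` is a hypothetical helper name, and the restart limit and two-second pause are arbitrary choices. The kubectl command inside it is the one given above.

```shell
# Re-run the port-forward each time it exits, up to a maximum number of runs.
# kubectl port-forward blocks while the tunnel is healthy, so the loop only
# advances when the command stops.
keep_port_forward() {
  max_runs="${1:-5}"   # hypothetical default
  run=0
  while [ "$run" -lt "$max_runs" ]; do
    kubectl -n amcop-system port-forward --address 0.0.0.0 \
      svc/ingress-ingress-nginx-controller 443:443
    run=$((run + 1))
    echo "port-forward exited (run $run of $max_runs)" >&2
    sleep 2
  done
}
```

For example, run `keep_port_forward 10` in a terminal on the Bridge node and stop it with Ctrl+C once the cluster status is Running.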
Create an NVIDIA NIM Cluster
Step 1: Start Cluster Creation
- Log in to Armada Bridge as a Tenant Admin.
- In the left sidebar, open Compute → Kubernetes.
- Click Create Cluster.

Step 2: Configure Cluster Details
- Select Upstream as the cluster type.
- Enter a name and description for the cluster.
- Select the Kubernetes version.
- Select the CNI plugin. Bridge supports Flannel and Cilium.
- (Optional) Enable Install NVIDIA GPU tools if you want GPU tooling on the cluster.
- Click Next.

Step 3: Select Cluster Template
- Choose Deploy NVIDIA NIM for GPU-accelerated inference.
- Click Next.

Step 4: Select Nodes and Create
- Select the cluster node(s) (Bare Metal or Virtual Machine).
- Click Create to start cluster creation.

Step 5: Monitor Cluster Creation
Wait until the cluster status is Running.
- Initializing Control Planes — Status shows Processing.
- Initializing Workers — Status remains Processing.

- When creation completes, Status shows Running.

Step 6: View Cluster Details and Access Tools
Click the cluster name to open the cluster detail view.
- Overview: Cluster information, and options to scale the cluster, download kubeconfig, access the dashboard, and delete the cluster. The Nodes tab shows the nodes allocated to the cluster.

- Kubectl Terminal: Click the Kubectl Terminal (arrow) icon to run kubectl commands from the UI.

Before running workloads on the cluster, confirm that all pods are in the Running state. In the Kubectl Terminal, run:
kubectl get pods -A
- GPUs: Click the GPUs tab to view GPU details for the cluster.
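The pod check from the Kubectl Terminal item above can be narrowed to show only the pods that still need attention. This is a minimal sketch for a shell with kubectl and awk available: `pods_not_ready` is a hypothetical helper name, and treating Completed pods as healthy is an assumption (pods from finished Jobs never return to Running).

```shell
# Print pods whose STATUS column is neither Running nor Completed.
# With -A and --no-headers the columns are:
#   NAMESPACE NAME READY STATUS RESTARTS AGE
pods_not_ready() {
  kubectl get pods -A --no-headers |
    awk '$4 != "Running" && $4 != "Completed" { print $1 "/" $2 ": " $4 }'
}
```

An empty result from `pods_not_ready` suggests every pod has settled; any line it prints names a namespace/pod pair and its current status.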

Step 7: Download Kubeconfig
- Click the menu (ellipsis) icon for the cluster.
- Select Kubeconfig to download the cluster kubeconfig file.
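Once the file is downloaded, a quick connectivity check confirms that it works from your machine. The sketch below assumes kubectl is installed locally. `verify_kubeconfig` is a hypothetical helper name, and the path you pass to it is wherever your browser saved the file.

```shell
# Check that the downloaded kubeconfig exists, then list the cluster nodes
# with it. --kubeconfig applies the file to this one command only.
verify_kubeconfig() {
  cfg="$1"
  if [ ! -s "$cfg" ]; then
    echo "kubeconfig not found or empty: $cfg" >&2
    return 1
  fi
  kubectl --kubeconfig "$cfg" get nodes
}
```

For example, `verify_kubeconfig ~/Downloads/<downloaded-file>` should list the nodes you selected in Step 4 if the cluster is reachable from your network.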
