NVIDIA NIM Cluster Template
Overview
The NVIDIA NIM template provisions a Kubernetes cluster for GPU-accelerated AI/LLM inference using NVIDIA NIM microservices. Use this template when you need to deploy and serve inference with prebuilt, optimized containers and MLOps integration.
Accessing the cluster
Bridge provides two ways to work with your cluster after it is created:
-
Download kubeconfig — Download the cluster kubeconfig from the cluster menu. Use it to access the cluster from your machine or external tools (e.g.,
kubectl, IDEs, or CI/CD) by settingKUBECONFIGor merging it into your default kubeconfig. -
Kubectl Terminal — Use the Kubectl Terminal in Bridge UI to run kubectl commands, manage resources, and operate the cluster without a separate command-line session.
This guide covers:
- Configuring cluster name, version, and CNI
- Selecting the NVIDIA NIM template and cluster nodes
- Monitoring creation until the cluster is Running
- Viewing cluster details, downloading kubeconfig, and using the Kubectl Terminal
Prerequisites
- Tenant Admin access — Log in as a Tenant Admin to create clusters.
- Compute resources — Bare Metal or Virtual Machine resources allocated to your tenant.
Create an NVIDIA NIM Cluster
Step 1: Start Cluster Creation
- Log in to Bridge as a Tenant Admin.
- In the left sidebar, open Compute → Kubernetes.
- Click Create Cluster.

Step 2: Configure Cluster Details
- Select type as Upstream
- Enter a name and description for the cluster.
- Select the Kubernetes version.
- Select the CNI plugin. Bridge supports Flannel and Cilium.
- (Optional) Enable Install NVIDIA GPU tools if you want GPU tooling on the cluster.
- Click Next.


Step 3: Select Cluster Template
- Choose Deploy NVIDIA NIM for GPU-accelerated inference.
- Click Next.

Step 4: Select Nodes and Create
- Select the cluster node(s) (Bare Metal or Virtual Machine).
- Click Create to start cluster creation.

Step 5: Monitor Cluster Creation
Wait until the cluster status is Running.
- Initializing Control Planes — Status shows Processing.
- Initializing Workers — Status remains Processing.


- When creation completes, Status shows Running.

Step 6: View Cluster Details and Access Tools
Click the cluster name to open the cluster detail view.
- Overview — Cluster information, and options to scale the cluster, download kubeconfig, access the dashboard, and delete the cluster. The Nodes tab shows the nodes allocated to the cluster.

- Kubectl Terminal — Click the Kubectl Terminal (arrow) icon to run kubectl commands from the UI.

Ensure all pods are in Running state before using the cluster for workloads. In the Kubectl Terminal, run:
kubectl get pods -A
- GPUs — Click the GPUs tab to view GPU details for the cluster.

Step 7: Download Kubeconfig
- Click the menu (ellipsis) icon for the cluster.
- Select Kubeconfig to download the cluster kubeconfig file.
