NVIDIA NIM Cluster Template

Overview

The NVIDIA NIM template provisions a Kubernetes cluster for GPU-accelerated AI/LLM inference using NVIDIA NIM microservices. Use this template when you need to deploy and serve inference with prebuilt, optimized containers and MLOps integration.

Accessing the cluster

Bridge provides two ways to work with your cluster after it is created:

Download kubeconfig — Download the cluster kubeconfig from the cluster menu. Use it to access the cluster from your machine or external tools (e.g., kubectl, IDEs, or CI/CD) by setting KUBECONFIG or merging it into your default kubeconfig.
Kubectl Terminal — Use the Kubectl Terminal in Bridge UI to run kubectl commands, manage resources, and operate the cluster without a separate command-line session.

This guide covers:

Configuring cluster name, version, and CNI
Selecting the NVIDIA NIM template and cluster nodes
Monitoring creation until the cluster is Running
Viewing cluster details, downloading kubeconfig, and using the Kubectl Terminal

Prerequisites

Tenant Admin access — Log in as a Tenant Admin to create clusters.
Compute resources — Bare Metal or Virtual Machine resources allocated to your tenant.

Create an NVIDIA NIM Cluster

Step 1: Start Cluster Creation

Log in to Bridge as a Tenant Admin.
In the left sidebar, open Compute → Kubernetes.
Click Create Cluster.

Create Cluster Button

Step 2: Configure Cluster Details

Select type as Upstream
Enter a name and description for the cluster.
Select the Kubernetes version.
Select the CNI plugin. Bridge supports Flannel and Cilium.
(Optional) Enable Install NVIDIA GPU tools if you want GPU tooling on the cluster.
Click Next.

Cluster Basic Info

Cluster Setup Details

Step 3: Select Cluster Template

Choose Deploy NVIDIA NIM for GPU-accelerated inference.
Click Next.

NVIDIA NIM Template

Step 4: Select Nodes and Create

Select the cluster node(s) (Bare Metal or Virtual Machine).
Click Create to start cluster creation.

Select Cluster Nodes

Step 5: Monitor Cluster Creation

Wait until the cluster status is Running.

Initializing Control Planes — Status shows Processing.
Initializing Workers — Status remains Processing.

Cluster controlplane Processing State

Cluster Processing State

When creation completes, Status shows Running.

Cluster Running State

Step 6: View Cluster Details and Access Tools

Click the cluster name to open the cluster detail view.

Overview — Cluster information, and options to scale the cluster, download kubeconfig, access the dashboard, and delete the cluster. The Nodes tab shows the nodes allocated to the cluster.

Cluster Overview

Kubectl Terminal — Click the Kubectl Terminal (arrow) icon to run kubectl commands from the UI.

Cluster Kubectl Terminal

note

Ensure all pods are in Running state before using the cluster for workloads. In the Kubectl Terminal, run:

kubectl get pods -A

NVIDIA NIM Cluster Template

Overview

Accessing the cluster

Prerequisites

Create an NVIDIA NIM Cluster

Step 1: Start Cluster Creation

Step 2: Configure Cluster Details

Step 3: Select Cluster Template

Step 4: Select Nodes and Create

Step 5: Monitor Cluster Creation

Step 6: View Cluster Details and Access Tools

Step 7: Download Kubeconfig

Next Steps

Overview​

Accessing the cluster​

Prerequisites​

Create an NVIDIA NIM Cluster​

Step 1: Start Cluster Creation​

Step 2: Configure Cluster Details​

Step 3: Select Cluster Template​

Step 4: Select Nodes and Create​

Step 5: Monitor Cluster Creation​

Step 6: View Cluster Details and Access Tools​

Step 7: Download Kubeconfig​

Next Steps​

Overview

Accessing the cluster

Prerequisites

Create an NVIDIA NIM Cluster

Step 1: Start Cluster Creation

Step 2: Configure Cluster Details

Step 3: Select Cluster Template

Step 4: Select Nodes and Create

Step 5: Monitor Cluster Creation

Step 6: View Cluster Details and Access Tools

Step 7: Download Kubeconfig

Next Steps