mirror of https://github.com/NVIDIA/dgx-spark-playbooks.git synced 2026-04-25 11:23:52 +00:00

github-actions[bot] 4d0d20d39f chore: regenerate skills/ from upstream playbooks [skip ci]

2026-04-19 09:25:00 +00:00

1.9 KiB

Raw Blame History

name	description
dgx-spark-nim-llm	Deploy a NIM on Spark — on NVIDIA DGX Spark. Use when setting up nim-llm on Spark hardware.

NIM on Spark

Deploy a NIM on Spark

NVIDIA NIM is containerized software for fast, reliable AI model serving and inference on NVIDIA GPUs. This playbook demonstrates how to run NIM microservices for LLMs on DGX Spark devices, enabling local GPU inference through a simple Docker workflow. You'll authenticate with NVIDIA's registry, launch the NIM inference microservice, and perform basic inference testing to verify functionality.

What you'll accomplish

You'll launch a NIM container on your DGX Spark device to expose a GPU-accelerated HTTP endpoint for text completions. While these instructions feature working with the Llama 3.1 8B NIM, additional NIM including the Qwen3-32 NIM are available for DGX Spark (see them here).

What to know before starting

Outcome: You'll launch a NIM container on your DGX Spark device to expose a GPU-accelerated HTTP endpoint for text completions. While these instructions feature working with the Llama 3.1 8B NIM, additional NIM including the Qwen3-32 NIM are available for DGX Spark (see them here).

What to know before starting

Working in a terminal environment
Using Docker commands and GPU-enabled containers
Basic familiarity with REST APIs and curl commands
Understanding of NVIDIA GPU environments and CUDA

Full playbook: /home/runner/work/dgx-spark-playbooks/dgx-spark-playbooks/nvidia/nim-llm/README.md

1.9 KiB Raw Blame History

NIM on Spark

What you'll accomplish

What to know before starting

What to know before starting

1.9 KiB

Raw Blame History