mirror of
https://github.com/NVIDIA/dgx-spark-playbooks.git
synced 2026-04-23 18:33:54 +00:00
chore: Regenerate all playbooks
This commit is contained in:
parent
6489f7362f
commit
85089bc1c2
@ -44,9 +44,9 @@ completions.
|
|||||||
```bash
|
```bash
|
||||||
nvidia-smi
|
nvidia-smi
|
||||||
```
|
```
|
||||||
- [ ] Docker with NVIDIA Container Toolkit configured
|
- [ ] Docker with NVIDIA Container Toolkit configured, instructions here: https://******.nvidia.com/dgx-docs/review/621/dgx-spark/latest/nvidia-container-runtime-for-docker.html
|
||||||
```bash
|
```bash
|
||||||
docker run --rm --gpus all nvidia/cuda:12.0-base-ubuntu20.04 nvidia-smi
|
docker run -it --gpus=all nvcr.io/nvidia/cuda:13.0.1-devel-ubuntu24.04 nvidia-smi
|
||||||
```
|
```
|
||||||
- [ ] NGC account with API key from https://ngc.nvidia.com/setup/api-key
|
- [ ] NGC account with API key from https://ngc.nvidia.com/setup/api-key
|
||||||
```bash
|
```bash
|
||||||
@ -94,8 +94,6 @@ echo "$NGC_API_KEY" | docker login nvcr.io --username '$oauthtoken' --password-s
|
|||||||
|
|
||||||
Choose a specific LLM NIM from NGC and set up local caching for model assets.
|
Choose a specific LLM NIM from NGC and set up local caching for model assets.
|
||||||
|
|
||||||
> TODO: Replace with actual available NIM container image from NGC catalog
|
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
export CONTAINER_NAME="nim-llm-demo"
|
export CONTAINER_NAME="nim-llm-demo"
|
||||||
export IMG_NAME="nvcr.io/nim/meta/llama-3.1-8b-instruct-dgx-spark:latest"
|
export IMG_NAME="nvcr.io/nim/meta/llama-3.1-8b-instruct-dgx-spark:latest"
|
||||||
@ -127,7 +125,6 @@ startup messages indicating the service is ready.
|
|||||||
|
|
||||||
Test the deployed service with a basic completion request to verify functionality. Run the following curl command in a new terminal.
|
Test the deployed service with a basic completion request to verify functionality. Run the following curl command in a new terminal.
|
||||||
|
|
||||||
> TODO: Replace NIM_MODEL with actual model identifier from the container
|
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
curl -X 'POST' \
|
curl -X 'POST' \
|
||||||
@ -158,25 +155,6 @@ curl -X 'POST' \
|
|||||||
|
|
||||||
Expected output should be a JSON response containing a completion field with generated text.
|
Expected output should be a JSON response containing a completion field with generated text.
|
||||||
|
|
||||||
## Step 6. Test additional functionality
|
|
||||||
|
|
||||||
Perform extended validation with different prompts and parameters.
|
|
||||||
|
|
||||||
> TODO: Add tool calling examples if supported by selected model
|
|
||||||
|
|
||||||
```bash
|
|
||||||
curl -X 'POST' \
|
|
||||||
'http://0.0.0.0:8000/v1/completions' \
|
|
||||||
-H 'accept: application/json' \
|
|
||||||
-H 'Content-Type: application/json' \
|
|
||||||
-d '{
|
|
||||||
"model": "<NIM_MODEL>",
|
|
||||||
"prompt": "Explain quantum computing in simple terms:",
|
|
||||||
"max_tokens": 128,
|
|
||||||
"temperature": 0.7
|
|
||||||
}'
|
|
||||||
```
|
|
||||||
|
|
||||||
## Step 6. Troubleshooting
|
## Step 6. Troubleshooting
|
||||||
|
|
||||||
| Symptom | Cause | Fix |
|
| Symptom | Cause | Fix |
|
||||||
@ -185,6 +163,7 @@ curl -X 'POST' \
|
|||||||
| "Invalid credentials" during docker login | Incorrect NGC API key format | Verify API key from NGC portal, ensure no extra whitespace |
|
| "Invalid credentials" during docker login | Incorrect NGC API key format | Verify API key from NGC portal, ensure no extra whitespace |
|
||||||
| Model download hangs or fails | Network connectivity or insufficient disk space | Check internet connection and available disk space in cache directory |
|
| Model download hangs or fails | Network connectivity or insufficient disk space | Check internet connection and available disk space in cache directory |
|
||||||
| API returns 404 or connection refused | Container not fully started or wrong port | Wait for container startup completion, verify port 8000 is accessible |
|
| API returns 404 or connection refused | Container not fully started or wrong port | Wait for container startup completion, verify port 8000 is accessible |
|
||||||
|
| runtime not found | NVIDIA Container Toolkit not properly configured | Run `sudo nvidia-ctk runtime configure --runtime=docker` and restart Docker |
|
||||||
|
|
||||||
## Step 8. Cleanup and rollback
|
## Step 8. Cleanup and rollback
|
||||||
|
|
||||||
|
|||||||
Loading…
Reference in New Issue
Block a user