Merge f75d5817aa into 51615570a7

chore: Regenerate all playbooks
nemotron reqs --trust-remote-code for vllm setup
2026-06-20 21:29:31 +00:00 · 2026-05-22 09:17:56 +08:00 · 2026-05-18 17:50:57 +00:00 · 2026-01-26 08:16:23 -08:00
1 changed file with 3 additions and 3 deletions
--- a/nvidia/vllm/README.md
+++ b/nvidia/vllm/README.md
@@ -85,7 +85,7 @@ The following models are supported with vLLM on Spark. All listed models are ava
 | **Nemotron3-Nano** | FP8 | ✅ | [`nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8`](https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8) |

 > [!NOTE]
-> The Phi-4-multimodal-instruct models require `--trust-remote-code` when launching vLLM.
+> The Phi-4-multimodal-instruct and Nemotron3-Nano models require `--trust-remote-code` when launching vLLM.

 > [!NOTE]
 > You can use the NVFP4 Quantization documentation to generate your own NVFP4-quantized checkpoints for your favorite models. This enables you to take advantage of the performance and memory benefits of NVFP4 quantization even for models not already published by NVIDIA.
@@ -218,7 +218,7 @@ Obtain the vLLM cluster deployment script on both nodes. This script orchestrate

 ```bash
 ## Download on both nodes
-wget https://raw.githubusercontent.com/vllm-project/vllm/refs/heads/main/examples/online_serving/run_cluster.sh
+wget https://raw.githubusercontent.com/vllm-project/vllm/refs/heads/main/examples/ray_serving/run_cluster.sh
 chmod +x run_cluster.sh
 ```

@@ -445,7 +445,7 @@ Download the vLLM cluster deployment script on all nodes. This script orchestrat

 ```bash
 ## Download on all nodes
-wget https://raw.githubusercontent.com/vllm-project/vllm/refs/heads/main/examples/online_serving/run_cluster.sh
+wget https://raw.githubusercontent.com/vllm-project/vllm/refs/heads/main/examples/ray_serving/run_cluster.sh
 chmod +x run_cluster.sh
 ```
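
For reference, a minimal sketch of how the updated note might translate into a launch command. It only prints the command and does not start vLLM. The helper name `build_serve_cmd` and the name patterns it matches are assumptions made for illustration and are not part of the playbook. `vllm serve` and `--trust-remote-code` are the vLLM CLI entry point and the flag named in the note.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not part of the playbook): prints the vLLM launch
# command for a model, appending --trust-remote-code where the note requires it.
build_serve_cmd() {
  local model="$1"
  local cmd="vllm serve ${model}"
  case "$model" in
    # Patterns are assumptions derived from the model names in the note.
    *Phi-4-multimodal-instruct*|*Nemotron-3-Nano*)
      cmd="${cmd} --trust-remote-code"
      ;;
  esac
  printf '%s\n' "$cmd"
}

# Print the command for the Nemotron3-Nano FP8 checkpoint listed in the table.
build_serve_cmd "nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8"
```

Models that match neither pattern get the plain `vllm serve <model>` command, so the flag is only added where the note asks for it.
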
Author	SHA1	Message	Date
Ago	29d1c044f1	Merge `f75d5817aa` into `51615570a7`	2026-05-22 09:17:56 +08:00
GitLab CI	51615570a7	chore: Regenerate all playbooks	2026-05-18 17:50:57 +00:00
agolajko	f75d5817aa	nemotron reqs --trust-remote-code for vllm setup	2026-01-26 08:16:23 -08:00